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PREFACE 

THE  time  which  has  elapsed  since  the  publication  of  the  first  edition  of 
this  treatise  has  been  a  period  of  great  activity  in  the  development  of  the 
Theory  of  Functions  of  a  real  variable.  In  particular,  the  introduction  of  the 
Lebesgue  Integral,  which  was  new  in  1907,  has  since  produced  its  full  effect, 
in  the  generalization  of  the  Theory  of  Integration,  and  upon  the  theory  of  the 
representation  of  functions  by  means  of  Fourier's,  and  other,  series  and  integrals. 

In  order  to  give  an  adequate  account  of  the  subject  in  its  present  condition, 
a  large  amount  of  new  matter  has  had  to  be  introduced;  and  this  has  made  it 
necessary  to  divide  the  treatise  into  two  volumes.  The  matter  contained  in 
the  first  edition  has  been  carefully  revised,  amplified,  and  in  many  cases  re- 
written. 

The  parts  of  the  subject  which  were  dealt  with  in  the  first  five  chapters 
of  the  first  edition  have  been  expanded  into  the  eight  chapters  of  the  present 
first  volume  of  the  new  edition.  With  a  view  to  greater  unity  of  treatment  of 
the  Theory  of  Integration,  some  theorems  which  appeared  in  Chapter  VI,  of 
the  first  edition,  have  however  been  included  in  the  present  volume.  A  con- 
siderable part  of  the  Theory  of  Integration,  in  relation  to  series  and  sequences, 
still  however  remains  for  treatment  in  Volume  n. 

On  controversial  matters  connected  with  the  fundamentals  of  the  Theory 
of  Aggregates,  the  considerable  diversity  of  opinion  which  has  arisen  amongst 
Mathematicians  has  been  taken  into  account,  but  in  general  no  attempt  has 
been  made  to  give  dogmatic  decisions  between  opposed  opinions.  In  view  of  the 
delicate  questions  which  arise  as  to  the  legitimacy  and  meaning  of  the  axiom 
known  as  the  Multiplicative  Axiom,  or  as  the  Principle  of  Zermelo,  the  policy 
has  been  adopted  of  so  framing  the  proofs  of  theorems  as  to  avoid  an  appeal 
to  the  axiom,  whenever  that  course  appeared  to  be  possible;  in  other  cases, 
the  necessity  for  the  employment  of  the  axiom  has  been  pointed  out. 

Ample  references  to  sources  of  information  are  given  throughout,  but  such 
references  do  not  provide  the  means  for  compiling  a  complete  list  of  writings 
on  the  subject.  No  attempt  has  been  made  to  settle  questions  of  priority  of 
discovery. 

My  thanks  are  due  to  Dr  H.  F.  Baker,  F.R.S.,  Lowndean  Professor  of 
Astronomy  and  Geometry,  in  the  University  of  Cambridge,  who  has  kindly 
read  nearly  all  the  proofs  as  they  passed  through  the  Press. 

E.  W.  HOBSON. 

CHRIST'S  COLLEGE,  CAMBRIDGE. 
October  10,  1920. 
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theory  of  functions  of  a  real  variable,  as  developed  during  the  last  few 
J-  decades,  is  a  body  of  doctrine  resting,  first  upon  a  definite  conception  of  the 
arithmetic  continuum  which  forms  the  field  of  the  variable,  and  which  includes 
a  precise  arithmetic  theory  of  the  nature  of  a  limit,  and  secondly,  upon  a  defi- 
nite conception  of  the  nature  of  the  functional  relation.  The  procedure  of  the 
theory  consists  largely  in  the  development,  based  upon  precise  definitions,  of 
a  classification  of  functions,  according  as  they  possess,  or  do  not  possess,  certain 
peculiarities,  such  as  continuity,  differentiability,  &c.,  throughout  the  domain 
of  the  variable,  or  at  points  forming  a  selected  set  contained  in  that  domain. 
The  detailed  consequences  of  the  presence,  or  of  the  absence,  of  such  peculiarities 
are  then  traced  out,  and  are  applied  for  the  purpose  of  obtaining  conditions 
for  the  validity  of  the  processes  of  Mathematical  Analysis.  These  processes, 
which  have  been  long  employed  in  the  so-called  Infinitesimal  Calculus,  consist 
essentially  in  the  ascertainment  of  the  existence,  and  in  the  evaluation,  of  limits, 
and  are  subject,  in  every  case,  to  restrictive  assumptions  which  are  necessary 
conditions  of  their  validity.  The  object  to  be  obtained  by  the  theory  of  func- 
tions of  a  real  variable  consists  then  largely  in  the  precise  formulation  of 
necessary  and  sufficient  conditions  for  the  validity  of  the  limiting  processes  of 
Analysis.  A  necessary  requisite  in  such  formulation  is  a  language  descriptive 
of  particular  aggregates  of  values  of  the  variable,  in  relation  to  which  functions 
possess  definite  peculiarities.  This  language  is  provided  by  the  Theory  of  Sets 
of  Points,  also  known,  in  its  more  general  aspects,  as  the  Theory  of  Aggregates, 
which  contains  an  analysis  of  the  peculiarities  of  structure  and  of  distribution  in 
the  field  of  the  variable  which  such  sets  of  points  may  possess.  This  theory,  which 
had  its  origin  in  the  exigencies  of  a  critical  theory  of  functions,  and  has  since 
received  wide  applications,  not  only  in  Pure  Analysis,  but  also  in  Geometry, 
must  be  regarded  as  an  integral  part  of  the  subject.  A  most  important  part 
of  the  theory  of  functions  is  the  theory  of  the  representation  of  functions  in  a 
prescribed  manner,  especially  by  means  of  series  or  sequences  of  functions  of 
prescribed  types.  Much  progress  has  recently  been  made  in  this  part  of  the  sub- 
ject, results  having  been  obtained  which  have  led  to  a  classification  of  functions 
in  accordance  with  the  modes  of  representation  of  which  they  are  capable.  The 
special  case  of  the  conditions  of  representability  of  functions  by  means  of 
trigonometrical  series  was  historically  the  starting-point  in  which  a  great 
part  of  the  modern  development  of  the  theory  of  functions  of  a  real  variable 
had  its  origin. 
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The  course  of  study,  of  which  the  present  treatise  is  the  outcome,  followed 
an  order  very  similar  to  the  historical  order  in  which  the  subject  was  developed. 
Commencing  with  the  study  of  Fourier's  series,  in  their  application  to  the 
problems  of  Mathematical  Physics,  and  provided  with  a  knowledge  of  the 
Differential  and  Integral  Calculus,  of  the  traditional  kind  in  which  notions  of 
the  nature  of  continuity  and  of  limits  founded  on  an  uncritical  use  of  intuitions 
of  space  and  time  are  the  stock  in  trade,  I  was  led,  by  the  difficulties  connected 
with  the  theory  of  these  series,  and  through  an  attempt  to  understand  the 
literature  which  deals  with  them,  to  a  study  of  the  theories  of  real  number, 
due  to  Cantor  and  Dedekind,  and  to  that  of  the  theory  of  sets  of  points.  A 
study  of  the  foundations  of  the  Integral  Calculus,  and  of  the  general  theory  of 
functions  of  a  real  variable  formed  the  natural  continuation  of  the  course.  The 
present  work  has  been  written  with  the  object  of  presenting  in  a  connected  form, 
and  of  thus  rendering  more  easily  accessible  than  hitherto,  the  chief  results 
which  are  to  be  found  scattered  through  a  very  large  number  of  memoirs, 
periodicals,  and  treatises.  I  have  endeavoured,  as  far  as  possible,  to  fill  up  gaps 
in  the  various  theories  which  occur  in  different  parts  of  the  subject.  The  proofs 
of  theorems  have  in  many  cases  been  simplified,  often  in  accordance  with  deve- 
lopments of  the  theory  later  in  date  than  the  original  proofs;  other  theorems 
have  been  given  in  a  form  more  general  than  that  in  which  they  were  first 
discovered.  In  the  literature  of  the  subject,  errors  are  not  infrequent,  largely 
owing  to  the  fact  that  spatial  intuition  affords  an  inadequate  corrective  of  the 
theories  involved,  and  is  indeed  in  some  cases  almost  misleading.  Although  I 
have  made  every  endeavour  to  attain  to  accuracy  both  in  form  and  in  substance, 
it  is  practically  certain  that  the  present  work  will  form  no  exception  to  the  rule  of 
fallibility.  Where  I  have  called  attention  to  what  I  regard  as  inadequate  state- 
ments or  errors  on  the  part  of  other  writers,  I  have  done  so  solely  for  the  pur- 
pose of  directing  the  attention  of  students  to  the  points  in  questions,  and  with 
full  consciousness  that,  at  least  in  some  cases,  close  examination  might  shew 
that  what  appeared  to  me  to  be  erroneous  was  rather  due  to  some  misapprehen- 
sion on  my  part  of  the  meaning  of  the  writers  to  whom  reference  is  made.  On 
some  points  connected  with  the  theory  of  aggregates,  which  are  at  present 
matters  of  controversy,  I  have  expressed  definite  opinions,  although  I  fully 
recognize  that,  on  such  matters,  a  dogmatic  attitude  of  mind  is  at  the  present 
time  wholly  out  of  place,  and  not  unlikely  to  be  avenged  when  the  points 
concerned  are  finally  settled  to  the  general  satisfaction  of  mathematicians. 

Chapter  I  contains  a  discussion  of  Number,  and  includes  a  full  account  of 
the  theories  of  Real  Number,  due  to  Cantor  and  Dedekind.  Whilst  an  indica- 
tion has  been  given  of  the  fundamental  notions  upon  which  the  conceptions  of 
cardinal  and  ordinal  numbers  rest,  I  have  not  attempted  to  reduce  these  fun- 
damental notions  to  a  minimum  of  indefinables  from  which  the  whole  theory 
might  be  deduced  by  means  of  formal  logic.  A  slight  perusal  of  the  extremely 
extensive  literature  of  the  Philosophy  of  Arithmetic  will  shew  that  any  such 
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attempt  could  only  have  been  made  by  entering  upon  a  prolonged  discussion 
of  a  philosophical  character,  wholly  unsuited  to  a  treatise  of  professedly  mathe- 
matical complexion,  and  that  any  views  expressed  would  have  but  little  pros- 
pect of  giving  general  satisfaction  to  logicians  and  philosophers.  The  modern 
theory  of  Real  Numbers  has  been  the  object  of  much  criticism  by  philosophers 
and  others.  It  has  been  represented  that  the  modern  extension  of  the  notion 
of  number  to  the  case  of  irrational  numbers  is  a  sophistical  attempt  to  obliterate 
the  fundamental  distinction  between  the  discrete  and  the  continuous.  I  venture 
to  think  that  such  objections  consist,  in  large  part  at  least,  of  criticisms  of  the 
current  terminology  of  the  mathematical  theories,  especially  in  respect  of  the 
extensions  of  the  use  of  the  word  "number,"  and  I  think  it  probable  that  many 
of  these  criticisms  would  not  survive  a  fair  examination  of  the  theories  them- 
selves apart  from  the  language  in  which  they  are  expressed.  An  appropriate 
terminology,  although  a  matter  of  convention,  is  no  doubt  a  very  important 
matter  in  relation  to  such  fundamental  matters,  as  it  is  conducive  to  clearness 
of  thought;  but  the  substance  of  the  theories  is  of  incomparably  greater  im- 
portance than  the  forms  in  which  they  are  expressed,  and  those  theories  may 
be  found  on  examination  to  be  essentially  sound,  even  if  their  terminology  be 
regarded  as  in  some  respects  defective. 

Chapter  n  contains  an  exposition  of  the  theory  of  sets  of  points,  and  in- 
cludes an  account  of  transfinite  cardinal  and  ordinal  Arithmetic,  of  a  somewhat 
simpler  and  less  general  character  than  will  be  met  with  in  the  treatment  of 
the  general  theory  of  aggregates,  in  Chapter  ill.  Students  who  do  not  care  to 
embark  upon  the  discussions  in  Chapter  III  will  find  a  study  of  Chapter  II 
amply  sufficient  to  enable  them  to  apply  the  ideas  there  developed  in  the 
general  theory  of  functions.  A  slight  account  only  has  been  given  of  the  pro- 
perties of  plane  sets  of  points.  An  account  of  the  important  recent  investigations 
which  had  their  origin  in  Jordan's  theorem,  that  a  closed  curve  divides  plane 
space  into  two  regions,  would  have  occupied  more  space  than  was  at  my 
disposal.  This  omission  will  be  less  felt  than  might  have  been  the  case,  were 
not  an  excellent  account  of  this  subject  to  be  found  in  Dr  W.  H.  Young's 
treatise  on  the  theory  of  sets  of  points,  which  has  appeared  since  this  portion 
of  the  present  work  was  printed. 

In  Chapter  IV,  there  will  be  found  a  discussion  of  the  main  properties  of 
functions,  in  relation  to  continuity,  discontinuity,  &c.,  and  investigations  of 
the  properties  of  important  classes  of  functions.  Although  the  treatise  is 
mainly  one  of  functions  of  a  single  variable,  a  considerable  amount  of  space 
has  been  devoted  to  the  consideration  of  functions  of  two  variables,  not  only  on 
account  of  the  intrinsic  importance  of  that  subject,  but  because  no  adequate 
consideration  of  the  properties  of  functions  of  a  single  variable  is  possible  with- 
out the  use  of  functions  of  two  variables,  as  is  seen,  for  example,  from  the  con- 
sideration that  a  function  defined  by  means  of  a  sequence  of  functions  of  a 
single  variable  is  virtually  defined  as  a  limit  of  a  function  of  two  variables. 
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The  foundations  of  the  Integral  Calculus,  as  based  upon  Riemann's  defini- 
tion of  a  definite  integral,  and  its  extensions,  are  discussed  in  Chapter  V, 
where  an  account  of  the  development  of  the  subject  from  the  point  of  view  of 
Lebesgue's  new  definition  of  the  definite  Integral  is  also  given.  In  later  parts 
of  the  book  I  have  introduced  extensions  of  Lebesgue's  definition,  to  the  cases 
of  improper  integrals,  taken  over  finite  or  infinite  domains,  regarded  as  the 
limits  of  sequences  of  Lebesgue  integrals. 

Chapter  vi  is  concerned  with  functions  defined  as  the  limits  of  sequences 
of  functions,  and  contains  an  account  of  the  principal  properties  of  functions 
represented  by  series,  and  a  discussion  of  important  matters  connected  with 
the  modes  of  convergence  of  series  through  whole  intervals,  or  in  the  neigh- 
bourhood of  particular  points.  Various  matters  relating  to  the  processes  of 
the  Integral  Calculus,  which  had  not  been  considered  in  Chapter  v,  are  here 
dealt  with,  because  their  adequate  treatment  presupposes  a  knowledge  of  the 
theorems  relating  to  the  convergence  of  sequences  of  functions.  An  account 
of  the  very  general  results  recently  obtained  by  Baire,  relating  to  the  repre- 
sentability  of  functions  by  means  of  series,  will  be  found  in  this  Chapter. 

Chapter  vn  is  devoted  to  the  theory  of  Fourier's  series.  No  apology  is 
needed  for  the  selection  of  this  particular  mode  of  representation  of  functions 
for  full  discussion  in  a  treatise  on  the  theory  of  functions  of  a  real  variable,  in 
view  of  the  historical  relation  of  Fourier's  series  to  the  development  of  the 
general  theory.  The  history  of  the  theory  of  Fourier's  series  is  exceedingly 
instructive,  not  merely  from  the  point  of  view  of  the  mathematician,  but  also 
from  that  of  the  epistemologist.  I  have  therefore  endeavoured,  in  my  treat- 
ment of  the  subject,  to  preserve  as  much  of  the  historical  element  as  was 
possible  in  an  account  which  should  contain,  in  a  moderate  compass,  not  only 
indications  of  the  various  stages  of  development  of  the  subject,  but  also  the 
most  recent  results  that  have  been  obtained.  I  have  made  full  use  of  the 
greater  generality  which  can  be  introduced  into  many  of  the  known  results 
by  means  of  the  employment  of  the  theory  of  integration  developed  by 
Lebesgue. 

In  the  preparation  of  the  work,  the  treatises  from  which  I  have  most  largely 
drawn  information  are  the  German  edition  of  Dini's  treatise  on  the  subject, 
Stolz's  Grundziige  der  Differential-  und  Integral- Rechnung,  Schoenflies'  Bericht 
entitled  "Die  Entwickelung  der  Lehre  von  den  Punktmannigfaltigkeiten,"  and 
the  various  treatises  on  different  parts  of  the  subject  by  Borel  and  Lebesgue. 
I  have  consulted  a  very  large  number  of  memoirs,  articles,  notes,  and  books. 
far  too  numerous  to  be  here  particularized.  In  respect  to  the  references 
given  throughout  the  book,  I  wish  it  to  be  understood  that  I  have  made  no 
attempt  to  settle  questions  of  priority  of  discovery.  The  references  given 
are  to  be  regarded  solely  as  indicating  sources  of  information  from  which 
I  have  drawn,  or  where  more  detailed  information  on  the  various  topics  is 
to  be  found. 
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I  owe  a  debt  of  gratitude  to  my  friend  Mr  J.  W.  Sharpe,  formerly  Fellow 
of  Gonville  and  Caius  College,  who  has  read  with  the  greatest  care  the  proofs 
of  about  two-thirds  of  the  book.  Many  points  of  difficulty  I  have  fully  discussed 
with  him;  many  obscurities  of  expression  have  been  removed,  and  many  im- 
provements in  substance  have  been  made,  owing  to  the  care  he  has  bestowed 
in  reading  the  proofs.  I  felt  it  as  a  great  loss  when,  owing  to  a  temporary 
failure  of  health,  he  was  unable  to  continue  his  laborious  work.  To  Dr  H.  F. 
Baker,  F.R.S.,  Fellow  of  St  John's  College,  and  Cayley  Lecturer  in  Mathe- 
matics, who  has  kindly  read  some  of  the  earlier  proofs,  I  owe  several  valuable 
suggestions.  On  several  points  connected  with  the  treatment  of  Number  in 
Chapter  I,  I  have  had  the  advantage  of  consulting  Dr  James  Ward,  F.B.A., 
Fellow  of  Trinity  College,  and  Professor  of  Mental  Philosophy  and  Logic  in 
the  University. 

My  thanks  are  due  to  the  officials  of  the  University  Press  for  the  readiness 
with  which  they  have  met  my  views,  and  for  the  care  which  they  have  bestowed 
upon  the  work  connected  with  the  printing.  I  desire  especially  to  express  my 
sense  of  the  value  of  the  excellent  work  done  by  the  readers  of  the  Press;  to 
their  care  is  due  the  elimination  of  many  typographical  and  other  blemishes 
which  would  otherwise  have  remained  undetected. 


E.  W.  HOBSON. 


CHRIST'S  COLLEGE,  CAMBRIDGE. 
May  15,  1907. 
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CORRIGENDA 

Page  3.  In  line  9  from  the  foot  of  the  page,  for  "  Every  part  of  the  aggregate  "  read  "  Every  part 
of  the  aggregate  which  contains  more  than  one  element." 

In  line  3  from  the  foot  of  the  page,  after  " Ml  a  part  of  it"  insert  "containing  more 
than  one  element. " 

Page  22.   In  the  first  note,  for  "  Burmann  "  read  "  Biermann." 
Page  74.   In  the  first  note,  for  "Bord"  read  "  Borel." 

Page  118.  In  example  2,  line  7,  for  "  Of  each  rational  number,  there  is  a  double  representation" 
read  "  Of  each  rational  number,  not  represented  by  a  recurring  radix-fraction,  there 
is  a  double  representation." 

Page  123.    In  the  last  line  of  the  note,  for  "  G.  H."  read  "  W.  H." 

Page  125.  Line  7  from  the  foot  of  the  page  "  It  will  be  shewn... ."  This  statement  is  incorrect, 
see  page  200. 

Page  152.   In  line  20,  for  "  W.  N."  read  "  G.  N." 


CHAPTER  I 

• 

NUMBER 

1.    THE  operation  of  counting,  in  which  the  integral  numbers  are  employed, 
I    can  be  carried  out  by  a  mind  to  which  discrete  objects,  which  may  be  either 
physical  or  ideal*,  are  presented,  and  which  possesses  certain  fundamental 
notions  which  we  proceed  to  specify. 

(1)  The  notion  of  unity,  a  form  under  which  an  object  is  conceived  when 
it  is  regarded  as  a  single  one.   An  object  so  regarded  may  be  either  of  a  material 
or  of  a  purely  abstract  or  ideal  nature,  and  may  be  recognized,  for  all  other 
purposes  than  that  of  counting,  as  possessing  any  degree  of  complexity.    It  is 
sufficient,  in  order  that  the  object  may  be  regarded  under  the  form  of  unity, 
that  it  be  so  far  distinct  from  other  objects,  as  to  be  recognized  at  the  time 
when  it  is  counted,  as  discrete  and  identifiable.     What  external  marks  are 
necessary  that  an  object  may  be  so  recognized  as  discrete,  is  a  matter  for  the 
judgment  of  the  mind  at  the  time  when  the  object  is  counted.    The  unity 
under  which  the  object  is  apprehended  is  a  formal  or  logical,  rather  than  a 
natural  unity ;  it  is  more  or  less  arbitrarily  attributed  to  the  object  by  the 
mind. 

(2)  The  notion  of  a  collection  or  aggregate  of  objects,  which  is  conceived 
of  as  containing  more  or  fewer  objects,  or  as  possessing  a  greater  or  less  degree 
of  plurality.    A  group  of  objects  regarded  as  an  aggregate  is  conceived  of,  not 
merely  as  a  plurality  of  objects  to  each  of  which  unity  is  ascribed  as  in  (1), 
but  also  as  itself  an  object  to  which  unity  is  ascribed  when  it  is  regarded  as 
a  single  whole.    The  single  objects  of  which  the  aggregate  is  composed  may 
be  spoken  of  as  the  elements  of  the  aggregate  ;  such  elements  need  not  possess 
any  parity  as  regards  size  or  any  other  special  quality,  but  may  be  of  the 
most  diverse  characters :  a  certain  logical  parity  is  however  ascribed  to  them 
in  the  process  of  counting,  in  virtue  of  the  fact  that  each  of  them  is  regarded 
as  a  single  object.    A  sensibly  continuous  presentation  cannot  be  regarded  as 
an  aggregate  containing  a  plurality  of  elements,  until  the  mind  has  recognized 
in  it  sufficiently  distinct  lines  of  division  to  serve  the  purpose  of  marking  oft* 

*  It  is  held  by  some  authors  that  the  operation  of  counting  is  primarily  applicable  to  physical 
objects  only.  Thus,  J.  S.  Mill  writes :—  "  The  fact  asserted  in  the  definition  of  a  number  is  a 
physical  fact.  Each  of  the  numbers,  two,  three,  four,  etc.,  denotes  physical  phenomena,  and  con- 
notes a  physical  property  of  these  phenomena."  See  Logic,  9th  edition,  vol.  n,  p.  150.  That 
objects  which  are  not  physical,  can  be  counted,  was  maintained  by  Leibnitz  and  by  Locke.  See 
also  Frege's  Grundlagen  der  Arithmetic,  Breslau,  1884,  where  an  account  is  given  of  various  views 
as  to  the  nature  and  origin  of  the  idea  of  Number. 
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distinct  objects  within  it,  the  totality  of  which  makes  up  the  whole  presenta- 
tion ;  for  instance,  the  history  of  a  country  could  be  regarded  as  an  aggregate 
of  distinct  periods,  only  when  sufficiently  salient  features  had  -been  recognized 
in  that  history  to  warrant  a  judgment  that  periods  were  to  be  found  in  it, 
each  of  which  had  a  sufficient  degree  of  discreteness  to  be  subsumed  under  the 
form  of  unity.  In,  actual  counting,  the  aggregate  is  not  necessarily  determinate 
before  the  counting  is  commenced,  but  becomes  so  when  the  process  is  com- 
pleted ;  the  notion  of  an  aggregate  is  thus  still  necessary  to  the  process  of 
counting,  if  the  process  is  ever  to  come  to  an  end,  or  to  be  conceived  of  as 
having  come  to  an  end. 

It  has  been  held*  that,  when  an  aggregate  is  counted,  the  elements  must 
remain  distinct  from  one  another,  not  disappearing  or  combining  with  each 
other  during  the  process.  That  this  condition  is  unnecessary  may  be  seen, 
for  example,  by  considering  the  case  of  counting  breakers  on  the  sea-shore, 
or  that  of  counting  the  vibrations  of  a  pendulum ;  thus  no  physical  perma- 
nence, but  only  an  ideal  one,  is  necessary. 

A  discussion  of  the  characteristics  which  an  aggregate  (not  necessarily 
finite)  must  possess,  in  order  that  it  may  be  an  object  of  mathematical  thought, 
will  be^given  in  Chapter  iv. 

(3)  The  notion  of  order,  in  virtue  of  which  relative  rank  is  given  to  each 
object  in  a  collection,  so  that  the  collection  becomes  an  ordered  aggregate.   In 
actual  counting,  the  order  is  assigned  to  the  objects  during  the  process  itself, 
as  an  order  in  time,  and  this  may  be  done  in  an  arbitrary  manner;  the  order 
of  the  elements  in  an  aggregate  may,  however,  be  assigned  in  a  manner 
dependent  upon  their  sizes,  weights,  or  other  qualities,  or  in  accordance  with 
their  positions  in  space.    Order  may,  however,  be  regarded  as  an  abstract 
conception,  independent  of  a  particular  mode  of  ordering ;  for  an  aggregate 
to  be  an  ordered  one,  it  is  necessary  that  in  some  manner  or  other,  each 
element  be  recognized  as  possessing  a  certain  rank,  in  virtue  of  which  it  is 
known  as  regards  any  two  elements  which  may  be  chosen,  which  of  them  has 
the  lower,  and  which  the  higher  rank.    An  element  is  said  to  precede  any 
other  element  of  higher  rank  than  itself. 

(4)  The  notion  of  correspondence,  which  underlies  the  process  of  tallying. 
The  elements  of  one  aggregate  may  be  made  to  stand  in  some  logical  rela- 
tion with  those  of  another  one,  so  that  a  definite  element  of  one  aggregate  is 
regarded  as  correspondent  to  a  definite  element  of  another  aggregate. 

The  correspondence  may  be  complete,  in  the  sense  that,  to  every  element 
of  either  aggregate  there  corresponds  one  element,  and  one  only,  of  the  other 
aggregate;  or  the  correspondence  may  be  incomplete,  in  which  case  one  of 
the  aggregates  has  one  or  more  elements  to  which  no  elements  in  the  other 

*  See  Helmholtz's  Ziihlen  und  Mennen,  Leipzig,  1887;  Wiatens.  Abhandl.  vol.  ni,  p.  372. 
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aggregate  correspond.  In  the  latter  case  we  say  that  the  aggregate  with  the 
superfluous  element  or  elements  contains  more  elements  than  the  other  aggre- 
gate, and  that  the  latter  contains  fewer  elements  than  the  former. 

A  correspondence  between  two  aggregates  is  defined  when  specifications 
or  rules  are  laid  down  which  suffice  to  decide'which  element  of  one  aggregate 
corresponds  to  each  element  of  the  other;  so  that,  in  the  case  of  complete 
correspondence,  no  element  of  either  aggregate  is  without  a  correspondent  one 
in  the  other. 

Whether,  or  how  far,  these  fundamental  notions  of  unity,  aggregate,  order, 
and  correspondence  should  be  regarded  as  derived  empirically  from  experience, 
by  a  process  of  abstraction,  or  whether  it  must  be  held  that  they  are  original 
forms  which  the  mind  possesses  prior  to,  and  as  the  necessary  conditions  of 
the  possibility  of  such  experience,  are  questions  into  which  it  is  beyond  our 
province  to  enter.  It  is  certain  that  civilized  man  possesses  these  funda- 
mental notions,  and  it  is  highly  probable  that  primitive  man  possessed  them 
long  before  the  notion  of  abstract  number  had  appeared  in  an  explicit 
and  developed  form.  The  investigation  of  the  origin  of  these  notions, 
and  their  further  analysis,  are  matters  for  the  Psychologist  and  for  the 
Philosopher.  Mathematical  Science,  as  any  other  special  science,  must  take  its 
fundamental  notions  as  data ;  it  is  concerned  with  the  analysis  of  them,  only 
so  far  as  suffices  to  establish  that  they  possess  the  degree  of  definiteness 
which  such  data  must  have,  if  they  are  to  lie  at  the  base  of  a  logically  ordered 
system. 

ORDINAL  NUMBERS. 

2.  If  from  an  ordered  aggregate  some  of  the  elements  are  removed,  the 
aggregate  which  remains  is  said  to  be  a  part  of  the  original  aggregate.  It 
will  be  observed  that  the  relative  order  of  any  two  elements  in  the  part  is  the 
same  as  the  relative  order  of  those  elements  in  the  original  aggregate. 

An  ordered  aggregate  is  said  to  be  finite  when  it  satisfies  the  following 
conditions : 

(1)  There  is  one  element  which  has  lower  rank  than  any  of  the  others. 

(2)  There  is  one  element  which  has  higher,  rank  than  any  of  the  others. 

wK«c'>     '-fr^fA't\\    <^*"vP\.t    nw»ax    P~  r.£    .V.A ', ftjM^'1 

(3)  Every  part  of  the  aggregateAhas  an  element  whicn  has  nigner'rank  than 

every  other  element  in  the  part,  and  also  it  has  an  element  which  has  lower 
rank  than  any  other  element  in  the  part. 

These  conditions  are  equivalent  to  the  statement  that  a  finite  aggregate, 
and  also  each  part  of  it,  has  a  first  and  a  last  element. 

Every  part  of  a  finite  ordered  aggregate  is  also  a  finite  ordered  aggregate. 

If  M  be  the  aggregate,  and  Ml  a  part  of  it^then  Ml  has  a  highest  and  a 
lowest  element ;  also  every  part  of  Mlt  being  also  a  part  of  M,  has  a  lowest  and 
a  highest  element ;  therefore  M^  is  itself  finite. 

1  —  2 
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3.  Two  finite  ordered  aggregates  are  said  to  be  similar  when  they  can  be 
made  completely  to  correspond,  so  that  to  each  element  of  either  of  them  there 
corresponds  a  single  element  of  the  other,  and  so  that  to  any  two  elements 
P,  Q  of  the  one  there  correspond  two  elements  P',  Q'  of  the  other,  which  have 
the  same  relation  as  regards  rank ;  viz.  that  if  P  is  of  lower  rank  than  Q, 
then  P'  is  of  lower  rank  than  Q',  and  if  P  is  of  higher  rank  than  Q,  then  P'  is 
of  higher  rank  than  Q'. 

Two  finite  ordered  aggregates  which  are  similar  are  said  to  have  the  same 
ordinal  number. 

If  each  of  two  ordered  aggregates  is  similar  to  a  third,  they  are  similar  to 
one  another.  For  if  an  element  P  -of  the  first  corresponds  to  an  element  R  of 
the  third,  and  the  element  Q  of  the  second  corresponds  to  R,  it  is  clear  that 
if  we  make  P  correspond  to  Q,  the  first  two  aggregates  are  made  to  correspond 
in  such  a  way  that  the  relative  order  is  preserved. 

It  thus  appears  that  an  ordinal  number  is  characteristic  of  a  class  of  similar 
ordered  aggregates. 

An  aggregate  which  consists  of  a  single  element  A  is  said  to  have  the 
ordinal  number  one,  denoted  by  the  symbol  1.  The  ordinal  number  1  is 
characteristic  of  every  aggregate  which  consists  of  a  single  element. 

If,  to  the  aggregate  which  consists  of  an  element  A,  we  adjoin  a  new 
element  B,  and  assign  to  B  a  higher  rank  than  A,  we  obtain  an  aggregate 
(A,  B)  which  has  an  ordinal  number  2,  characteristic  of  all  aggregates  which 
are  formed  in  this  manner ;  A  is  said  to  be  the  first  element,  B  the  second. 
If  to  an  ordered  aggregate  (A,  B),  of  which  the  ordinal  number  is  2,  we  adjoin 
another  element  C,  and  regard  this  as  having  higher  rank  than  A  and  B,  we 
obtain  an  ordered  aggregate  (A,  B,  C),  of  which  the  ordinal  number  is  called 
3,  and  is  characteristic  of  all  ordered  aggregates  formed  in  this  manner. 
Proceeding  in  this  way,  if  we  have  formed  an  ordered  aggregate  (A,B,C,...  H), 
of  which  the  ordinal  number  is  n,  and  adjoin  to  this  aggregate  a  new  element 
K,  we  obtain  a  new  aggregate  (A,  B,  C,  ...  H,  K),  of  which  the  ordinal  number 
•n  is  different  from  n. 

Any  ordered  aggregate  which  is  formed  in  the  manner  described  is  finite. 

This  can  be  proved  by  induction.  Let  us  assume  that  M  is  a  finite  ordered 
aggregate  :  it  will  then  be  proved  that  (M,  e),  the  ordered  aggregate  obtained 
by  adjoining  an  element  e  of  higher  rank  than  the  elements  of^Jf,  is  also 
finite.  Since  M  has  a  lowest  element,  (M,  e}  has  the  same  lowest  element, 
also  (M,  e}  has  a  highest  element  e.  Again  if  Mj  is  a  part  of  (M,  e)  which 
does  not  contain  e,  then  M^  is  a  part  of  M,  and  therefore  has  a  highest  and 
a  lowest  element.  If  Ml  is  a  part  of  (M,  e}  which  contains  e,  let  it  be  (M.2>  e), 
where  M2  is  a  part  of  M,  and  therefore  contains  a  lowest  element  which  is 
also  the  lowest  element  of  (Mz,  e) ;  also  (MZ)  e)  contains  a  highest  element  e. 
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It  has  thus  been  shewn  that  (M,  e)  satisfies  the  requisite  conditions  that 
it  should  be  finite,  provided  M  does  so.  The  aggregates  A, (A,  B)  are  clearly 
finite :  hence  the  method  of  induction  proves  that  every  ordered  aggregate 
which  can  be  formed  by  continually  adjoining  new  elements  to  an  aggregate 
which  originally  contained  one  element  is  a  finite  one. 

Conversely,  it  can  be  shewn  that  every  finite  ordered  aggregate  can  be 
formed  in  the  manner  above  described. 

Let  M  be  a  finite  ordered  aggregate,  and  let  e'  be  its  highest  element,  thus 
M=(M1}e').  Now  Jl/j,  being  a  part  of  M,  has  a  highest  element  e",  thus 
Ml  =  (M»,  e"},  or  M=(M2,  e",  e').  Proceeding  in  this  manner,  if  we  do  not 
reach  an  aggregate  Mr  which  contains  a  single  element  only,  we  shall  have 

found  a  part  ( e'",  e",  e,  e)  of  M  which  has  no  element  of  lowest  rank. 

But  this  is  impossible,  since  M  is  by  hypothesis  finite,  and  therefore  contains 
no  part  without  a  lowest  element.  It  has  thus  been  shewn  that  M  can  be 
reduced,  in  the  manner  indicated,  to  an  aggregate  with  a  single  element :  and 
conversely,  starting  with  this  latter  aggregate,  M  is  obtained  by  adjoining  to 
it  successively  new  elements. 

A  finite  ordered  aggregate  is  not  similar  to  any  part  of  itself. 

This  theorem  may  also  be  proved  by  induction.  For  if  we  assume  that  the 
finite  ordered  aggregate  M  is  not  similar  to  any  part  of  itself,  it  can  be  shewn 
that  the  same  holds  for  (M,  e).  If  possible  let  Ml  be  a  part  of  (M,  e)  which  is 
similar  to  (M,  e);  then  if  Ml  contains  e,  it  must  be  of  the  form  (M2,  e},  and  if 
(M2,  e)  is  similar  to  (M,  e),  Mz  must  be  similar  to  M,  which  is  contrary  to 
the  hypothesis  that  M  contains  no  part  similar  to  itself.  If  Ml  does  not 
contain  e,  it  must  be  of  the  form  (3/2,/),  where /is  the  element  which  corre- 
sponds to  e  in  (M,  e)\  in  this  case  again  M«  is  similar  to  M ,  and  is  a  part  of- 
it;  thus  we  have  again  a  contradiction.  The  theorem  holds  for  (A,  B),  and 
therefore  generally. 

It  follows  from  this  theorem  that  the  ordinal  numbers  1,  2, 3, ...  which  have 
been  defined  as  the  ordinal  numbers  of  aggregates  (A),  (A,  B),  (A,  B,  (7),  ... 
are  all  different  from  one  another,  for  each  of  these  aggregates  being  a  part 
of  each  of  those  which  follow  it,  cannot  be  similar  to  any  of  the  aggregates 
which  follow  it. 

Each  of  the  ordinal  numbers  is  to  be  regarded  as  a  unique  ideal  object 
in  that  it  is  a  permanent  object  for  thought.  The  relation  of  an  ordinal 
number  to  an  ordered  aggregate  of  objects  which  is  characterised  by  that 
number,  may  be  illustrated  by  the  analogy  of  the  relation  between  the  colour 
red,  and  a  particular  red  object. 

4.  A  simply  infinite  ascending  aggregate,  or  simple  sequence,  is  an  ordered 
aggregate  which  has  no  element  of  higher  rank  than  all  the  others,  and  is 
such  that  every  part  which  has  an  element  of  higher  rank  than  all  the  other 
elements  in  that  part  is  a  finite  ordered  aggregate. 
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The  fundamental  assumption  must  be  made  that  such  an  aggregate  may 
be  regarded  as  a  definite  object  which  possesses  certain  properties  that  can 
be  formulated.  The  justification  for  the  assumption  is  to  be  found  in  the 
fact  that  no  contradiction  arises  in  the  theory  based  on  it. 

It  follows  from  this  definition  that,  in  a  simple  sequence  there  is  one 
element  of  lower  rank  than  all  the  others;  and  further,  that  every  part  of 
the  simple  sequence  has  an  element  of  lower  rank  than  all  the  other  elements 
in  that  part. 

A  simply  infinite  ascending  aggregate  differs  from  a  finite  ordered  aggre- 
gate in  having  no  element  which  is  of  higher  rank  than  all  the  other  elements. 

The  totality  of  ordinal  numbers  forms  a  simply  infinite  ascending  aggre- 
gate ;  these  objects  may  be  represented  by  a  set  of  signs 

a,  &  7,  8,... 

or  1,2,3,4,... 

where  it  is  assumed  that  some  adequate  scheme  of  such  signs  has  been 
devised. 

The  order  of  the  elements  is  assigned  by  the  successive  formation,  as 
above;  of  aggregates  having  the  various  elements  for  their  ordinal  numbers, 
and  it  has  been  shewn  that,  if  an  aggregate  has  the  ordinal  number  ??,  another 
aggregate  having  a  different  ordinal  number  n,  taken  to  be  of  next  higher 
rank  than  n,  can  be  formed.  There  exists  therefore  no  highest  ordinal  number. 

Instead  of  using  the  expressions  "of  higher  rank  "  and  "  of  lower  rank,"  it 
is  usual  to  say  that  a  number  ra  is  less  than  a  number  n,  when  m  is  of  lower 
rank  than  n  in  the  ordered  aggregate  of  ordinal  numbers,  and  that  n  is  greater 
than  m.  The  terms  "  greater  "  and  "  less  "  are  borrowed  from  the  language 
primarily  applicable  to  the  description  of  magnitudes :  but  in  pure  arithmetic 
and  pure  analysis  generally,  they  are  used  only  in  the  sense  in  which  they 
indicate  higher  or  lower  rank,  and  this  rank  has  no  necessary  reference  to 
relations  of  magnitude  or  of  measurable  quantity. 

The  operation  of  counting  a  finite  aggregate  of  objects  of  any  kind  may 
be  conceived  of  as  the  process  of  putting  the  objects  into  correspondence  with 
the  elements  of  the  aggregate  of  ordinal  numbers,  in  such  a  way  that,  when 
any  ordinal  number  has  an  element  of  the  aggregate  which  corresponds  to  it, 
each  of  the  preceding  ordinal  numbers  also  has  an  element  which  corresponds 
to  it. '  The  finite  aggregate  is  usually  ordered  by  the  process  itself,  the  ranks 
of  the  various  elements  being  successively  assigned  to  them  as  the  counting 
proceeds.  Those  ordinal  numbers  which  are  employed  in  counting  such  an 
aggregate  may  be  regarded  as  forming  an  aggregate  which  is  similar  to  the 
given  aggregate,  as  ordered  by  the  process  of  counting.  The  last  of  the  ordinal 
numbers  employed  in  counting  a  finite  aggregate  is  the  ordinal  number,  or 
simply  the  number  (Anzahl)  of  the  ordered  aggregate. 
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The  theorem  that  an  ordered  aggregate  is  not  similar  to  any  of  its  parts 
holds  only  as  regards  finite  aggregates.  It  will  appear  in  the  course  of  the 
discussion  in  Chapter  iv  that  every  aggregate  which  is  not  finite  has  parts 
which  are  similar  to  the  whole ;  and  this  property  is  sometimes  taken  as  the 
basis  of  the  definition  of  an  infinite,  or  transfinite,  aggregate.  For  example, 
the  aggregate  of  ordinal  numbers  1,  2,  3, ...  is  similar  to  the  part  2,  4,  6, ... 
which  contains  the  even  numbers  only. 


MATHEMATICAL  INDUCTION. 

5.  The  proofs  of  theorems  in  §  3  have  been  referred  to  as  proofs  by  induc- 
tion. In  its  general  form  the  principle  of  Mathematical  Induction  may  be 
stated  as  follows: 

If,  in  respect  of  a  given  simply  infinite  ordered  aggregate,  it  be  known 
(1)  that,  in  case  any  element  of  the  aggregate  possesses  a  certain  property  P, 
the  element  of  next  higher  rank  also  possesses  the  property  P,  (2)  that  the 
element  of  lowest  rank  possesses  the  property  P;  it  then  follows  that  every 
element  of  the  aggregate  possesses  the  property  P. 

The  truth  of  the  principle  follows  as  a  consequence  of  the  properties 
assigned  to  the  aggregate  by  means  of  the  definition  in  §  4. 

For,  let  it  be  assumed,  if  possible,  that  there  exists  a  part  of  the  given 
aggregate  A,  such  that  the  elements  belonging  to  that  part  do  not  possess 
the  property  P.  This  part  contains  one  or  more  elements ;  let  M  be  such  an 
element.  Let  us  consider  the  finite  aggregate  which  consists  of  all  those 
elements  of  A  which  are  of  rank  not  higher  than  that  of  M.  Of  this  finite 
aggregate  there  exists  a  part  B,  such  that  no  element  of  B  possesses  the 
property  P.  The  aggregate  B  contains  the  element  M,  and  possibly  other 
elements.  Since  B,  being  part  of  a  finite  aggregate,  is  itself  a  finite  aggregate, 
it  contains  an  element  m  of  lower  rank  than  all  the  others.  By  hypothesis,  m 
cannot  be  the  element  of  lowest  rank  in  A,  since  the  latter  possesses  the 
property  P.  Therefore  there  is  in  A  an  element  m'  of  next  lower  rank  than 
m;  and  m'  possesses  the  property  P.  But  by  hypothesis,  if  m'  possesses  that 
property,  so  also  does  ra.  The  assumption  that  A  contains  a  part  such  that 
the  elements  of  the  part  do  not  possess  the  property  P  has  thus  been  shewn 
to  lead  to  a  contradiction.  The  truth  of  the  principle  has  therefore  been 
established. 

It  has  been  maintained  by  Poincare*  that  the  principle  of  Mathematical 
Induction  is  a  special  characteristic  of  mathematical  reasoning  as  distinct 

• 

*  See  his  work  La  science  ct  V hypothetic,  p.  19  ;  also  an  article  in  the  Revue  de  Metaphysique, 
vol.  n,  p.  371.  For  a,.criticism  of  Poincare's  view,  see  an  article  by  A.  Padoa,  Proc.  of  the  fifth 
annual  Congress  of  Mathematician*,  vol.  n,  p.  471. 
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from  that  of  general  Logic,  not  being  reducible  to  the  principle  of  contradic- 
tion, as  it  essentially  involves  the  employment  of  an  unending  chain  of 
syllogisms.  That  this  is  the  case  is  suggested  by  the  form  in  which  the 
principle  is  frequently  applied  in  elementary  mathematical  treatises,  viz.  that 
since  the  property  P  holds  good  for  the  first  element,  it  therefore  holds  good 
for  the  second,  and  therefore  for  the  third,  etc.;  arid  consequently,  by  the 
employment  of  an  unending  set  of  such  syllogisms,  it  is  inferred  that  the 
property  P  holds  good  for  all  the  elements  of  the  infinite  aggregate.  It  has 
however  been  shewn  above  that  it  is  not  necessary  to  state  the  principle  in 
this  form,  but  that  the  truth  of  the  principle  follows  by  applying  the  ordinary 
Logic  to  deduce  the  consequences  of  the  possession  by  the  infinite  ordered 
aggregate  of  certain  definite  properties,  in  accordance  with  the  principle  of 
contradiction. 

CARDINAL  NUMBERS. 

6.    If  any  finite  ordered  aggregate  be  re-ordered  in  any  manner,  the  new 
ordered,  aggregate  is  finite,  and  has  the  same  ordinal  number  as  the  original 


one. 


In  order  to  prove  this  theorem,  the  following  particular  case  will  be  first 
established  : — If  Q  is  a  finite  ordered  aggregate,  the  aggregate  (Q,  e),  obtained 
by  adjoining  to  Q  a  new  element  e  of  higher  rank  than  all  the  elements  of  Q, 
is  similar  to  (e,  Q),  in  which  e  has  a  lower  rank  than  all  the  elements  of  Q. 
For  let  Q  =.  (Q1}  /),  and  let  us  assume  that  the  theorem  holds  for  Qlt  i.e.  that 
(Qi,  e)  is  similar  to  (e,  Qx);  it  follows,  since  a  complete  correspondence  can  be 
established  between  the  elements  of  (Q1?  e}  and  (e,  QJ,  that  the  same  is  true 
of  the  two  aggregates  (Q1}  e,f)  and  (e,  Q\,f).  Now  (Qlt  e, /)  is  similar  to 
(Qi>f>  e)>  since  Ql  can  be  made  to  correspond  to  itself,  e  to/1,  and/"  to  e,  there- 
fore (Qi,f,  e)  is  similar  to  (e,  Qi,f\  or  (Q,  e)  to  (e,  Q),  and  thus  the  theorem 
holds  for  Q=(Q1,f),  provided  it  holds  for  Q^  Now  it  clearly  holds  if  Ql 
consists  of  a  single  element;  hence  by  induction  it  holds  for  any  finite  ordered 
aggregate  Q.  To  prove  the  theorem  in  the  general  case,  let  us  assume  that 
it  is  true  for  an  aggregate  M ;  it  will  then  be  shewn  to  be  true  for  (M,  e).  For 
let  an  aggregate  obtained  by  re-ordering  (M,  e)  be  (R,  e,  S),  where  either  R 
or  S  may  be  absent;  (R,  e,  S)  is  similar  to  (R,  S,  e),  for  R  corresponds  with 
itself,  and  it  has  been  shewn  above  that  (e,  S)  is  similar  to  (S,  e).  Since  (R,  S) 
is  by  hypothesis  similar  to  M,  it  follows  that  (R,  S,  e)  is  similar  to  (M,e),  and 
therefore  (R,  e,  S}  is  similar  to  (M,  e).  The  theorem  clearly  holds  for  an  ag- 
gregate ( A,  B)  which  contains  two  elements,  hence  by  induction  it  holds  for 
every  finite  ordered  aggregate. 

It  follows  from  the  theorem  which  has  been  established  above,  that,  for 
any  aggregate  which  can  be  ordered  as  a  finite  ordered  aggregate,  the  ordinal 
number  is  independent  of  the  mode  in  which  the  aggregate  is  ordered. 
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It  will  be  found,  when  the  generalization  of  ordinal  numbers  for  non-finite 
aggregates  is  considered  in  Chapter  IV,  that  this  property,  that  the  ordinal 
number  of  an  aggregate  is  independent  of  the  mode  of  ordering,  is  peculiar  to 
finite  aggregates. 

7.  Two  aggregates  are  said  to  be  equivalent,  when  their  elements  can  be 
placed  into  correspondence  so  that  to  each  element  of  either  aggregate  there 
corresponds  one  and  only  one  element  of  the  other  aggregate. 

It  will  be  observed  that  the  relation  of  equivalence  differs  from  that  of 
similarity,  in  that  it  contains  no  reference  to  order.  It  is  clear  that  two 
aggregates  which 'are  each  equivalent  to  a  third  are  equivalent  to  one  an- 
other. 

An  unordered  aggregate  is  said  to  be  finite  when  it  can  be  so  ordered  that 
the  ordered  aggregate  is  finite  in  accordance  with  the  definition  given  in  §  2. 

Two  (finite)  aggregates  which  are  equivalent  are  said  to  have  the  same 
cardinal  number. 

It  thus  appears  that  a  cardinal  number  is  characteristic  of  a  class  of  equi- 
valent aggregates. 

Each  of  the  cardinal  numbers  is  to  be  regarded  as  a  unique  ideal  object ; 
the  relation  of  a  cardinal  number  to  a  member  of  the  "class  of  equivalent 
aggregates  of  objects,  of  which  it  is  characteristic,  may  be  illustrated  in  the 
same  manner  as  in  §  3,  in  the  case  of  the  ordinal  numbers. 

Since  all  similar  aggregates  are  also  equivalent,  and  since,  in  the  case  of  a 
finite  aggregate,  the  ordinal  number  is  independent  of  the  mode  in  which  the 
aggregate  is  ordered,  it  follows  that  for  every  finite  ordinal  number  there  is  a 
corresponding  cardinal  number. 

The  cardinal  numbers  of  finite  aggregates  are  denoted  by  the  same 
symbols  1,  2,  3, ...  as  the  corresponding  ordinal  numbers.  The  two  kinds  of 
numbers  are  not  symbolically  distinguished  from  each  other,  although  logically 
they  are  not  identical. 

It  will  be  seen  in  Chapter  iv  that  this  practical  identity  of  ordinal  and 
cardinal  numbers  is  confined  to  the  case  of  the  numbers  corresponding  to  finite 
aggregates,  and  therefore  called  finite  numbers.  The  finite  cardinal  numbers 
form  a  simple  sequence  1,  2,  3,  ...  similar  to  the  sequence  of  finite  ordinal 
numbers;  the  expressions  "greater"  and  "  less  "  are  used  in  relation  to  two 
cardinal  numbers  in  the  same  purely  ordinal  sense,  denoting  higher  and  lower 
rank,  as  in  the  case  of  ordinal  numbers. 

It  is  impossible,  in  a  purely  mathematical  work,  to  enter  into  a  full  dis- 
cussion of  the  nature  and  proper  definition  of  number  from  a  philosophical 
point  of  view.  One  view  of  number,  which  is  widely  held,  is  embodied  in  the 
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definition  by  abstraction,  in  which  the  cardinal  number*  is  regarded  as  the 
concept  of  an  aggregate  which  remains  when  we  make  abstraction  of  the  nature 
of  the  objects  forming  the  aggregate,  and  of  the  order  in  which  they  are  given  ; 
the  ordinal  number  is  then  regarded  as  the  concept  obtained  by  making  ab- 
straction of  the  nature  of  the  objects  only,  retaining  the  orderf  in  which  they 
are  given  in  the  aggregate.  The  view  has  also  been  maintained]:  that  a  cardinal 
number  is  simply  the  class  of  all  equivalent  aggregates.  A  tendency  has 
been  exhibited  amongst  mathematicians^  to  regard  numbers,  at  least  for  the 
purposes  of  analysis,  as  identical  with  the  symbols  which  represent  them.  In 
accordance  with  this  view,  abstract  arithmetic  is  cut  entirely  adrift  from  the 
fundamental  notions  related  to  experience  in  which  it  had  its  origin,  and  it 
is  thus  reduced  to  a  species  of  mechanical  game  played  in  accordance  with  a 
set  of  rules  which,  when  divorced  from  their  origin,  have  the  appearance  of 
being  perfectly  arbitrary;  though  it  may,  of  course,  be  said  that  it  is  possible 
at  the  end  of  any  arithmetical  process  to  reconnect  the  symbols  employed, 
with  the  ideas  which  originally  suggested  them,  and  thus  to  interpret  the 
results  of  the  purely  symbolical  processes.  Whatever  view||  be  adopted  as 

*  This  view  is  that  of  G.  Cantor;  see  Math.  Annalen,  vol.  XLVI,  p.  481,  where  the  following 
definition  is  given: — "  'Machtigkeit,'  oder  ' Cardinalzahl '  von  M  nennen  wir  den  Allgemeinbegriff 
welcher  mit  Hiilfe  unseres  activen  Denkvermogens  aus  der  Menge  M  hervorgeht,  dass  von  der  Be- 
schaffenh.eit  ihrer  verschiedenen  Elemente  m,  und  von  der  Ordnung  ihres  Gegebenseins  abstrahirt 
wird."  See  also  Peano,  Formulaires  de  Mathematiques,  1901,  §  32,  '0  Note. 

t  Ordinal  numbers  are  frequently  regarded  as  logically  prior  to  cardinal  numbers,  but  this  order 
of  procedure  is  not  a  necessary  one.  In  Dedekind's  tract  "Was  sind  und  was  sollen  die  Zahlen," 
Brunswick,  1887  and  1893,  which  has  been  translated  into  English  by  Prof.  W.  W.  Benian,  under 
the  title  "  Essays  on  the  Theory  of  Numbers,"  1901,  a  detailed  treatment  of  the  subject  is  given, 
in  which  the  notion  of  order  is  regarded  as  fundamental. 

J  See  B.  Eussell,  The  Principles  of  Mathematics,  vol.  i,  chap.  xi. 

§  For  example  see  Heine,  Cr Ale's  Journal,  vol.  LXXIV  (1872),  p.  173,  where  the  matter  is  stated 
in  the  following  plain  form:  "  Ich  nenne  gewisse  greifbare  Zeichen  Zahlen.  sodass  die  Existent 
dieser  Zahlen  also  nicht  in  Frage  steht."  Again,  Helmholtz  appears  to  hold  a  view  closely 
approaching  the  notion  that  Arithmetic  is  the  art  of  manipulating  certain  signs  according  to 
certain  rules  of  operation ;  he  writes  in  Ges.  Abh.  vol.  in,  p.  359,  "Ich  betrachte  die  Arithmetik 
oder  die  Lehre  von  den  reinen  Zahlen  als  eine  auf  rein  psychologische  Thatsachen  aufgebaute 
Methode,  durch  die  die  folgerichtige  Anwendung  eines  Zeichensystems  (namlich  der  Zahlen)  vou 
unbegrenzter  Au^dehnung  und  unbegrenzter  Moglichkeit  der  Verfeinerung  gelehrt  wird."  Refer- 
ence  may  be  made  to  an  essay  by  A.  Pringsheim  in  the  Jahresberichte  der  d.  math.  Vereinigung, 
vol.  vi,  1899,  "  Ueber  den  Zahl-  und  Grenzbegriff  im  Unterricht."  In  an  article  entitled  "Die 
Du  Bois  Eeymond'sche  Convergenz-Grenze,"  Sitzungsberichtc  d.  bayer.  Akad.  vol.  xxvn,  1897, 
Pringsheim  speaks  of  numbers  as  "  Zeichen,  denen  lediglich  eine  bestimmte  Succession  zukomrnt." 
See  p.  326.  This  article  contains  various  remarks  on  arithmetization,  and  especially  a  criticism 
of  the  views  of  P.  Du  Bois  Reymond.  A  searching  criticism  of  the  tendency  to  reduce  Arithmetic 
to  the  formal  manipulation  of  symbols  is  given  in  L.  Couturat's  work  De  I'infmi  mathematique, 
Paris,  1896,  which  contains  a  valuable  account  and  discussion  of  theories  of  the  philosophy  of 
arithmetic. 

||  References  to  the  literature  relating  to  the  Philosophy  of  Number  will  be  found  in  the 
Article  i.  A.  1,  "  Grundlagen  der  Arithmetik,"  by  H.  Schubert,  in  the  Encyclopfidie  der  mathe- 
matischen  Wisseiischaften,  vol.  i;  also  in  E.  G.  Husserl's  Philosophic  der  Arithmetik,  vol.  i,  chaps. 
5  and  6,  Halle,  1891.  Frege's  treatises,  the  Grundlayen  der  Arithmetik,  Breslau,  1884,  and  the 
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to  the  real  nature  of  number  and  its  place  in  a  general  scheme  of 
thought,  the  assumption  of  the  right  to  hypostatize  numbers  would  appear  to 
be  an  essential  condition  of  the  possibility  of  developing  an  abstract  arithmetic, 
and  consequently  of  the  establishment  of  mathematical  analysis  in  general. 

THE   OPERATIONS   ON   INTEGRAL   NUMBERS. 

8.  Iftwo  finite  ordered  aggregates  A  and  B,  of  which  the  ordinal  numbers 
are  a  and  /;  respectively,  are  combined  into  a  single  ordered  aggregate  in 
which  the  elements  of  A  have  all  lower  rank  than  those  of  B,  and  in  which 
any  two  elements  of  A,  and  any  two  elements  of  5,  have  the  same  relative 
orders  as  in  the  original  aggregates,  then  the  ordinal  number  of  the  combined 
aggregate  is  said  to  be  the  sum  of  the  ordinal  numbers  a  and  b,  and  is  denoted 
by  a  +  b. 

It  can  be  shewn  that  the  new  aggregate  is  a  finite  one,  and  that  its 
ordinal  number  is  unaltered  if  for  A  and  B  there  be  substituted  aggregates 
which  are  similar  to  them;  it  thus  appears  that  the  sum  a  +  b  is  a  finite 
number  which  depends  only  upon  a  and  b. 

The  aggregate  (A,  B)  has  as  lowest  element  the  lowest  element  of  A,  and 
as  highest  element  the  highest  element  of  B;  moreover  any  part  of  (A,  B)  is 
of  the  form  (A1,  B'),  where  A'  is  a  part  of  A,  and  B'  is  a  part  of  B;  or  else  it 
has  one  df  the  forms  A',  B',  and  since  A',  B'  have  each  a  lowest  and  a  highest 
element,  any  such  part  of  (A,  B)  has  a  lowest  and  a  highest  element.  Thus 
(A,  B)  is  finite. 

Again,  if  Alt  B^  are  aggregates  which  are  similar  to  A  and  B  respectively, 
the  elements  of  A  may  be  placed  in  correspondence  with  those  of  Alt  and  the 
elements  of  B  with  those  of  B^\  we  have  then  a  (1, 1)  correspondence  between 
the  elements  of  (A,  B)  and  those  of  (A1}  Bj);  thus  the  ordinal  number  of 
(A,  B)  is  the  same  as  that  of  (Alt  B^. 

Since  (A,  B)  has  the  same  ordinal  number  as  (B,  A)  it  follows  that 
a  +  b  =  b  +  a,  which  is  known  as  the  commutative  law  of  addition. 

If  a,  b  are  the  cardinal  numbers  of  two  finite  aggregates  A,  B,  then  the 
cardinal  number  of  the  aggregate  formed  by  combining  the  two  aggregates 
into  one  is  said  to  be  the  sum  of  a  and  b,  and  is  denoted  by  a  +  b.  That  a  +  b 
is  a  definite  finite  number,  dependent  only  on  a  and  b,  follows  at  once  from 
the  corresponding  theorem  which  has  been  proved  for  ordinal  numbers. 

Grundgesetze  der  Arithmetik,  Jena,  1893  and  1903,  also  Whitehead  and  Russell's  work,  Principia 
Mathematical,,  Cambridge,  may  be  here  specially  referjed  to,  in  connection  with  the  relation  of  the 
foundations  of  Arithmetic  to  general  Logic.  The  view  that  Number  is  fundamentally  dependent 
on  the  notion  of  Time  was  developed  by  Sir  W.  R.  Hamilton  ;  see  the  Dublin  Transactions,  vol. 
xvn  (II),  1835,  "Theory  of  Conjugate  Functions  or  Algebraic  Couples  with  a  Preliminary  and 
Elementary  Essay  on  Algebra  as  the  Science  of  Pure  Time";  see  also  Helmholtz's  essay  Ziihlen 
und  Mextten  (1887),  where  the  view  is  adopted  that  the  axioms  of  Arithmetic  have  a  relation  to 
the  intuitional  form  of  time,  similar  to  that  which  the  axioms  of  Geometry  have  to  the  intuitional 
form  of  space. 
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The  operation  of  finding  the  sum  of  two  numbers  a  and  b  is  known  as  the 
operation  of  addition,  and  it  has  been  shewn  that  this  operation  is  commuta- 
tive. It  should  be  observed  that  the  sum  of  two  numbers  a  and  6  cannot  be 
determined  merely  by  contemplating  those  numbers  themselves  as  abstract 
concepts,  but  can  only  be  defined  as  above,  by  referring  to  aggregates  of 
which  a  and  b  are  the  numbers,  and  then  combining  those  aggregates.  The 
number  of  the  combined  aggregate  is  then  conceived  of  as  the  result  of  a 
symbolical  operation  upon  the  numbers  a  and  b.  For  example,  the  equation 
5  +  3  =  8  does  not  imply  that  the  concept  8  is  obtainable  by  placing  the 
concepts  5,  3  as  it  were  in  juxtaposition,  but  can  only  be  regarded  as  a 
symbolical  expression  of  the  fact  that  an  aggregate  of  5  objects  together  with 
one  of  3  objects  makes  up  an  aggregate  of  8  objects.  Bearing  this  observation 
in  mind,  the  numbers  1,  2,  3,  ...  are  represented  symbolically  as  the  results 
of  successive  operations  of  addition,  1  +  1=2,  2  +  1  =  3,  3  +  1=4,  etc.; 
but  these  equations  do  not  express  definitions  of  the  numbers  2,  3,  4,  ..., 
since  from  the  concept  unity  taken  by  itself,  no  other  concept  is  directly 
derivable. 

The  operation  of  addition  can  be  extended  by  continued  repetition.  Thus 
the  sum  of  a,  b,  c, ...  k  is  a  finite  number  represented  bya  +  b  +  c+...+k, 
and,  in  particular,  any  number  n  is  represented  by  n  =  1  +  1  +  I  +  . . .-  +  1.  An 
immediate  induction  shews  that  the  result  of  the  operation  of  addition 
repeated  any  definite  number  of  times  is  a  finite  number  dependent  only 
on  the  constituents  of  the  summation. 

The  associative  law  of  addition,  a  +  (b  +  c)  =  (a  +  b)  +  c,  follows  from  the 
irrelevancy  of  the  order  in  which  the  operations  are  performed.  This  is  seen 
from  the  contemplation  of  aggregates  of  \\hich  a,  b,  c  are  either  the  ordinal  or 
the  cardinal  numbers. 

9.  If  in  a  finite  aggregate  of  which  the  number  is  6,  each  element  be 
replaced  by  a  finite  aggregate  of  which  the  number  is  a,  the  number  of  the  new 
aggregate  so  formed  is  said  to  be  the  product  of  b  by  a,  and  is  denoted  by  ab. 
This  operation  is  said  to  be  that  of  multiplying  b  by  a.  By  taking  the 
aggregates  to  be  ordered,  it  is  seen  at  once  that  the  new  aggregate  satisfies 
the  conditions  that  it  is  finite,  and  that  its  number  is  unaltered  by  the 
substitution  of  similar  aggregates  of  other  objects  for  those  originally  em- 
ployed. Thus  ab  is  a  definite  number  dependent  only  on  a  and  b. 

It  is  clear  that  ab  may  be  regarded  as  the  sum  a  +  a  +  a  +  . . . ,  where  a 
occurs  b  times  in  the  operation. 

If  the  ordered  aggregate  of  which  the  number  is  ab  be  re-ordered  in  the 
following  manner: — take  the  first  element  of  each  of  the  aggregates  of  which 
a  is  the  number,  then  the  second  elements  of  these  aggregates,  and  so  on, 
with  lastly  the  ath  elements  of  these  aggregates,  then  we  have  as  the  result 
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of  the  process  an  aggregate  of  which  the  number  is  a,  and  each  element  of 
which  consists  of  an  aggregate  of  which  the  number  is  6;  the  re -ordered 
aggregate  has  the  number  ba.  It  has  thus  been  shewn  that  ab  =  ba,  which 
is  expressed  by  saying  that  the  operation  of  multiplication  of  finite  integers 
is  commutative. 

The  distributive  law  for  multiplication,  a  (b  +  c)  =  ab  +  ac,  follows  from 
the  definition  of  the  operation,  by  considering  the  aggregates  of  which  a,  6,  c 
are  the  numbers. 

An  immediate  induction  shews  that  the  repetition  of  the  operation  of 
multiplication  any  definite  number  of  times  gives  a  finite  number  dependent 
only  on  the  numbers  multiplied,  and  independent  of  the  order  in  which  the 
operations  are  performed. 

The  result  of  the  operation  of  multiplying  the  number  a  by  itself  is 
denoted  by  an,  where  n  is  the  number  of  times  a  occurs  in  the  product 
a.  a.  a... a.  From  this  definition  the  law  am.an  =  am+n  is  directly  deducible. 

10.  If  the  sum  of  two  numbers  a,  b  be  denoted  by  c,  the  number  a  is 
uniquely  determined  when  b,  c  are  fixed;  and  it  is  then  regarded  as  the  result 
of  the  operation  of  subtracting  b  from  c.    The  operation  of  subtraction  is 
thus  defined  as  inverse  to  that  of  addition.    If  c  =  a  +  6,  a  is  obtained  as  the 
result  of  the  operation  denoted  by  c  —  b,  which  is  such  that  (c  —  b)  +  b  =  c. 
It  is  obvious  that  the  operation  of  subtraction  of  b  from  c  is  only  possible  in 
case  c  >  b. 

If  the  product  ,of  two  numbers  a,  b  be  the  number  c,  then  the  number  a 
is  uniquely  determined  when  b  arid  c  are  given;  and  a  is  regarded  as  the 
result  of  the  operation  of  division  of  c  by  6.  The  operation  of  division  so 
defined  is  inverse  to  that  of  multiplication;  it  is  clear  that  the  operation  is 
only  possible  in  case  c  is  one  of  the  class  of  numbers  b,  26,  36, 

FRACTIONAL   NUMBERS. 

11.  The    operation   of  multiplying  two  integers  a,  b  together  is  one 
which  is  always  a  possible  operation,  in  accordance  with  the  definition  of  the 
operation  of  multiplication  which  has  been  given  above;  the  inverse  opera- 
tion of  division  is  however,  as  we  have  seen,  not  always  a  possible  one.    This 
restriction   upon  the  possibility   of  the  operation  of  division  suggests  the 
introduction  into  Arithmetic  of  a  new  class  of  numbers,  the  rational  fractions, 
which,  when  defined,  shall  be  such  that  the  operation  of  division,  within  the 
whole   aggregate  of  integers  and  fractions,  may  be  a  possible  one  without 
restriction.    Stated  in  algebraical  form,  the  demand  arises  for  a  scheme  of 
numbers  such  that  the  equation  ax=b  shall  always  have  a  solution  in  x, 
where  a,  6  are  any  two  numbers  which  belong  to  the  contemplated  aggregate 
of  numbers. 
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The  actual  use  of  fractional  numbers  arose  historically  from  the  necessities 
of  the  process  of  measurement  of  extensive  magnitude,  and  the  conception  of 
a  fraction  which  arises  in  this  connection  is  the  one  which  is  used  in  ordinary 
life,  and  is  made  the  basis  of  the  treatment  of  the  theory  of  fractions,  even  in 
recent  scientific  text-books.  In  accordance  with  this  view,  a  unit  of  magnitude 
of  some  kind  is  divided  into  6  equal  parts,  and  a  of  these  parts  are  taken;  the 
resulting  magnitude  is  then  denoted  by  the  fraction  a/6. 

This  notion  of  the  essential  nature  of  a  fraction,  dependent  as  it  is  upon 
the  notions  of  a  unit,  and  of  the  divisibility  of  such  unit  into  equal  parts,  is 
incompatible  with  the  modern  view  that  Mathematical  Analysis  should  be 
developed  upon  the  basis  of  a  Pure  Arithmetic,  quite  independently  of  all 
notions  connected  with  the  measurement  of  extensive  magnitude.  The  modern 
tendency  known  as  Arithmetization  manifests  itself  in  the  construction  of 
theories  of  Number  and  of  the  operations  involving  numbers,  which  depend 
entirely  upon  the  conceptions  connected  with  the  process  of  counting;  measure- 
ment being  regarded  as  a  process  foreign  to  Pure  Arithmetic.  The  process  of 
counting  is  an  exact  one:  whereas  measurement  can  in  practice  only  be  carried 
out  with  a  greater  or  less  degree  of  approximation,  and  can  only  ideally  be 
made  an  exact  process.  Pure  Arithmetic  is  made  the  basis  of  Analysis,  not 
only  in  accordance  with  the  general  principle  that  the  fundamental  conceptions 
of  a  branch  of  science  should  be  irreducible  to  simpler  conceptions,  but  also 
because  the  theory  of  ideally  exact  measurement  has  peculiar  difficulties  of  its 
own.  Our  essentially  inexact  intuitions  of  spatial,  temporal,  or  other  magni- 
tudes, necessitate  a  process  of  idealization  in  which  the  objects  of  perception 
are  replaced  by  ideal  objects  subject  to  an  exact  scheme  of  definitions  and 
postulates,  in  order  that  an  exact  science  of  measurement  may  be  possible. 
The  view  is  at  present  held  by  the  majority  of  mathematicians  that  the  nature 
of  the  abstract  continuum,  and  that  of  a  limit,  are  capable  of  exact  formulation 
only  in  the  language  of  a  Pure  Arithmetic;  and  that  this  science  must  there- 
fore be  developed  upon  an  independent  basis  before  it  can  be  applied  to  the 
elucidation  of  the  conceptions  requisite  for  an  abstract  theory  of  continuous 
magnitude.  The  theory  of  measurement  is,  in  accordance  with  this  view, 
regarded  as  an  application,  and  not  as  part  of  the  basis,  of  Mathematical 
Analysis. 

12.  By  those  writers  who  are  under  the  influence  of  the  modern  arithme- 
tizing  tendency,  the  traditional  non-arithmetical  definition  of  a  fraction  has 
been  abandoned,  and  in  its  place  a  formal  definition  has  been  substituted,  in 
which  the  fraction  is  regarded  as  an  association  of  a  pair  of  integers.  The 
associated  integers  are  regarded  as  making  a  single  object,  and  laws  of  combina- 
tion of  these  objects  are  then  postulated. 

If  a,  b  are  two  integers,  a  new  number  (a,  6),  or  in  ordinary  notation  j ,  is 
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formed  by  the  association  of  a  and  6,  the  new  number  being  defined  to  be  such 
as  to  satisfy  the  following  conditions: 

(1)  (a,  6)  is  regarded  as  ordinally  greater,  equal  to,  or  less  than  (c,  d), 
according  as  ad  is  greater,  equal  to,  or  less  than  be.    The  expressions  greater, 
equal  to,  or  less  than,  are  here  used,  not  in  their  primitive  sense  as  referring 
to  magnitude,  but  in  the  sense  in  which  we  have  used  them  in  the  case  of 
integers,  as  assigning  relative  order  to  the  numbers. 

(2)  (a,  1)  is  defined  as  equal  to  a;  thus  if  b  =  1,  the  association  is  regarded 
as  equivalent  to  the  integer  a.   Taking  (1)  in  conjunction  with  this  postulate, 
the  new  numbers  have  their  orders  assigned,  not  only  relatively  to  one  another, 
but  relatively  also  to  the  integral  numbers;  so  that  the  whole  aggregate  of 
integers  and  fractions  is  ordered,  in  the  sense  that,  of  two  given  numbers,  it 
can  always  be  said  which  has  the  higher  rank. 

(3)  The  addition  of  two  fractional  numbers  is  defined  by 

(a,  6)  +  (c,  d)  =  (ad  +  be,  bd). 

(4)  The  multiplication  of  fractional  numbers  is  defined  by 

(a,  6)  x  (c,  d)  =  (ac,  bd). 

(5)  The  use  of  a  fraction  as  an  index  is  defined  by  the  postulate 

g,(a,  bi  x  g,(c,  d)  _  g,(a,  b)  +  (c,  d) 

where  x  is  any  number,  either  integral  or  fractional.  The  symbol  #<*•  A)  is  to 
be  interpreted  subject  to  this  postulate,  in  case  such  interpretation  is  possible. 
It  will  be  observed  that,  in  the  case  6  =  1,  d—  1,  the  above  definitions  are 
consistent  with  those  which  have  been  adopted  in  the  case  of  integral  numbers ; 
and  thus  the  new  numbers,  together  with  the  integers,  form  an  aggregate  with 
uniform  laws  of  operations.  It  is  easily  seen  that  the  operations  with  new 
numbers  satisfy  the  commutative,  associative,  and  distributive  laws.  The 
inverse  operation  of  division  is  now  one  which  is  always  possible  within  the 
domain  of  the  numbers;  thus  (a,  b)  -4-  (c,  d)  =  (ad,  be).  The  inverse  operation 
of  subtraction,  (a,  b)  —  (c,  d)  =  (ad  —  be,  bd),  is  only  possible  if  (a,  b)  >  (c,  d). 

The  association, of  a  pair  of  integers  is  a  "number"  in  quite  a  different 
sense  from  that  in  which  the  cardinal  and  ordinal  numbers,  hitherto  discussed, 
are  numbers.  The  justification  of  the  extension  of  the  term  "number"  to  the 
fractions  lies  in  the  fact  that  a  consistent  scheme  of  operations  can  be  imposed 
upon  them,  of  which  the  laws  are  in  agreement  with  those  which  hold  for 
operations  which  involve  integers  only. 

13.  The  scheme  which  has  been  above  indicated  suffices  for  a  formal 
definition  arid  logical  development  of  the  properties  of  fractions,  but  it  is 
subject  to  the  objection  that  it  is  of  an  arbitrary  character;  indeed  it  is  not 
easy  to  see  why  the  particular  laws  of  operations  have  been  postulated,  except 
as  suggested  by  the  traditional  non-arithmetical  conception  of  a  traction. 
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To  remedy  this  defect,  a  view  of  the  nature  of  a  fraction  will  be  here  given 
which  relates  the  fraction  with  the  process  of  counting,  in  such  a  manner  that 
fractional  and  integral  numbers  have  similar  relations  to  that  process.  It  will 
appear  that  the  laws  of  combination  given  above  naturally  follow  from  this 
mode  of  regarding  the  fraction,  with  the  exception  of  (5),  which  is  however 
immediately  suggested  by  the  rule  for  integral  indices. 

Consider  an  aggregate  of  b  objects,  and  out  of  these  b  objects  pick  out  any 
a  (£  b)  of  them.  If  we  regard  these  a  objects  not  only  as  single  objects  of 
number  a,  but  also  as  belonging  to  an  aggregate  whose  number  is  b,  we  may 
Denote  the  a  objects  by  (a,  b),  where  their  number  a  is  associated  with  the 
cardinal  number  b  of  the  aggregate  to  which  they  belong.  This  process  being 
independent  of  the  particular  aggregate  used,  the  abstract  fraction  (a,  6)  is 
related  to  this  process  in  an  analogous  manner  to  that  in  which  the  number  b 
is  related  to  the  process  of  counting  an  aggregate  whose  cardinal  number  is  b. 
Thus  the  fraction  (a,  b),  or  a/6,  is  characteristic  of  an  aggregate  of  a  objects 
each  of  which  belongs  to  an  aggregate  of  b  objects.  The  extension  of  the 
definition  to  the  case  a>b,  is  clear  when  we  observe  that  it  is  unessential  that 
the  a  objects  taken  should  all  belong  to  one  and  the  same  aggregate  of  b 
objects;  it  is  sufficient  that  each  of  them  be  regarded  as  essentially  belonging 
to  some  aggregate  of  cardinal  number  6.  In  accordance  with  this  view,  a 
fraction,  say  3/5,  is  characteristic  of  any  three  things  each  of  which  belongs  to 
an  aggregate  of  five  things,  i.e.  3/5  means  3  out  of  5.  That  the  three  things 
taken  out  of  five  should  necessarily  all  be  equal  in  respect  of  size,  or  some 
other  kind  of  magnitude,  is  as  irrelevant  to  the  true  nature  of  a  fraction  as 
the  assumption  of  five  things  necessarily  meaning  five  equal  things  is  to  the 
true  nature  of  the  number  five. 

Since  (a,  1)  is  characteristic  of  an  aggregate  of  a  things  each  of  which  is 
also  regarded  as  a  single  object,  it  is  clear  that  (a,  1)  is  identical  with  a. 

If  we  suppose  each  of  the  b  elements  in  an  aggregate,  of  which  the  cardinal 
number  is  b,  to  be  replaced  by  an  aggregate  of  n  elements,  we  have  now  an 
aggregate  with  nb  for  its  cardinal  number;  and  instead  of  a  elements  chosen 
out  of  this  aggregate  we  now  have  na  of  the  new  elements,  each  of  which  is  to 
be  regarded  as  associated  with  the  cardinal  number  nb.  We  represent  these 
na  elements  by  (na,  nb),  which  is  equivalent  to  (a,  b),  since  the  two  forms 
represent  two  different  aspects  of  the  same  process.  Therefore  we  have 
(a,  b)  =  (na,  nb),  or  in  the  ordinary  notation  a/b  =  na/nb.  This  relation  is  in 
complete  accordance  with  the  law  of  logical  (not  arithmetical)  addition,  that 
a  mere  repetition  of  a  term  yields  only  the  term  itself. 

Since  (a,  b)-=(ad,  bd),  and  (c,  d)  =  (bc,  bd),  we  regard  (a,  6)  as  greater, 
equal  to,  or  less  than  (c,  d),  in  the  purely  ordinal  sense  of  the  terms,  according 
as  ad  is  |  be.  For  the  two  numbers  (ad,  bd),  (be,  bd)  are  characteristic  of  the 
process  of  taking  ad,  be  elements  respectively  from  an  aggregate  of  the  same 
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cardinal  number  bd;  and  thus  the  relative  order  of  the  two  numbers  (a,  b), 
(c,  d)  will  naturally  be  fixed  in  accordance  with  the  relative  order  of  the  two 
numbers  ad,  be. 

The  addition  of  the  two  numbers  (a,  b)  and  (c,  d)  is  equivalent  to  that  of 
(ad,  bd)  and  (be,  bd),  and  is  consequently  naturally  defined  as  given  by 
(ad  4-  be,  bd),  which  characterises  the  amalgamation  of  two  aggregates  of  which 
the  numbers  are  ad,  be,  the  elements  of  each  of  which  all  belong  to  an  aggregate 
of  number  bd,  or  to  one  of  several  such  aggregates. 

To  interpret  the  operation  of  multiplication,  let  us  consider  an  object 
represented  by  (c,  d);  this  consists  of  c  things  each  belonging  to  an  aggregate 
of  d  things.  To  multiply  it  by  (a,  b),  is  to  take  a  such  objects  each  of  which 
belongs  to  an  aggregate  of  b  such  objects;  we  have  on  the  whole  one  or  more 
aggregates  of  bd  elements,  and  out  of  these,  ac  elements  are  to  be  taken.  Thus 
the  multiplication  of  the  number  (c,  d)  by  the  number  (a,  b)  may  be  understood 
to  characterise  the  result  of  taking  a  objects,  each  of  which  is  characterised 
by  (c,  d),  out  of  one  or  more  collections  of  6  objects,  each  of  which  objects  is 
characterised  by  (c,  d).  This  is  the  same  thing  as  the  process  of  taking  ac 
objects  out  of  one  or  more  aggregates  of  bd  objects,  and  is  characterised  by 
the  number  (ac,  bd);  we  are  thus  led  to  the  law  of  multiplication 


ac 


(a,  b)  x  (c,  d)  =  (ac,  bd),  or  T  x  -=  =  r-^ . 

6      d     bd 


NEGATIVE  NUMBERS,  AND  THE  NUMBER  ZERO. 

14.  Although  the  operation  of  addition  is  always  possible  within  the 
aggregate  of  integral  and  fractional  numbers,  yet  the  inverse  operation  of 
subtraction  is  not  always  possible;  thus  a  number  a;  cannot  be  found  such  that 
x  +  (c,  d)  =  (a,  b),  unless  (a,  b)>(c,  d).  As  the  limitation  of  the  possibility  of 
division  suggests  the  introduction  of  fractional  numbers,  so  this  limitation  of 
the  possibility  of  subtraction  suggests  the  introduction  of  a  further  set  of  new 
numbers,  which  shall  be  such  that,  within  the  so  completed  aggregate,  sub- 
traction may  always  be  a  possible  operation. 

If  a,  /3,  7,  8  denote  integral  or  fractional  numbers  such  that  a  >  /9,  7  >  8  : 
we  may  put  a  =  /3  +  #,  y=  8  +y;  then  x  =  a  -  ft,  y  =  7  -  8.  We  have 

a  +  <y=fi  +  8+x  +  y, 
hence  x  +  y  =  («  +  7)  -  (/3  +  8), 

or  (a -£)  +  (? -8)  =  (a +  7) -(#  +  8) (1). 

Again,  if  a  -  /3  =  7  -  8,  i.e.  x  =  y,  we  have  a+8  =  £  +  8  +  a?  =  £+y; 
or  a  +  S  =  /3  +  7,  if  a-/3  =  7- 8    (2). 

H.  9 
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Lastly,  we  have 


=  /38  4-  @y  +  x8 
hence  07  +  /38  =  /3  (;y  +  8)  +  8  (x  +  /3)  +  xy 


hence  (a  -  /3)  (7  -  8)  =  (a7  +  /38)  -  (aS  +  £7)  ..................  (3). 

The  rules  (1),  (2),  (3),  with  regard  to  the  numbers  a  —  @,  7  —  8,  which  so 
far  exist  only  when  a  >  /3,  7  >  8,  suggest  the  mode  of  the  extension  referred 
to  above. 

15.  Let  a,  /3  be  any  two  numbers  integral  or  fractional,  and  conceive  a 
new  number  D  (a,  /3),  formed  by  the  association  of  a  and  /3,  to  be  defined  as 
subject  to  the  laws 

(4)  D  (a,  /3)  =  D  (7,  8),  if  a  +  8  =  £  +  7, 

(5)  D  (a,  /3)  +  J9  (7,  8)  =  jD  (a  +  7,  £  4-  8), 

(6)  D  (a,  /3)  x  D  (7>  S)  =  D  (07  4-  £8,  a8  +  /37): 

it  will  be  observed  that  when  a  >  /3,  and  y  >  8,  D  (a,  /3)  may  denote  a  —  $, 
the  three  laws  becoming  (2),  (1),  (3).  It  will  now  be  shewn  that  the  symbol 
D  (a,  ft]  defines  a  number  of  an  aggregate  within  which  the  operation  of  sub- 
traction is  always  possible.  For,  to  find  a  number  x,  such  that 

x  4-  D  (a,  /3)  =  D  (7,  8),  we  see  that  x  =  D  (J3  +  7,  a  4-  8), 
since  D  (a  4-  /3  4-  7,  a  4-  0  +  8)  =  D  (7,  8),  in  virtue  of  (4). 

Since  D  (a,  a)  =  D  (7,  7),  we  see  that  D  (a,  a)  is  independent  of  a;  and  thus 
D  (a,  a)  defines  a  new  number  which  is  called  the  number  zero,  and  is  denoted 
by  the  symbol  0. 

The  number  zero  is  regarded  as  characteristic  of  the  absence  of  all  elements 
from  an  aggregate  of  which  the  existence  has  been  contemplated;  it  is  the 
number  of  such  a  hypothetical  aggregate,  in  a  sense  similar  to  that  in  which 
a  positive  integer  is  the  number  of  an  actual  aggregate. 

The  number  D  (a  4  k,  k)  depends  only  on  a,  and  we  shall  postulate  that  it 
is  identical  in  meaning  with  a  itself. 

The  numbers  D  (a,  (3),  or  a  —  yS,  for  which  a  >  /3,  are  called  positive  numbers, 
and  form  the  aggregate  of  integral  and  fractional  numbers  we  have  previously 
considered. 

Those  numbers,  for  which  a  <  /3,  are  called  negative  numbers. 

Since  by  (5),  D  (a,  /3)  4-  D  (/3,  a)  =  D  (a  4-  j3,  a  4-  £)  =  0,  the  number  D  (p,  a) 
may  be  denoted  by  —  D  (a,  /3),  or  in  ordinary  notation  —  (a  —  /3).  Thus  to  every 
positive  number  x  there  corresponds  a  single  negative  number  —  x,  which  is 
such  that  x  +  (—  x)  =  0. 
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We  may  now  use  the  notation  o  —  (3  in  every  case  for  D(a,  y3),  and  thus 

*-£--(£-«). 

From  (6),  it  is  seen  that  the  operation  of  division  is  always  possible  for 
two  members  of  the  complete  aggregate  of  positive  and  negative  numbers  and 
zero,  except  when  the  divisor  is  the  number  zero,  in  which  case  the  operation 
is  meaningless. 

From  (6),  we  see  by  putting  y=8,N  =  D(a,  0),  that  ^.0  =  0.  From  (5), 
we  have  N+Q  =  N. 

Any  number  D  (a,  /3)  is  said  to  be  greater  in  the  ordinal  sense  than  D  (y,  S), 
when  D(a,@)  —  D(y,8)  is  positive;  thus  the  complete  aggregate  of  positive 
and  negative  integral  and  fractional  numbers  together  with  the  number  zero 
is  one  in  which  all  the  numbers  are  arranged  in  a  definite  order.  This  aggre- 
gate is  known  as  the  aggregate  of  rational  numbers. 

In  the  aggregate  of  rational  numbers  so  ordered,  the  number  zero  -has 
lower  rank  than  any  of  the  positive  numbers,  and  higher  rank  than  any  of  the 
negative  numbers.  Further,  if  x,  y  are  two  positive  numbers  of  which  x  has 
higher  rank  than  y,  the  negative  number  —  x  has  lower  rank  than  —  y. 

If  x,  y  are  any  two  rational  numbers,  such  that  x<y,  there  exist  an  un- 
limited number  of  rational  numbers  each  of  which  is  >  x,  and  <  y. 

Such  numbers  are  said  to  be  between  x  and  y.  For  it  can  be  seen  at  once, 
from  the  definition  of  order  given  above,  that  \(x+  y)  is  one  such  number; 
between  \  (x  +  y)  and  either  x  or  y  another  rational  number  can,  in  a  similar 
manner,  be  found.  This  process  can  be  carried  on  without  end;  and  it  is  clear 
that,  in  accordance  with  the  mode  of  ordering  of  the  aggregate,  defined  above, 
all  the  numbers  thus  determined  are  between  x  and  y. 

If  x,  y  are  any  two  positive  rational  numbers  such  that  x  <  y,  an  integer  n 
can  be  found  which  is  such  that  nx  >y. 

y  .  11  ft 

For-  is  a  rational  number  such  that  -.x  =  x\  again  if-  be  any  positive 

X  «/  (j 

1) 
rational  number,  there  exist  integers  which  are  >  -  ;  for  p  +  1  is  itself  such 

11  11 

an  integer.    If  n  is  an  integer  which  is  >  -,  we  have  nx  >  - .  x  >  y ;  thus  the 

00  00 

theorem  is  established. 

IRRATIONAL  NUMBERS. 

16.  The  only  numbers  of  which  the  existence  was  recognized  by  the  Greek 
geometers  were  the  rational  numbers,  although  the  fact  that  the  ratio  of  two 
geometrical  magnitudes  is  not  necessarily  exactly  representable  by  such  num- 
bers appears  to  have  been  discovered  at  a  very  early  period.  Euclid  gave,  in 
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the  fifth  book  of  his  treatise,  a  discussion  of  the  theory  of  ratios,  and  in  the 
tenth  book  a  theory  of  those  incommensurable  magnitudes  which  are  ideally 
constructible  by  means  of  straight  lines  and  circles.  In  later  times*,  the  idea 
was  current  that,  to  the  ratio  of  any  two  magnitudes  of  the  same  kind,  there 
corresponds  a  definite  number;  and  in  fact  Newton  in  his  Arithmetica  Urtiver- 
salis  expressly  defines  a  number  as  the  ratio  of  any  two  quantities.  Before  the 
recent  development  of  the  arithmetical  theories  of  irrational  number,  and  to 
a  considerable  extent  even  later,  a  number  has  been  regarded  as  the  ratio  of 
a  segment  of  a  straight  line  to  a  unit  segment,  and  the  conception  of  irrational 
number  as  the  ratio  of  incommensurable  segments  has  been  accepted  as  a 
sufficient  basis  for  the  use  of  such  numbers  in  Analysis. 

In  accordance  with  the  doctrine  that  Mathematical  Analysis  must  rest 
upon  a  purely  arithmetical  basis,  the  introduction  of  irrational  numbers  into 
Analysis  must  be  made  without  an  appeal  to  our  intuition  of  extensive  magni- 
tude, but  rather  by  an  extension  of  the  conception  of  Number,  resting  on  a 
further  development  of  the  ideas  which  have  been  here  discussed  in  connection 
with  the  theory  of  rational  numbers.  The  necessity  for  this  extension  of  the 
domain  of  Number  arises  not  only  on  account  of  the  inadequacy  of  rational 
numbers  for  application  to  ideally  exact  measurement,  but  also,  as  will  be  ex- 
plained later  in  detail,  because  the  theory  of  limits,  which  is  an  essential 
element  in  Analysis,  is  incapable  of  any  rigorous  formulation  apart  from  a 
complete  arithmetical  theory  of  irrational  numbers. 

Before  the  recent  establishment  of  the  theory  of  irrational  numbers,  no 
completely  adequate  theory  of  Magnitude  was  in  existence.  This  is  not  sur- 
prising, if  we  recognize  the  fact  that  the  language  requisite  for  a  complete 
description  of  relations  of  magnitudes  must  be  provided  by  a  developed 
Arithmetic. 

17.  The  successive  extensions  of  the  domain  of  Number,  by  the  introduc- 
tion of  fractional  and  of  negative  numbers,  were  suggested  by  the  desirability 
of  so  completing  the  domain  that  the  operations  of  division  and  subtraction, 
which  are  not  always  possible  in  the  more  limited  domain,  might  always  be 
so  in  the  more  extended  one.  In  the  aggregate  of  rational  numbers,  the 
operations  of  addition,  subtraction,  multiplication,  and  division  are  always 
possible  operations;  but  it  can  be  readily  shewn  that  the  inverse  operation 
involved  in  determining  a  fractional  power  of  a  rational  number  is  not,  in 
general,  a  possible  one. 

As  the  simplest  case  of  this  impossibility  of  such  operation,  we  may  take 
the  problem  of  finding  the  square  root  of  a  positive  integer  m  which  is  not 

*  A  good  short  account  of  the  history  of  this  subject  will  be  found  in  the  Article  I.  A  3,  "  Irra- 
tionalzahlen  und  Konvergenz  unendlicher  Prozesse,"  by  A.  Pringsheim,  in  the  Encyclopedic  der 
Math.  H'isxenschaften,  vol.  i.  See  also  M.  Cantor,  Geschichte  der  Math.,  vol.  i. 
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a  square  number.  It  can  be  shewn  that  such  a  number  has  no  square  root 
within  the  aggregate  of  rational  numbers. 

If  possible*  let  m  be  the  square  of  a  rational  fraction  p/q  in  its  lowest 
terms;  thus  pl  —  mq*  =  0.  There  always  exists  a  positive  integer  X  such  that 
Xa  <  m  <  (X  -I- 1)2;  we  then  have  \q  <  p  <  (X  +  1)  q. 

Now  let  us  consider  the  identity 

(mq  -  \p)*  -  m  (p  -  \q)*  =  (X2  -  TO)  (p1  -  mq*)  =  0. 

From  this  identity  it  follows  that  m  is  the  square  of  the  rational  number 
(mq  —  Xp)/(jo  —  \q\  of  which  the  denominator  is  less  than  q,  and  this  is  con- 
trary to  the  hypothesis  that  m  is  the  square  of  the  fraction  p/q  which  is  in  its 
lowest  terms.  It  thus  appears  that  there  exists  no  rational  number  of  which 
the  square  is  m. 

On  the  formal  side  of  Arithmetic,  a  demand  for  the  extension  of  the  domain 
of  number  arises  from  the  impossibility  of  carrying  out,  with  the  requisite 
generality,  certain  operations,  as  in  the  example  given  above.  Such  extensions 
of  the  domain  of  number  as  are  made  when  fractional,  negative,  irrational,  and 
complex  numbers  are  successively  adjoined  to  the  original  integral  numbers, 
are  made  in  accordance  with  a  principle  known  as  that  of  the  permanence  of 
forms,  which  was  first  indicated  by  Peacock  -f-,  and  further  developed  by  Hankel  J  • 
This  principle  may  be  stated  in  the  form  that,  in  order  to  generalize  the  con- 
ception of  number,  the  following  four  requisites  must  be  satisfied: 

(1)  Every  operation  which  is  represented  by  a  formal  expression  involving 
the  unextended  class  of  numbers,  and  which  does  not  result  in  the  representa- 
tion of  a  number  of  the  unextended  class,  must  have  a  meaning  assigned  to  it 
of  such  a  character  that  the  formal  expression  may  be  dealt  with  according  to 
the  same  rules  as  would  be  applicable  if  the  expression  represented  one  of  the 
unextended  class  of  numbers. 

(2)  An  extended  definition  of  number  must  be  given,  such  that  a  formal 
expression,  as  in  (1),  may  represent  a  number  in  the  extended  sense  of  the 
term. 

(3)  A  proof  must  be  given  that  for  numbers  of  the  extended  class  the 
same  formal  laws  of  operation  hold  as  for  the  unextended  class. 

(4)  Definitions  must  be  given  of  the  meaning  of  greater,  equal,  and  less, 
in  the  extended  domain  of  number,  these  terms  being  taken  in  the  ordinal 
sense. 

*  This  proof  is  given  by  Dedekind  in  his  tract  Stetiykeit  mid  irrationale  Zalilen.  An  extension 
of  Dedekind's  method  to  the  case  of  nth  roots  has  been  given  by  S.  M.  Jacob.  See  Proc.  Lund. 
Math.  Soc.  Ser.  2,  vol.  i,  p.  166. 

t  British  Association  Report  for  1834;  also  Symbolical  Algebra,  Cambridge,  1845. 

£  See  his  Theorie  der  komplexen  Zahlsysteme,  Leipzig,  1867. 
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The  arithmetical  theory  of  irrational  numbers  has  been  developed  in 
three  main  forms,  of  which  the  first*  was  given  by  Weierstrass  in  his  lectures 
on  Analytical  Functions;  the  second f  is  that  of  G.  Cantor,  which  was  de- 
veloped in  further  detail  by  Heine  J,  and  was  also  developed  independently 
by  Ch.  Meray§;  the  third,  that  of  R.  Dedekindj,  appeared  about  the  same 
time  as  that  of  Cantor.  We  shall  give  an  account  of  the  theories  of  Dedekind 
and  of  Cantor,  and  shall  shew  that  they  are  fundamentally  identical. 


KRONECKER  S  SCHEME  OF  ARITHMETIZAT1ON. 

18.  As  it  is  now  generally  understood,  the  term  "arithmetization"  is  used 
to  denote  the  movement  which  has  resulted  in  placing  analysis  on  a  basis  free 
from  all  notions  derived  from  the  idea  of  measurable  quantity,  the  fractional, 
negative,  and  irrational  numbers  being  so  defined  that  they  depend  ultimately 
upon  the  conception  of  integral  number.  An  extreme  theory  of  arithmetization 
has  however  been  advocated  by  Kroneckerll,  who  proposed  the  abolition  of 
all  modifications  and  extensions  of  the  conception  of  nurhber,  the  integral 
numbers  being  alone  retained.  His  ideal**  is  that  every  theorem  in  analysis 
shall  be  stated  as  a  relation  between  integral  numbers  only,  the  terminology 
involved  in  the  use  of  negative,  fractional,  and  irrational  numbers,  being 
entirely  removed.  This  ideal,  if  it  were  possible  to  attain  it,  would  amount 
to  a  reversal  of  the  actual  historical  course  which  the  science  has  pursued ; 
for  all  actual  progress  has  depended  upon  successive  generalizations  of  the 
notion  of  number,  although  these  generalizations  are  nowregarded  as  ultimately 
dependent  on  the  whole  number  for  their  foundation.  The  abandonment  of 
the  inestimable  advantages  of  the  formal  use  in  Analysis  of  the  extensions  of 
the  notion  of  number  could  only  be  characterised  as  a  species  of  Mathematical 
Nihilism. 

*  For  an  account  of  this  Theory  see  S.  Pincherle,  Giorn.  di  mat.,  vol.  xvni  (1880),  p.  185;  also 
0.  Btfrmann,  Thcorie  der  analytischen  Funktionen,  Leipzig,  1887,  p.  19. 

t  Math.  Annalen,  vol.  v  (1872);  see  also  Math.  Annalen,  vol.  xxi,  where  Cantor  discusses  all 
the  three  theories. 

J  C relic's  Journal,  vol.  LXXIV  (1872). 

§  Nouveau  Precis  d1  Analyse  infinite nimal-e,  Paris,  1872. 

||  Stetigkeit  und  irrationale  Zahlen,  Brunswick,  1872. 

U  See  C relic's  Journal,  vol.  ci,  "  Ueber  den  Zahlbegriff." 

**  He  writes  (loc.  cit.  p.  338)  :  "Und  ich  glaube  auch,  dass  es  dereinst  gelingen  wird,  den 
gesammten  Inhalt  aller  dieser  mathematischen  Discipline!!  zu  'arithmetisiren,'  d.  h.  einzig  und 
allein  auf  den  im  engsten  Sinne  genommenen  Zahlbegriff  zu  griinden,  also  die  Modificationen  und 
Erweiterungen  dieses  Begriffs  (ich  meine  hier  namentlich  die  Hinzunahme  der  irrationalen  sowie 
der  continuirlichen  Grossen)  wieder  abzustreifen,  welche  zumeist  durch  die  Anwendungen  auf  die 
Geometric  und  Mechanik  veranlasst  worden  sind."  He  proceeds  to  shew  in  detail,  how  the  notions 
of  negative,  fractional,  and  algebraical  numbers  can  be  avoided  by  substituting  for  equalities  in 
which  these  numbers  occur,  congruences  relative  to  certain  moduli  or  systems  of  moduli.  A  similar 
suggestion  has  been  made  by  Cauchy  with  reference  to  imaginary  numbers. 
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THE  DEDEKIND  THEORY  OF  IRRATIONAL  NUMBERS. 

19.  Let  us  consider  the  aggregate  of  all  the  rational  numbers  ordered  in 
the  manner  which  has  been  previously  discussed,  and  let  us  take  any  one 
such  number  N.  We  may  conceive  all  the  rational  numbers  to  be  divided 
into  two  classes  R},  and  R2,  such  that  every  number  of  J?,  is,  in  the  ordinal 
sense  of  the  term,  less  than  every  number  belonging  to  the  second  class  Hit 
the  two  classes  being  separated  by  the  number  N,  which  may  itself  be 
assigned  at  .choice  either  to  the  first  or  to  the  second  class.  If  N  belongs  to 
the  first  class,  it  is  the  greatest  number  in  that  class,  and  the  numbers  of  the 
second  class  have  no  number  which  is  less  than  all  the  others  of  that  class ; 
if  N  be  taken  to  belong  to  the  second  class,  it  is  the  least  number  in  that 
class,  and  there  exists  no  number  in  the  first  class  which  is  greater  than  all 
the  others;  for  if  any  rational  number  less  than  N  be  taken,  it  is  always 
possible  to  find  another  greater  one  which  is  less  than  N.  Such  a  division  of 
the  rational  numbers  into  two  classes  is  called  a  section  (Schnitt),  and  we 
therefore  say  that  corresponding  to  any  given  rational  number  there  eocists  a 
section  which  divides  the  aggregate  of  rational  numbers  into  two  classes,  such 
that  all  the  numbers  of  the  first  class  are  less  than  all  those  of  the  second  class; 
and  such  that  either  in  the  first  class  there  is  no  greatest  number,  or  else  in  the 
second  class  there  is  no  least  number. 

It  can  be  shewn  by  means  of  examples,  that  sections  of  the  aggregate 
of  rational  numbers  exist  which  are  different  in  character  from  those  just 
described.  If  m  is  a  positive  integer  which  is  not  a  square  number,  we  may 
conceive  the  rational  numbers  to  be  divided  into  two  classes,  the  first  of 
which  contains  all  the  negative  numbers  and  also  those  positive  numbers  of 
which  the  square  is  less  than  m,  including  zero;  the  second  class  contains  all 
the  positive  numbers  of  which  the  square  is  greater  than  m.  The  first  class 
contains  no  greatest  number,  and  the  second  class  contains  no  least  number ; 
this  section  is  said  to  be  related  to  an  irrational  number  ^m,  in  the  same 
way  as  a  section  such  as  has  been  considered  above  is  related  to  a  rational 
number.  This  example  shews  that  sections  of  the  rational  numbers  R  exist, 
such  that  R  is  divided  into  two  classes  R1}  R.2,  where  every  number  of  Rl  is 
less  than  every  number  of  Rz,  and  such  that  R}  contains  no  number  greater 
than  all  the  others,  and  also  R2  contains  no  number  less  than  all  the  others. 

A  new  aggregate  of  objects,  the  real  numbers,  may  now  be  defined  as 
follows : 

To  every  section  (Rl}  R2)  of  the  aggregate  R  of  rational  numbers,  such  that 
every  number  of  R  belongs  to  one  or  other  of  the  two  classes  R1}  R2,  and  every 
number  in  Rl  is  ordinally  less  than  every  number  in  R.2,  there  corresponds  a 
real  number. 
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In  case  neither  Rl  contains  a  number  which  is  ordinally  greater  than  all 
the  others  in  R1}  nor  Rz  contains  a  number  which  is  ordinally  less  than  all  the 
others  in  R2,  the  real  number  corresponding  to  the  section  is  said  to  be  an 
irrational  number. 

In  case  either  R^  has  a  greatest  number  x,  or  R.2  has  a  least  number  a-,  the 
section  is  said  to  define  a  real  number  corresponding  to  the  rational  number  x. 

The  real  number  which  corresponds  to  a  rational  number  x,  though 
conceptually  distinct  from  x,  has  no  properties  distinct  from  those  of  x,  and 
is  usually  denoted  by  the  same  symbol. 

The  definition  of  a  real  number  can  be  put  into  a  different  and  somewhat 
less  abstract  form,  by  employing  the  notion  of  a  lower  segment  of  the 
aggregate  of  rational  numbers.  A  lower  segment  of  the  aggregate  R  of 
rational  numbers  is  any  class  of  rational  numbers  which  contains  no  number 
greater  than  all  the  others,  and  such  that  if  any  number  whatever  of  the -class 
be  taken,  the  class  contains  all  those  numbers  of  R  which  are  less  than  that 
number.  A  lower  segment  of  R  is  identical  with  one  of  Dedekind's  classes  RI, 
in  case  jRx  contains  no  greatest  number. 

A  real  number  may  be  defined*  to  be  a  lower  segment  of  the  aggregate,  R  of 
rational  numbers;  and  thus  every  real  number,  whether  irrational  or  not,  is  a 
definite  class  of  rational  numbers. 

In  accordance  with  this  definition,  the  real  number  3,  for  example,  is 
defined  to  be  the  aggregate  of  all  rational  numbers  which  are  less  than  the 
rational  number  3 ;  the  irrational  number  V3  is  defined  as  the  aggregate  of 
all  rational  numbers  which  are  either  negative,  or  if  positive  have  their  squares 
less  than  3,  the  number  zero  being  also  included  in  the  aggregate. 

The  use,  here  adopted,  of  the  term  real  number,  is  sanctioned  by  general 
usage.  The  employment  of  the  term  real  has  originated  from  the  contrasting 
of  these  numbers,  not  with  rational  numbers,  but  with  complex  numbers.  The 
extension  of  the  term  Number  to  the  real  numbers  is  justified  by  the  fact 
that  is  is  possible  to  define  the  operations  of  addition,  multiplication,  &c.,  for 
real  numbers,  so  that  the  formal  laws  of  these  operations  are  in  agreement 
with  those  which  hold  for  operations  within  the  domain  of  the  rational 
numbers. 

20.  It  will  now  be  shewn  that  the  aggregate  of  real  numbers,  defined  in 
Dedekind's  manner,  can  be  so  ordered  that  every  real  number  has  a  definite 
rank  in  the  aggregate,  i.e.  of  any  two  real  numbers  it  is  determinate  which 
has  the  higher  and  which  the  lower  rank. 

*  This  form  of  the  definition  is  that  given  by  B.  Russell,  see  The  Principles  of  Mathematics, 
vol.  i,  chaps,  xxin  and  xxiv;  it  was  suggested  by  Peano,  see  Rivista  di  Matematica,  vol.  vi, 
pp.  126—140. 
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The  basis  of  the  scheme  of  order  being  taken  to  be  the  ordered  aggregate 
of  rational  numbers,  let  us  denote  by  n,  ri  any  two  real  numbers,  and  let 
the  sections  by  which  they  are  defined  be  denoted  by  (Rlt  R2),  (#,',  Rt') 
respectively. 

The  following  cases  may  arise  : 

(1)  If  (.Ri,  RI)  and  (Ri,  R^}  are  identical,  that  is,  if  every  number  in  JR, 
is  also  in  RI,  and  every  number  in  Rt  is  also  in  R2,  the  two  numbers  n,  n 

are  identical ;  thus  n  =  n. 

^  eW^  c 

(2)  Let  us  next  suppose  that  there  is  one  rational  number  r,  =  r2,  which 
is  contained  in  Rlt  but  not  in  jR/;  it  is  consequently  contained  in  R2'.   All 
the  numbers  in  J?/  are  less  than  r/,  and  hence  all  the  numbers  in  Rt'  are 
in  Rlf     Since  rl  is  the  only  number  in  RI  which  is  contained  in  R2,  it 
follows  that  rl  is  greater  than  all  the  other  numbers  in  R^ ;  and  thus  the 
number  n  defined  by  (Rlt  R.2)  is  a  number  corresponding  to  the  rational  number 
rl  or  r.2'.   All  the  elements  in  RI   are  contained  in  J£j ,  and  are  less  than  r/ ; 
all  the  numbers  in  R2   except  r2',  are  greater  than  r2',  for  if  not  they  would 
be  contained  in  R^.   hence  the  section  (R^,  R2)  defines  the  real  number 
ri  =  n,  corresponding  to  the  rational  number  r./^rv    The  two  sections  are 
essentially  identical,    the   only  difference   being  that  the  rational  number 
rl  =  7*.,',  is  regarded  as  belonging  to  the  first  class  in  one  section  and  to  the 
second  class  in  the  other  section. 

(3)  If  there  are  two  different  numbers  belonging  to  J^  which  also  belong 
to  R.J,  there  are  an  indefinite  number  of  other  numbers  which  have  the  same 
property,  since  an  unlimited  number  of  rational  numbers  can  be  found  which 
lie  between  two  given  rational  numbers.    In  this  case  we  define  the  number 
n  or  (Ri,  R.2),  to  be  greater,  in  the  ordinal  sense  of  the  term,  than  ri  or  (R^,  R2), 
agreeably  with  the  definition  already  given  for  the  rational  numbers. 

The  cases  in  which  one,  or  more  than  one,  number  which  belongs  to  RI 
also  belongs  to  R2,  may  be  treated  in  a  similar  manner;  thus  we  define  the 
meaning  of  the  relation  n  <  ri.  It  is  easily  seen  that  if  n  >  ri,  and  ri  >  ri', 
then  the  relation  n  >  n"  is  also  satisfied.  Thus  the  system  of  real  numbers 
is  arranged  in  a  regular  order,  such  that  those  of  them  which  correspond  to 
rational  numbers  have  the  same  relative  rank  as  the  corresponding  'rational 
numbers  have  in  the  aggregate  of  rational  numbers. 

21.    The  aggregate  of  real  numbers  has  the  following  properties  : 

(1)  If  a  >/3,  and  /3>y,  then  a>j. 

(2)  Between  any  two  real  numbers  «,  7  there  are  an  unlimited  number 
of  real  numbers.    This  is  easily  proved  from  the  corresponding  property  of 
rational  numbers,  by  considering  the  sections  which  define  the  numbers. 
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(3)  If  a  is  a  jixed  real  number,  then  all  real  numbers  may  be  divided  into 
two  classes  Rl}  J?2,  such  that  R^  contains  all  the  real  numbers  which  are  less 
than  a,  and  R.2  contains  all  the  real  numbers  which  are  greater  than  a.    The 
number  a  may  be  regarded  either  as  belonging  to  Rl}  in  which  case  it  is  the 
greatest  number  in  Rlt  or  else  as  belonging  to  R2,  in  which  case  it  is  the  least 
number  in  R2.    This  also  follows  from  the  definition  above. 

(4)  If  the  aggregate  of  real  numbers  falls  into  two  classes  Rlt  R2,  such 
that  every  number  of  Ex  is  less  than  every  number  of  R2,  then  there  exists 
one,  and  only  one,  number  by  which  this  section  is  produced. 

To  prove  this,  we  observe  that  the  section  (R1}  R.2)  of  the  aggregate  of 
real  numbers  also  defines  a  section  (R^R^)  of  the  aggregate  of  rational  numbers, 
such  that  all  rational  numbers  belonging  to  jRj  correspond  to  real  numbers 
which  belong  to  R1}  and  all  numbers  belonging  to  R2  correspond  to  real 
numbers  which  belong  to  R2. 

Let  N  be  the  real  number  defined  by  the  section  (R^,  R.^),  and  let  N'  be 
any  real  number  different  from  N,  defined  by  the  section  (RJ,  R^).  There 
are  an  indefinite  number  of  rational  numbers  n  which  belong  to  only  one  of 
the  aggregates  Rlt  R^ ;  let  n  be  the  real  number  corresponding  to  n.  If 
N'  <  N,  then  n  belongs  to  _R1}  and  therefore  n  belongs  to  R^,  and  since  N'<n, 
it  follows  that  N'  belongs  to  R^  Similarly,  if  N'  >  N,  we  can  shew  that  N' 
belongs  to  R2.  It  has  thus  been  shewn  that  every  number  different  from  N 
belongs  to  R1  or  to  R.2,  according  as  it  is  less  or  greater  than  N.  Thus  N  is 
either  the  greatest  number  in  7^  or  the  least  in  R.2>  and  therefore  N  is  the 
only  number  by  which  the  section  (jRj ,  _R2)  can  be  made. 

22.  The  operations  between  two  real  numbers  may,  in  accordance  with 
the  above  definition  of  real  numbers  by  means  of  sections,  be  so  defined  that 
the  result  of  each  operation  corresponds  to  a  section  of  the  rational  numbers ; 
thus  the  arithmetical  operations  are  reduced  to  operations  with  rational 
numbers. 

A  complete  theory  of  the  operations  involving  real  numbers  can  be 
established ;  and  the  formal  laws  of  the  operations  can  be  shewn  to  be  the 
same  as  in  the  case  of  the  rational  numbers,  the  range  of  possibility  of 
operations  being  greater  in  the  case  of  real  than  in  that  of  rational  numbers. 
This  theory  has  been  worked  out  to  some  extent  by  Dedekind :  but  as  the 
Cantor  theory  of  real  numbers  lends  itself  to  a  simpler  detailed  treatment  of 
the  operations  than  that  of  Dedekind,  and  as  it  will  appear  that  the  two 
theories  are  fundamentally  equivalent  to  one  another,  it  will  be  sufficient, 
as  an  example  of  the  general  method  of  treating  operations  in  accordance 
with  Dedekind's  theory,  to  take  only  the  case  of  the  addition  of  two  real 
numbers. 
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Let  a,  b  be  two  real  numbers  defined  by  means  of  the  sections  (Rt ,  RJ, 
(#,' ,  .&,')  respectively ;  then  the  sum  a  +  6,  of  a  and  b,  is  defined  by  means 
of  a  section  (R",  R.")  which  satisfies  the  following  conditions: — If  c,  is  any 
rational  number,  it  is  put  into  the  class  R",  provided  there  are  two  rational 
numbers  al  in  Rlt  and  6j  in  R^,  such  that  a,  +  6,  =  cl ;  all  rational  numbers  Cy 
for  which  this  is  not  the  case  fall  into  the  class  JR2".  It  is  clear  that  every 
number  cl  is  less  than  every  number  ca,  hence  the  section  (ft/',  Rz")  is  defined 
by  means  of  this  condition. 

It  can  be  shewn  that,  when  a,  b  both  correspond  to  rational  numbers,  this 
definition  is  in  agreement  with  the  ordinary  definition  of  the  sum  of  two 
rational  numbers,  so  that  the  sum  of  the  numbers  corresponds  to  the  sum  of 
the  corresponding  rational  numbers.  Every  number  Cj  in  -R/',  is  =  a  +  &> 
because  a,  ^  a,  b^  b,  and  therefore  a^  +  b^a  +  b.  Further,  if  there  were 
contained  in  R2"  a  number  ca  <  a  +  b,  so  that  a  +  b  =  ca  4-  p,  where  p  is  a  positive 
rational  number,  we  should  have  c2  =  (a  —  \p)  +  (b  —  £/)),  and  this  is  contrary 
to  the  definition  of  c2,  because  a  —  \p  belongs  to  Rlt  and  b—  \p  to  R{ ;  thus 
every  number  c.2  in  R2",  is  ^  a  +  b,  and  it  has  consequently  been  shewn  that 
(R",  R.2")  defines  the  number  a  +  b.  As  is  usual,  we  have  denoted  the  rational 
numbers  a,  b  and  the  conceptually  distinct  real  numbers  a,  b  by  the  same 
symbols.- 

THE  CANTOR  THEORY  OF  IRRATIONAL  NUMBERS. 

23.  The  Cantor  theory  of  irrational  numbers  essentially  depends  upon 
the  use  of  convergent  simply  infinite  ascending  aggregates,  or  convergent 
sequences  (Fundamentalreihen)  in  which  the  elements  are  rational  numbers ; 
we  therefore  proceed  to  define  and  discuss  these  aggregates. 

A  simply  infinite  ascending  aggregate  (alt  a,2,  «3,  ...  an,  ...),  in  which  each 
element  is  a  rational  number,  is  said  to  be  convergent,  if  it  is  such  that 
corresponding  to  any  fixed  arbitrarily  chosen  positive  rational  number  e,  as 
small,  in  the  ordinal  sense,  as  we  please,  a  number  n  can  be  found  such  that 


an  —  a, 


<e,for  m  =  1,  2,  3,  — 


The  symbol  x  \  is  here  used  to  denote  that  one  of  the  two  numbers  x,  —  x 
which  is  positive;  \x  is  said  to  be  the  absolute  value  of  #. 

This  definition  is  equivalent  to  the  statement  that,  in  a  simply  infinite 
convergent  aggregate,  an  element  can  always  be  found  whose  absolute 
difference  from  any  element  whatever  which  comes  after  it  is  as  small  as  we 
please. 

It  should  be  observed  that  the  terms  "as  small  as  we  please,"  or  "arbitrarily 
small,"  as  applied  to  a  positive  number  which  is  at  choice,  have  reference  to 
the  conception  of  order  only,  and  not  to  the  non-arithmetical  notion  of 
magnitude.  These  expressions  denote  only  that  the  number  can  be  so  chosen 
as  to  be  of  lower  rank  than  any  other  arbitrarily  chosen  positive  number. 
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To  each  value  of  e  there  corresponds  a  value  of  n,  which  will  in  general 
have  to  be  increased  when  e  is  made  smaller. 

We  may  denote  the  aggregate  by  the  symbol  {«„},  and  shall  speak  of  it 
shortly  as  a  convergent  sequence ;  that  it  is  simply  infinite  will  in  future  be 
understood. 

For  a  convergent  sequence,  corresponding  to  any  arbitrarily  chosen  positive 
number  e,  an  integer  n  can  be  found  such  that  from  and  after  that  value  of  n 
the  absolute  difference  of  any  two  elements  is  less  than  e. 

For  choose  n  so  that  an  —  an+m  <  \  e,  for  all  positive  integral  values  of  m ; 
then  I  an+m  —  an+m>  j  =  |  an  —  an+m  |  4-  j  an  —  an+m>  \  <  e. 

For  the  convergent  sequence  {an},  if  we  choose  n  such  that  |  an  —  an+m  <  e,. 
then  for  m  =  1,  2, 3, . . .,  the  value  of  an+m  for  all  values  of  m,  lies  between  an  +  e, 
and  an  —  e;  that  is  to  say,  from  and  after  some  value  of  n,  all  the  elements  lie 
between  two  rational  numbers  whose  difference  is  arbitrarily  small.  There 
exist  therefore  two  positive  numbers  a,  a',  of  which  the  smaller  a!  maybe  zero, 
such  that,  from  and  after  some  fixed  value  of  n,  all  the  elements  lie  in  absolute 
value  between  a  and  a. 

24.  If  the  aggregate  (a1}  a2,  ...  an, ...)  is  such  that,  from  and  after  some 
fixed  element,  each  element  is  not  greater  than  the  following  one,  and  if  all  the 
elements  are  less  than  some  fixed  number  N,  then  the  aggregate  is  a  convergent 
sequence. 

For  if  the  aggregate  is  not  convergent,  there  must  exist  some  positive 
number  8,  such  that  an  indefinite  number  of  increasing  values  n0,  n^,  n^,  ...  of 
n  can  be  found,  for  which  ani  —  ano  ,  |  a^  —  aMl  | ,  j  ans  —  anz  j . . .  are  all  ^  8. 
Since  ani  —  ano ,  anz  —  ani , ...  are  all  positive,  we  have  anf  =  ano  4-  r8,  where  r  can 
always  be  taken  so  large  that  ano  +  rS  >  N,  or  anr  >  N,  which  is  contrary  to  the 
hypothesis.  Hence  the  aggregate  is  convergent. 

It  may  in  a  similar  manner  be  shewn  that  the  aggregate  is  convergent  if, 
from  and  after  some  fixed  element,  each  element  is  not  less  than  the  following 
one,  and  if  all  the  elements  are  greater  than  some  fixed  number. 

An  aggregate  (aa,  az, ...  an, ...)  such  that  al  ^  a2^  a3^  ...  is  said  to  be  a 
monotone  non-diminishing  sequence.  If  all  the  elements  are  less  than  some 
fixed  number  TV,  the  sequence  is  convergent.  Similarly  if  al  =  az  =  as  =  ...  the 
aggregate  is  said  to  be  a  monotone  non-increasing  sequence.  It  is  convergent 
if  all  the  numbers  an  are  greater  than  some  fixed  number. 

//  {an},  [bn]  are  two  convergent  sequences  of  rational  numbers,  a  value  of  n 
can  be  found  corresponding  to  any  arbitrarily  assigned  number  e,  such  that  both 
i  a>n+m  —  O'n+m1 1  and  bn+m  —  bn+m>  are  less  than  e,  m  and  m-  having  all  positive 
values. 

For  we  have  only  to  choose  for  n  the  greater  of  the  two  values  correspond- 
ing to  e,  for  each  aggregate  separately. 
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25.    It  will  now  be  shewn  that  tfie  aggregates 

,  {«„-&»},  KM.          . 


in  which  the  elements  are  the  sum,  difference,  product,  and  quotient,  respectively 
of  the  corresponding  elements  of  the  two  convergent  sequences  {an},  {bn},  are  also 
convergent  sequences,  with  a  certain  restriction  in  the  last  case. 

We  have  |  (an  ±  &„)  -  (an+m  ±  bn+m)  \  £  \  an  -  an+m  \  +  \bn-  bn+m  \  ;  now  n  can 
be  so  chosen  for  a  given  e,  that  for  all  values  of  m,  \an  —  an+TO|<  £e,  and 
I  bn  -  bn+m  |  <  |  e,  hence  so  that  |  (an  ±  &„)  -  (an+m  ±  bn+m)  \  <  <•;  therefore  the 
aggregates  {an  +  bn},  {an  —  bn}  are  convergent. 

Again, 

I  anbn  ~  U>n+mbn+m    ~  \  an  (°n  —  bn+m)  +  bn+m  (an  —  an+m)  \ 

<a\bn-  bn+m  |  +  /9  1  an  -  an+m  \  , 

where  a,  /3  are  the  two  positive  numbers  which  are  such  that  |  an  \  <  a, 
|  bn+m  \  <  /3,  for  all  values  of  n  and  m. 

We  can  take  n  so  large  that  |  bn  —  bn+m  <  S,  \  an  —  an+m  <  8,  where  8  is 

at  our  choice,  and  may  be  taken  to  be      —  .    Hence,  for  this  value  of  n 

a  +  p 

|  anbn  —  an+mbn+m  \  <e,  for  every  value  of  m;  and  thus  {anbn}  has  been  shewn 
to  be  a  convergent  sequence. 


Lastly,  in  the  case  of    y,  we  shall  suppose  that  all  the  elements  of  {&„} 
(on) 

are  numerically  greater  than  some  fixed  positive  number  /3'. 
We  have  then 

#n  _  an+m  _  &n  (bn+m  ~  bn)  +  On  (an  —  an+m) 


hence 

«  I  bn  —  bn+m   +  p*  an  —  an+m 


Un        Un+m 

If  now  n  be  chosen  so  that  |  bn  —  bn+m  \,\an  —  an+m    are  both  less  than 

— — .  e,  for  every  value  of  m,  then,  for  such  a  value  of  n,  i  7-^  —  -, n+m  <e : 
a+P  :bn      bn+m 

therefore  \-.nl  is  a  convergent  sequence,  provided  \bn\  is,  for  all  values  of  n, 
( "n) 

greater  than  some  fixed  positive  number  p*',  which  may  be  as  small  as  we 
please,  but  must  not  be  zero. 

26.  The  essence  of  Cantor's  theory  consists  in  the  postulating  of  the 
existence  of  an  aggregate  of  objects  for  thought,  the  real  numbers,  ordered  in 
a  definite  manner,  which  manner  is  assigned  by  means  of  certain  prescribed 
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rules.  Any  element  of  the  aggregate  of  real  numbers  is  regarded  as  capable 
of  symbolical  representation  by  means  of  a  convergent  sequence  of  which  the 
elements  are  rational  numbers ;  and  the  mode  in  which  the  aggregate  of  real 
numbers  is  ordered  is  specified  by  means  of  formal  rules  relating  to  these 
convergent  sequences.  The  aggregate  of  real  numbers  contains  within  itself 
an  aggregate  of  objects  which  is  similar  to  the  ordered  aggregate  of  rational 
numbers  which  has  already  been  considered,  in  the  sense  that  to  each  rational 
number  there  corresponds  a  certain  real  number;  and  the  relative  order  of  any 
two  rational  numbers,  in  the  ordered  aggregate  of  rational  numbers,  is  the 
same  as  the  relative  order  of  the  two  corresponding  real  numbers  in  the  new 
aggregate  of  real  numbers.  The  rational  numbers  are  frequently  regarded  as 
identical  with  the  real  numbers  to  which  they  correspond,  and  are  denoted 
by  the  same  symbols.  In  the  development  of  Analysis,  this  identity  leads  to 
no  difficulties;  but,  in  the  fundamental  theory  of  the  aggregate  of  real  numbers, 
a  conceptual  distinction  between  rational  numbers  and  the  real  numbers  to 
which  they  correspond  must  be  made,  in  order  to  obviate  logical  difficulties, 
and  especially  with  a  view  to  coordinating  Cantor's  theory  with  that  of 
Dedekind.  Those  real  numbers  which  do  not  correspond  to  rational  numbers 
are  called  irrational  numbers ;  and  those  real  numbers  which  correspond  to 
rational  numbers  are  usually  spoken  of  as  themselves  rational  numbers. 

The  rules  by  which  the  order  of  the  real  numbers  in  their  aggregate  is 
assigned  are  the  following: 

(1)  Any  convergent  sequence  {an},  of  which  the  elements  are  rational 
numbers,  is  taken  to  represent  a  real  number,  which  we  may  denote  by  a. 
Two  such  aggregates  {an},  {bn}  are  taken  to  represent  the  same  real  number 
provided  they  satisfy  the  condition  that,  for  any  arbitrarily  chosen  positive 
rational  number  e,  a  value  of  n  can  be  found  such  that  j  an+m  —  bn+m  \  <  e,  for 
this  value  of  n,  and  for  all  values  0,  1,  2,  3,...  of  in.    Symbolically*,  we  have 
{an}  =  {bn\  under  the  condition  stated. 

(2)  The  real  number  represented  by  {an}  is  regarded  as  of  higher  rank,  or 
in  the  ordinal  sense  greater,  than  the  real  number  represented  by  {&„},  if  a 
value  of  n  can  be  found  such  that  an+m  —  bn+m  is,  for  this  value  of??,  and  for 
all  values  0,  1,  2,  3,...  of  m,  greater  than  some  fixed  positive  rational  number 
8.    If  n  can  be  so  determined  that  an+m  -  bn+m  is  negative  and  numerically 
greater  than  some  fixed  positive  rational  number  8,  for  every  value  of  m,  the 
number  represented  by  {an}  is  taken  to  be  less  than  that  represented  by  {&„}. 

The  aggregate  (x,  x,  x,...)  or  {*•},  in  which  all  the  elements  are  identical 
with  one  rational  number  x,  represents,  since  it  is  a  convergent  sequence,  a 

*  Those  who  hold  the  view,  advocated  by  Heine  and  others  (see  §  7,  note),  that  a  real  number 
is  identical  with  the  set  of  symbols  by  which  it  is  represented,  can  attach  no  direct  meaning  to  this 
equality.  It  can  only  be  taken  to  indicate  that  the  two  expressions  may  be  used  indifferently  in 
any  operation  which  involves  the  number. 
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real  number  which  corresponds  to  the  rational  number  x.  It  is  clear,  from  the 
definition  of  order  in  (2),  that  the  relative  order  of  any  two  rational  numbers, 
in  the  aggregate  of  rational  numbers,  is  the  same  as  that  of  the  real  numbers 
which  correspond  to  them,  in  the  aggregate  of  real  numbers.  The  aggregate 
of  rational  numbers,  and  that  of  the  real  numbers  which  correspond  to  them, 
are  similar  aggregates. 

Cantor's  theory  of  irrational  numbers,  in  the  form  in  which  it  was  presented 
by  himself  and  by  Heine,  has  been  criticized*  on  the  ground  that  an  assump- 
tion is  made  that  the  sequence  [x],  in  which  all  the  elements  are  the  same 
rational  number  x,  represents  the  rational  number  x  itself,  and  that  this 
amounts  to  an  assumption  that  x  is  the  limit  of  the  sequence  \x\ ;  whereas  the 
theory  of  arithmetical  limits  is  represented  by  Cantorf  as  deducible  from  his 
theory  of  irrational  numbers,  and  as  not  assumed  in  the  construction  of  the 
theory  itself.  The  theory  in  the  form  presented  above  is  not  open  to  this 
objection. 

It  can  be  shewn  that  any  two  convergent  sequences  {an},  {bn}  satisfy  one 
or  other  of  the  conditions  laid  down  in  the  above  definitions  of  equality  and 
inequality,  i.e.  symbolically  {an}  |  {bn}- 

For,  as  has  been  shewn  in  §  23,  corresponding  to  any  arbitrarily  chosen 
positive  rational  number  8,  a  value  of  n  can  be  found  such  that  an+m  lies 
between  an  +  8,  and  an  —  8,  and  such  that,  for  the  same  value  of  n,  bn+m  lies 
between  bn+  8,  and  bn—  8;  from  this  it  follows  that,  for  such  value  of  n, 
«n+m  —  bn+m  lies  between  an  -  bn  +  28  and  an  —  bn  —  28;  or  an+m  —  bn+m  differs 
from  an  —  bn  by  not  more  than  28.  If  corresponding  values  of  8  and  n  can  be 
found,  for  which  an  —  bn  +  28,  an  —  bn  —  28  have  the  same  sign,  then  an+m  —  bn+m 
has  the  same  sign  as  an  —  bn,  and  is  numerically  greater  than  a  fixed  number; 
the  condition  of  inequality  of  {«„},  {bn}  is  then  satisfied.  If  no  such  values  of 
8  and  n  can  be  found,  then  an+m  —  bn+m  is  numerically  less  than  48;  and  since 
8  is  arbitrarily  small,  the  condition  of  equality  of  {an},  {bn}  is  then  satisfied. 

Although  Cantor's  form  of  the  theory  of  irrational  numbers,  or  rather  of 
real  numbers,  is  more  convenient  for  detailed  development  than  is  Dedekind's 
form,  yet  it  lies  under  the  disadvantage  that  the  nature  of  any  single  real 
number  is  veiled  by  the  fact  that,  although  it  is  a  unique  object,  it  is  capable 
of  representation  by  an  unlimited  number  of  convergent  sequences,  and  there- 
fore that  the  formal  character  of  the  theory  does  not  make  it  clear  what  such 
a  number  really  is.  The  comparison  between  the  two  theories  which  J  will  be 
given  later  on  will  throw  light  upon  this  point:  for  it  will  be  shewn  that  a 
convergent  sequence  of  the  rational  numbers  is  sufficient  to  define  a  section, 

*  See  B.  Russell,  The  Principles  of  Mathematics,  vol.  i,  p.  285. 
t  See  Math.  Annalen,  vol.  xxi,  p.  568. 

£  In  Tannery's  work  Introduction  a  la  thdorie  den  fonctions  d'une  variable,  chap,  i,  the  theory 
of  irrationals  is  treated  by  a  combination  of  the  two  methods  of  Cantor  and  Dedekind. 
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of  the  kind  fundamental  in  Dedekind's  theory;  and  this,  as  we  have  seen,  is 
equivalent  to  the  definition  of  a  lower  segment,  which  is  itself  a  certain  definite 
class  of  rational  numbers. 

27.  The  sum  a  +  b,  of  two  real  numbers  represented  by  the  sequences  [an], 
{bn},  is  defined  to  be  the  real  number  represented  by  the  sequence  {an  +  bn};  and 
the  difference  a  —  b  is  defined  as  a  number  represented  by  {an  —  bn\. 

It  has  been  shewn  in  §  25  that  the  two  sequences  {an  +  bn],  {a.n  —  bn]  are 
convergent. 

If  {an},  {bn}  represent  the  same  number,  the  sequence  {an  —  bn]  defines  the 
real  number  zero;  for  the  condition  that  |  an+m  —  bn+m  \  <  e,  where  e  is  arbi- 
trarily small,  for  a  sufficiently  great  value  of  n,  and  for  m  =  0,  1,  2,  3, ...,  is  in 
this  case  satisfied. 

The  product  ab,  of  two  real  numbers,  is  defined  to  be  the  number  represented 
by  the  sequence  {anbn},  which  has  been  shewn  in  §  25  to  be  convergent. 

The  quotient  a/b  is  defined  to  be  the  number  represented  by  the  convergent 

faj 

sequence  \-r~\. 
(t>n) 

The  only  restriction  on  this  last  definition  is  that  b  is  not  to  be  zero ;  for, 
when  this  condition  is  satisfied,  the  elements  of  the  sequence  {&„},  which  repre- 
sents b,  can  be  so  chosen  as  to  satisfy  the  restrictive  condition  given  in  §  25, 

that  \j~\  may  be  convergent. 

(On) 

It  is  necessary  to  shew  that  the  sum  a  +  b,  the  difference  a  —  b,  the  product 
ab,  and  the  quotient  T  ,  of  two  numbers  a,  b,  as  they  have  been  defined  above, 

are  definite  numbers  independent  of  the  particular  convergent  sequences  used 
to  represent  the  numbers  a  and  6.  Thus  it  must  be  shewn  that  if  \an]  =  {an'\, 
{bn}  =  {bn'},  then 

{an  +  bn]  =  {an'  +  b^},    {an  -  bn]  =  {an'  -  bn'},    {anbn}  =  {an'bn'}, 

an]       (a* 


and  -\  j  -  r  —  -\  ,  ,(  • 

(bn)       (bn) 

We  have 

I  (an_|_m  +  On_)-m)       (ft  n+m  i  "  n+m)  \  ==  I  &n+m       a  n+m  |  ~r  |  On+m       ^  n+m  \  • 

Now  n  can  be  so  chosen,  corresponding  to  a  fixed  number  e,  that 


an+m-an+m 


bn+m  -  b'n+m  I  <^€,  for  m  =  0,  1,  2,  3,  ...; 


with  this  value  of  n,  we  now  have  the  condition 

I  (fl>n+m  i  un+m)  —  (.a  n+m  i  ^  n+m)     <  e> 

satisfied;  and  this  is  the  condition  that  [an  ±  bn}  represents  the  same  number 
as  fan'  ±  6»'}. 
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Again, 

n+m  |  ^  |  &n+m  (vn+m  ~~  0  n+m)  |  +  |  0  n+m  (#«+»«  —  ft  n+m)  | 


<  4  |  &n+m  -  b'n+m  \+B\  an+m  -  a'n+m  \, 

where  A,  B  are  fixed  positive  numbers.  It  is  now  clear  that  n  may  be  so 
chosen  that  |  an+mbn+m  —  a'n+mb'n+m  \  <  v),  where  rj  is  an  arbitrarily  chosen 
positive  number;  thus  (an6nj,  {an'bn'}  represent  the  same  number. 

Again, 


a 
V, 


n+m 


"n+m         /;  i/          \ 

jT~"     \On+m  ~  v  n+mj 


(an+m  —  &  n+m) 


i+mv  n+m 

whence  it  can  easily  be  seen  that  the  condition  is  satisfied  that 

and 


represent  the  same  number. 

It  is  readily  seen  that  the  same  commutative,  associative,  and  distributive 
laws  hold  for  the  operations  between  real  numbers  as  for  those  involving 
rational  numbers. 

28.  If,  from  and  after  some  fixed  element  an,  all  the  elements  of  \an}  are 
greater  than  some  fixed  positive  rational  number  £>,  then  the  real  number  repre- 
sented by  {an}  is  positive,  i.e.  it  is  ordinally  greater  than  zero. 

For,  if  we  take  any  convergent  sequence  {bn}  which  defines  the  number 
zero,  we  have  {an}  >  {bn} ;  because,  for  some  fixed  value  of  n,  an+m  —  bn+m  is 
certainly  positive  for  all  values  of  m,  and  is  greater  than  a  fixed  positive 
number ;  since  n  can  be  taken  so  large  that  an+m  >  B,  and  bn+m  <  B',  where  B' 
is  a  positive  rational  number  chosen  less  than  B. 

Similarly,  it  may  be  shewn  that  the  number  defined  by  {an}  is  negative,  if, 
from  and  after  some  fixed  value  of  n,  all  the  an  are  negative  and  numerically 
greater  than  some  fixed  positive  rational  number  B. 

The  term  "numerically  greater"  denotes  that  an  \  >  B  and  thus  refers  to. 
the  absolute  values  of  the  numbers  concerned. 

It  is  easily  seen  that,  unless  {an}  is  such  that,  from  and  after  some  fixed 
value  of  n,  all  the  elements  have  the  same  sign,  then  {an}  must  represent  the 
number  zero. 

if  [un],  [bn]  define  two  different  real  numbers  a,  b,  then  there  lie  between 
a,  b  an  unlimited  number  of  those  real  numbers  which  correspond  to  rational 
numbers. 

Suppose  a  >  b,  then  there  exist  a  definite  rational  positive  number  B,  and  a 
least  integer  n,  such  that,  for  all  positive  integral  values  of  m  including  zero, 
«n+»»  —  bn+m  ^  8,  \  an  —  an+m  |  <  e,  |  bn  —  bn+m  \  <  e,  where  e  is  a  rational  number 
chosen  to  be  <  %B.  If  we  take  any  rational  number  x,  which  is  <  B,  and  >  e, 

u-  3 
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the  number  [an  —  x},  in  which  all  the  elements  are  identical,  lies  between 
{«„}  and  {&„},  since,  for  every  value  of  m,  we  have  an+m  —  (an  —  x}  >  x  —  e; 
therefore  a  is  greater  than  the  real  number  which  corresponds  to  an  —  x. 
Again,  (an  —  x)  —  bn+m  =  (an  —  bn)  +  (bn  —  bn+m)  —  <K  >  &  —  e  -  #,  therefore,  pro- 
vided x  is  chosen  to  be  <  8  —  e,  the  real  number  which  corresponds  to  an  —  a;  is 
greater  than  b,  and  thus  lies  between  a  and  b.  The  rational  number  an  —  x 
may  be  chosen  in  an  unlimited  number  of  ways,  since  x  is  any  rational  number 
whatever  which  lies  between  8  —  e  and  e.  To  obtain  one  such  number  we  may 
take  e  =  ^  8,  x  =  ^  8. 

-  *i "      a 

3 
CONVERGENT  SEQUENCES  OF  REAL  NUMBERS. 

29.  Convergent  sequences  will  now  be  considered,  of  which  the  elements 
are  real  numbers.  It  might  at  first  sight  be  imagined  that  we  should  be  led, 
by  the  employment  of  such  sequences,  to  a  further  extension  of  the  domain  of 
number;  it  will  however  be  seen  that  this  is  not  the  case. 

The  definition  of  a  convergent  sequence  of  real  numbers  is  precisely  similar 
to  the  definition  which  has  been  given  in  the  case  of  sequences  of  rational 
numbers;  thus  (a1}  «2,  ...  an,  ...)  is  a  convergent  sequence  of  real  numbers, 
provided  that,  corresponding  to  each  arbitrarily  chosen  positive  real  number  77, 
a  value  of  n  can  be  found  such  that  j  an  —  an+m  |  <  17,  for  m  =  l,  2,  3,  —  If  we 
conceive  that  each  such  convergent  sequence  of  real  numbers  represents  a 
single  ideal  object,  and  if  we  give  definitions  of  equality  and  inequality,  and 
of  the  fundamental  operations,  precisely  analogous  to  those  given  in  §  26  and 
§  27,  and  assume  as  before  that  a  convergent  sequence  in  which  all  the  elements 
are  identical  with  the  real  number  a  is  taken  to  represent  that  one  of  the  new 
aggregate  of  objects  which  corresponds  to  a,  it  will  be  shewn  that  the  new 
aggregate  of  objects  is  similar  to  the  aggregate  of  real  numbers,  i.e.  to  each  of 
the  new  objects  there  corresponds  one  of  the  real  numbers,  and  also  that  the 
relation  of  order  between  corresponding  pairs  of  elements  in  the  two  aggregates 
is  the  same.  It  thus  appears  that  the  aggregate  of  new  objects  is  practically 
identical  with  the  aggregate  of  real  numbers,  since  the  two  are  ordinally 
similar.  No  such  relation  has  been  shewn  to  exist  between  the  aggregate  of 
real  numbers  and  that  of  rational  numbers  ;  and  it  will  be  shewn  later,  in  con- 
nection with  the  general  theory  of  order-types,  that  no  such  relation  of  simi- 
larity can  exist.  Therefore  the  passage  from  rational  numbers  to  real  numbers 
involves  a  real  extension  of  the  domain  of  number ;  but  the  passage  from  real 
numbers  to  an  aggregate  of  objects  represented,  in  accordance  with  the  rules 
referred  to  above,  by  convergent  sequences  of  real  numbers,  does  not  lead  to 
any  essential  extension  of  the  domain  of  number. 

Let  (an|  be  any  convergent  sequence  of  rational  numbers,  and  let  {an} 
denote  the  sequence  of  those  real  numbers  which  correspond  to  the  rational 
numbers  which  form  the  elements  of  {a,,}.  It  can  easily  be  shewn  that  {«„} 
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is  a  convergent  sequence:  for,  if  e  is  an  arbitrarily  chosen  positive  rational 
number,  and  6  the  corresponding  real  number,  the  condition  of  convergence 
of  {aw}  is  that,  for  every  e,  a  value  of  n  can  be  found  such  that  an+m  lies 
between  an  -f  e,  an  -  e,  for  m  =  1,  2,  3,  ...  .  It  follows  from  this  that  an+w  lies 
between  an  +  e,  a,,  —  e  ;  and  this  ensures  the  convergence  of  the  sequence  {an}. 
Conversely,  we  see  that  if  {an}  is  convergent,  so  also  is  {an}. 

Next,  let  {an}  be  a  convergent  sequence  of  real  numbers;  then,  between  v  , 
a,,  and  otn+i,  a  real  number  an  can  be  found  which  corresponds  to  a  rational 
number  an,  determined  as  at  the  end  of  §  28.    Conceive  this  to  be  done  for  every  «,«  r, 
pair  of  consecutive  elements  in  {«„},  and  let  us  consider  the  sequences  |an},  {an}. 

Since  an  -  an+m  =  (an  -  an)  +  («„  -  an+m)  +  (an+m  -  an+m),  we  have 


=  !  an  ~  &n    ~^~  \  ^n  ~  an+m    "+"    Bn+wi 

Now,  corresponding  to  any  real  positive  number  B,  n  may  be  so  chosen  that 
for  every  value  of  m,  \an  —  an  ,  \an  —  an+m  \  ,  \  on+m  —  an+m  |  are  each  less 
than  ^S  ;  hence,  for  such  a  value  of  n,  ~  VST  e  A/f  /  S 

|  dn  -  ttn+m  |  <  8,  for  W  =  1,  2,  3,  .  .  .  ,  Ct-OSC^ 

thus  the  sequence  {«„}  is  convergent. 

Again,       {o^}  -  {an}  =  {an  -  an],  and  !  otn  -  an  |  <    an-an+l\, 

and,  since  {«„}  is  convergent,  n  may  be  chosen  so  great  that,  for  that  and  all 
higher  values  of  n,  all  the  differences  an  —  an+1  \  are  less  than  an  arbitrarily 
fixed  number,  hence  an  —  an  \  satisfies  the  same  condition  J  and  therefore  the 
two  convergent  sequences  {an},  {dn}  satisfy  the  condition  of  equality;  or  they 
represent  the  same  one  of  the  new  objects.  It  has  been  shewn  above  that 
since  [an\  is  convergent,  so  also  is  {an}.  Now  {an}  corresponds  to  a  single  real 
number  a;  therefore,  to  any  convergent  sequence  («„},  of  which  the  elements 
are  real  numbers,  there  corresponds  a  real  number  a. 

We  have  further  to  shew  that,  if  {an},  {ftn}  are  two  convergent  sequences 
of  real  numbers,  and  a,  b  the  corresponding  real  numbers  as  just  determined, 
then  a  =  6,  according  as  {an\  =  {/3nj. 

We  know  that  a  =  b,  according  as  [an]  =  {&„},  where  {bn}  denotes  the  se- 
quence of  rational  numbers  which  defines  b,  in  the  same  way  as  [an]  defines  a. 
Now  }an}  -  {/3n}  =  K  ~  £n}>  and  an  -  @n  =  (an  -  an)  +  (bn  -  /3n)  +  (an  -  bn);  and 
we  can  choose  n  so  large  that  |  an  —  dn  \  and  |  6n  —  @n  are  each  less  than  |  rj, 
where  r)  is  an  arbitrarily  chosen  real  positive  number:  therefore  we  see  that 
«n~  ftn  lies  between  an—  l)n+  rj  and  an  —  6,1—77.  It  follows  easily  that  |OM}  =  >^n}i 
according  as  [dn\  =  \bn],  or  according  as  {an}  =  {bn},  and  hence,  as  shewn  above, 
according  as  a  =  6.  It  has  now  been  shewn  that  the  objects  which  arc  re- 
presented by  convergent  sequences  of  real  numbers  have  the  same  ordinal 
relation  to  one  another  as  the  real  numbers  to  which  those  sequences  have 
been  shewn  to  correspond. 

3—2 
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It  appears,  from  what  has  now  been  proved,  that,  to  every  convergent  se- 
quence of  real  numbers  there  corresponds  a  real  number  which  may  be  taken  to 
be  defined  by  means  of  that  sequence. 

There  does  not  necessarily  exist  any  rational  number  which  corresponds  in 
the  same  sense  to  a  convergent  sequence  of  rational  numbers.  The  property 
of  the  aggregate  of  real  numbers  here  stated  embodies  the  characteristic 
difference  between  that  aggregate  and  the  aggregate  of  rational  numbers  ;  for 
the  latter  does  not  possess  the  corresponding  property.  It  is  this  property  of 
the  aggregate  of  real  numbers  which  makes  it  suitable  to  be  the  field  of  the 
real  variable  in  the  Theory  of  Functions. 


THE  ARITHMETICAL  THEORY  OF  LIMITS. 

30.  If  a-1>  x2,  xs,  ...  xn,  ...  is  a  sequence  of  real  numbers  such  that  a  number 
x  exists  which  has  the  property  that,  corresponding  to  any  arbitrarily  chosen 
positive  number  e,  a  value  of  n  can  be  found  such  that  x  —  xn,  \x  —  xn+i\, 
I  x  —  acn+2  ,  ...  are  all  less  tJian  e,  then  the  number  a;  is  said  to  be  the  limit  of  the 
sequence  as1}  x2,  ...  xn  .....  This  fact  may  be  denoted  by  the  equation  x  =  \i 


This  definition  is  known  as  the  arithmetical  definition  of  a  limit,  and  was 
first  given*,  in  a  form  substantially  identical  with  the  above,  by  John  Wallis. 

It  will  be  observed  that  the  above  definition  contains  no  assertion  as  to 
the  necessary  existence  of  a  limit  of  a  sequence  of  numbers,  but  contains  only 
a  statement  as  to  the  relation  of  the  limit  to  the  numbers  of  the  sequence,  in 
case  that  limit  exists. 

There  cannot  be  two  numbers  which  both  satisfy  the  condition  of  being 
a  limit  of  the  same  sequence.  For,  if  possible,  let  x,  x'  be  two  such  numbers 
and  let  x  —  x  =  8.  Choose  a  value  of  e,  less  than  \  8  ;  then  numbers  n,  n  can 
be  found  such  that  x  -  xn+m  ,  j  x  —  xn>+m  \  ,  for  all  values  0,  1,  2,  3,  ...  of  m, 
are  less  than  e.  Suppose  n  >  n,  then  j  x  —  xn  \  and  j  x  —  xn  \  are  both  less  than 
e  ;  hence  x  —  x  <  2e  <  8,  which  is  contrary  to  the  condition  x  —  x  \  =  8. 

It  will  now  be  shewn  that,  if  the  numbers  of  the  sequence  \xn\  are  real 
numbers,  and  if  the  sequence  is  a  convergent  one,  then  the  real  number  x 
defined  in  the  manner  explained  in  §  2!),  by  the  sequence  [xn],  is  the  limit  of 
the  sequence. 

For  the  two  sequences  [xn],  {x}  both  define  the  same  number  x,  and 
therefore  s.itisfy  the  condition  of  equality,  which  is  that  j  x  —  xn+m  \  <  e,  for  any 
arbitrarily  chosen  e,  provided  n  be  sufficiently  great  ;  and  this  is  the  condition 
that  x  should  be  the  limit  of  the  sequence  \xn}.  A  sequence  of  real  numbers 
which  has  a  limit  must  be  convergent.  For,  if  x  is  the  limit  of  [xn],  then  for  a 

*  Arithmetica  Infinitorum  (1655),  Prop.  43,  Lemma.     See  M.  Cantor's  Geschichte  der  Mathe- 
matik,  vol.  n,  p.  823. 
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sufficiently  large  value  of  n,  \x  —  xn\,  \x  —  aSn+\\,  •••  \x  —  xn+m\,  ...  are  all  less 
than  |e,  where  e  is  arbitrarily  chosen  ;  now  \  xn  —  Xn+m  \  £  \  x  —  xn  \  +  \  se  —  xn+m  \  \ 
hence  |  xn  -  xn+m  \  <  *>  which  is  the  condition  of  convergence  of  {#„}. 

As  the  complete  result  we  have  now  the  theorem  known  as  the  General 
Principle  of  Convergence*: 

The  necessary  and  sufficient  condition  that  a  sequence  xltxz,  ...  xn,  ...  of  real 
numbers  may  have  a  limit,  is  that,  corresponding  to  every  arbitrarily  chosen 
positive  number  e,  a  value  of  n  can  be  found  such  that  xn  —  xn+l,  xn  —  Xn+z, 
xn  —  xn+3,  ...  shall  be  all  numerically  less  than  e. 

This  theorem,  which  contains  the  criterion  for  the  existence  of  a  limit,  as 
defined  in  accordance  with  the  arithmetical  definition  of  a  limit,  is  a  deduction 
from  Cantor's  theory  of  real  numbers. 

31.  If  the  numbers  of  a  sequence  [xn]  are  rational  numbers,  instead  of  real 
numbers,  the  definition  of  the  limit  is  applicable,  and  it  is  a  necessary,  but  not 
a  sufficient,  condition  for  the  existence  of  the  limit,  that  the  sequence  should 
be  convergent.   Strictly  speaking,  jf  a  convergent  sequence  of  rational  numbers 
has  a  limit,  that  limit  is  also  a  rational  number ;  but  from  the  existence  of 
convergent  sequences  of  rational  numbers  which  have  no  limit  there  arises  the 
necessity  for  the  extension  of  the  domain  of  number,  so  that  in  the  extended 
domain  every  convergent  sequence  may  have  a  limit ;  this  extension  has  been 
carried  out  by  substituting  Real  Number  for  Rational  Number.    However, 
although  a  convergent  sequence  of  rational  numbers  which  has  no  rational 
limit,  has  in  this  strict  sense  no  limit  at  all,  by  reason  of  the  convergent  sequence 
of  those  real  numbers  which  correspond  to  the  rational  numbers  having  an 
irrational  number  as  limit,  and  since,  as  has  been  seen  above,  these  real  numbers 
are  for  practical  purposes  not  distinguished  from  the  rational  numbers  to  which 
they  correspond,  it  is  usual  to  consider  this  irrational  number  to  be  the  limit 
of  the  sequence  of  rational  numbers.  We  may  thus  assert  that  any  convergent 
sequence  of  rational  numbers  which  has  not  a  rational  number  as  limit  has  an 
irrational  number  as  its  limit.    This  assertion  is  a  correct  one  for  the  practical 
purposes  of  Mathematical  Analysis. 

32.  The  method  of  limits,  which  is  essential  both  to  pure  Analysis  and  to 
the  applications  of  Analysis  in  Geometry  and  in  Kinetics,  had  a  geometrical 
origin  in  the  Method  of  Exhaustions,  which  was  applied  by  the  Greek  geometers 
to  determine  lengths,  areas,  and  volumes,  in  simple  cases.    This  method,  sup- 
plemented by  the  notion  of  the  numerically  infinite,  was  developed  in  later 
times,  in  various  forms,  into  a  general  method  which  formed  the  basis  of  the 
Infinitesimal  Calculus.    The  traditional  geometrical  conception  of  a  limit  may 
be  exemplified  by  the  case  of  the  determination  of  the  length  of  a  curve  as  the 

*  This  term  "das  allgemeine  Cqnvergenzprinzip  "  is  due  to  P.  Du  Bois-Reymond  ;   see  his 
Allyemeine  Functionenthebrie. 
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limit  of  a  sequence  of  properly  chosen  inscribed  polygons.  The  lengths  of  the 
perimeters  of  the  polygons  are  regarded  as  continually  approaching  the  re- 
quired length  of  the  curve,  whilst  the  number  of  sides  of  the  polygons  is  con- 
tinually increased.  The  limit,  the  length  of  the  curve,  is  then  regarded  as 
actually  reached  at  the  end  of  a  process  described  as  making  the  number  of 
sides  of  the  polygon  infinite ;  this  mode  of  attainment  of  the  limit  being  however 
inaccessible  to  the  sensuous  imagination,  and  disguising  an  actual  qualitative 
change  of  a  geometrical  figure,  which  possesses  corners  and  is  bounded  by 
segments  of  straight  lines,  into  one  which  has  no  corners  and  has  a  curvilinear 
boundary.  No  doubt  was  felt  as  to  the  existence  of  the  limit,  which  was  re- 
garded as  obvious  from  geometrical  intuition.  That  a  curve  possesses  a  length, 
or  an  area,  was  considered  to  require  no  proof.  The  first  mathematician  who 
recognized  the  necessity  for  a  proof  of  the  existence  of  a  limit  was  Cauchy, 
who  gave  a  proof  of  the  existence  of  the  integral  of  a  continuous  function. 
That  the  logical  basis  of  the  traditional  method  of  limits  is  defective  has  in 
recent  times  received  a  posteriori  confirmation  by  the  exhibition  of  continuous 
functions  which  possess  no  differential  coefficient,  and  by  many  other  cases  of 
exception  to  what  were  regarded  as  ordinary  results  of  analysis  resting  on  the 
method  of  limits,  which  have  been  brought  to  light  by  those  mathematicians 
who  have  been  engaged  in  examining  the  foundations  of  analysis. 

The  arithmetical  theory  of  limits,  which  is  summed  up  in  the  general 
principle  of  Convergence,  provides  a  definite  criterion  for  the  existence  of  the 
limit  of  a  sequence  of  numbers ;  and  a  considerable  part  of  modern  analysis 
is  concerned  with  obtaining  special  forms  of  the  general  criterion  adapted  for 
use  in  special  classes  of  cases.  The  theory  is  essentially  dependent  upon  the 
theory  of  irrational  numbers ;  for,  in  default  of  an  arithmetical  theory  of  ir- 
rational numbers,  all  attempts  to  prove*  the  existence  of  a  limit  of  a  convergent 
sequence  are  doomed  to  inevitable  failure;  and  this  for  the  simple  reason  that 
a  convergent  sequence  of  rational  numbers  does  not  necessarily  possess  a  limit 
which  is  within  the  domain  of  such  numbers.  The  definition  of  real  numbers 
by  means  of  convergent  sequences  of  rational  numbers  is  not  a  mere  postulation 
of  the  existence  of  limits  to  such  sequences;  it  involves  rather  the  introduction 
of  an  enlarged  conception  of  number,  of  such  a  character  that  the  scheme  of 
ordered  real  numbers  should  form  a  consistent  whole,  and  such  that  every 
convergent  sequence  of  numbers  in  the  domain  of  real  number  necessarily  has 
a  limit  within  that  domain.  The  postulation  of  the  existence  of  the  aggregate 
of  real  numbers  is  justified  by  shewing  that  a  complete  scheme  of  definitions 
and  postulates  can  be  set  up  for  the  elements  of  this  aggregate,  and  that  such 
a  scheme  does  not  lead  to  contradictionf.  As  regards  the  existence  of  limits 

*  An  interesting  discussion  of  various  methods  which  have  been  suggested  of  proving  the 
existence  of  a  limit  will  be  found  in  Du  Bois-Reymond's  Allgemcine  Functionentheorie. 

t  On  this  mode  of  regarding  the  aggregate  of  real  numbers  as  dependent  upon  a  complete 
consistent  scheme  of  definitions  and  axioms,  see  Hilbert,  "  Ueber  den  Zahlbegriff,"  Jalirexber.  d. 
deutsch.  math.  Vereinigung,  vol.  vin  (1900). 
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in  the  case  of  lengths,  areas,  volumes,  &c.,  referred  to  above,  the  order  of  pro- 
cedure is  a  reversal  of  the  traditional  one,  the  existence  of  the  limit  being  no 
longer  inferred  from  geometrical  intuition.  For  example,  in  the  case  of  the 
determination  of  the  length  of  a  curve,  that  length  is  not  assumed  to  be 
independently  known  to  exist,  but  is  defined  as  the  arithmetical  limit  of  the 
sequence  of  numbers  which  represent  the  perimeters  of  a  suitable  sequence 
of  inscribed  polygons.  When  this  sequence  is  convergent,  and  its  limit  is 
independent  of  the  particular  choice  of  the  polygons,  subject  to  suitable  re- 
strictions, then  the  limit  so  obtained  determines  the  length  of  the  curve.  In 
case  no  such  limit  exists,  the  curve  is  regarded  as  not  having  a  length. 

EQUIVALENCE  OF  THE  DEFINITIONS  OF  DEDEKIND  AND  CANTOR. 

33.  In  order  to  establish  the  equivalence  of  the  definitions  of  irrational 
numbers,  as  given  by  Dedekind  and  by  Cantor,  it  must  be  shewn  that  every 
convergent  sequence  of  rational  numbers  defines  uniquely  a  section  of  all 
the  rational  numbers,  and  that  this  section  is  the  same  for  all  convergent 
sequences  which  represent  the  same  real  number  in  accordance  with  rule  (1) 
in  §  26.  Conversely,  it  must  be  shewn  that  any  number  defined  by  a  section 
can  also  be  represented  by  a  convergent  sequence  of  rational  numbers. 

To  shew  that,  corresponding  to  the  convergent  sequence  [xn]  which,  in 
accordance  with  the  Cantor  theory,  defines  the  real  number  x,  a  section  can 
be  found : — Let  r  be  any  rational  number,  and  let  r  be  the  corresponding  real 
number  represented  by  {r}.  The  number  x  —  r  is  represented  by  [xn  —  r}; 
and  if  this  number  is  not  zero,  then  (see  §  28),  from  and  after  some  fixed  value 
of  n,  xn  —  r  has  a  fixed  sign,  positive  or  negative  according  to  the  value  of  r. 
A  section  of  the  rational  numbers  may  now  be  defined  as  follows : — Let  every 
number  r  such  that  xn  —  r  is  negative,  from  and  after  some  fixed  value  of  n, 
be  placed  in  the  class  R2 ;  and  let  every  number  for  which  xn  —  r  is  positive, 
from  and  after  some  fixed  value  of  n,  be  placed  in  the  class  Rj  .  If  there  exists 
a  rational  number  r,  such  that  neither  of  these  cases  arises,  then  x=r,  and 
r  may  be  put  into  either  of  the  classes  R1}  R.2.  It  has  thus  been  shewn  that 
a  section  of  the  rational  numbers  can  be  determined,  corresponding  to  the 
convergent  sequence  {xn}. 

Next,  let  \Xn\  be  any  other  convergent  sequence  which  represents  the 
same  real  number  x,  as  {xn}  does.  We  have  to  shew  that  the  section  of  the 
rational  numbers  which  corresponds  to  \xn'}  is  identical  with  that  which 
corresponds  to  {.rn}.  If,  as  before,  r  denote  any  rational  number,  we  have 
{xn  —  r\  =  {asn—r}.  Now  a  value  of  n  can  be  found,  from  and  after  which, 
xn  —  r  and  xn'  —  r  both  have  fixed  signs  independent  of  n,  and  they  must  have 
the  same  sign.  It  follows  that  a  number  r  which  belongs  to  the  class  Rlt 
must  also  belong  to  the  class  R^',  by  which  the  section  corresponding  to  •&•„'} 
is  defined ;  and  also  a  number  r  which  belongs  to  the  class  R.2,  necessarily 
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belongs  to  R.2',  except  in  the  case  {xn}  =  {#„'}  =  r.  It  has  thus  been  shewn 
that  the  section  (R^  R.2)  which  corresponds  to  [xn\  is  identical  with  the 
section  (Ri,  R2)  which  corresponds  to  [xn'}. 

34.  To  shew  that  a  convergent  sequence  can  always  be  found  such  as  to 
define  the  number  corresponding  to  a  given  section  (Rlt  R2),  we  observe  that 
two  rational  numbers  can  always  be  found,  one  of  which  is  in  Rlt  and  the 
other  in  R2,  and  such  that  their  difference  is  numerically  less  than  a  given 
arbitrarily  small  rational  number  e.  Let  A  be  any  rational  number  in  R1}  and 
let  e'  be  a  rational  number  <  e.  Then  of  the  numbers  A  +  e',A+  2e',. .  .A  +re't,.. . 
there  must  be  a  last  one  A  +  re'  which  falls  in  J?1}  for  A  +  ne  may  be  made  as 
large  as  we  please  by  taking  n  large  enough  ;  the  next  number  A  +  (r  +  l)e'  is 
then  in  R2 ',  and  these  numbers  A  +  re,  A  +  (r  +  1 ) e',  whose  difference  is  e' < e, 
are  the  two  numbers  required.  Moreover,  if  B  is  a  rational  number  in  R2, 
the  two  numbers  may  be  so  determined  that  both  lie  between  A  and  B;  for  we 

need  only  take  e'  to  be  of  the  form  -(B  —  A),  where  s  is  a  positive  integer  so 

s 

chosen  that  -  (B  —  A}  <  e. 
s 

Now  let  {en}  be  any  convergent  aggregate  of  rational  numbers,  which  has 
zero  for  its  limit.  Determine  a\  in  R^  and  x2  in  R2,  so  that  x2  —  xl  <  ea;  next  take 
x3  in  R1}  and  wt  in  R.2,  so  that  \x.A  —  #4|  <  e2;  and  that  x3,  x4  both  lie  between 
X-L  and  xz.  Proceeding  in  this  way,  we  can  determine  #2«-i>  xm  rational  numbers 
of  different  classes,  so  that  |  #2n-i  —  x2n  <  en ;  then  either  of  the  sequences 
[#j,  a's,  #6,..,},  {&g,  #4,...}  defines  the  number  which  is  represented  by  the 
section  (R1}  R2). 

To  prove  this,  we  observe  that  {a?2n-il  ig  a  convergent  sequence,  since  all 
the  elements  are  <  x2.  and  xl  <  x.,  <  x5 . . .. 

Again,  suppose  a  is  a  rational  number  belonging  to  R2,  we  can  shew  that, 
provided  a  rational  number  6  exists  in  R2  which  is  less  than  a,  then  a  is 
greater  than  all  the  numbers  sc1,  x.3, ...  by  more  than  a  —  b.  For 

a  —  #2?i-i  =  («  —  b)  +  (b  —  tfow-i)  >  ct  —  b. 

however  small  b  —  #2n-i  may  become.  Hence,  unless  a  is  the  smallest  rational 
number  in  R2,  the  real  number  [a]  which  corresponds  to  a  is  greater  than 
the  number  (xly  xs, ...). 

Again,  the  sequences  {.r2«-i},  {#2n}  represent  the  same  number,  since  their 
difference  is  the  aggregate  \en\  which  defines  zero.  It  now  appears,  by 
reasoning  similar  to  the  above,  that  any  number  a  in  Rl  is  such  that  the  real 
number  {a}  is  less  than  the  number  {x2n},  unless  a  is  the  greatest  rational 
number  in  Rt. 

If  either  Rl  has  a  greatest  rational  number,  or  R2  has  a  least  one,  the 
real  number  {a}  which  corresponds  to  this  rational  number  a,  is  itself  defined 
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by  (/?,,  R.j),  and  is  the  number  represented  by  either  of  the  sequences 

{a^n}.    In  any  case,  either  of  these  two  sequences  defines  the  number  given 

by  the  section  (R1}  JKy). 

The  complete  equivalence  of  the  two  theories  of  Dedekind  and  of  Cantor 
has  now  been  established.  The  first  theory  operates  with  the  whole  aggregate 
of  rational  numbers,  the  second  with  sequences  selected  out  of  that  aggregate. 


THE  NON-EXISTENCE  OF  INFINITESIMALS. 

35.  It  should  be  remarked  that,  in  assuming  that  every  section  of  the 
aggregate  of  real  numbers  defines  a  single  real  number,  it  has  been  implicitly 
assumed  that  if  a,  b  are  any  two  positive  real  numbers,  such  that  a<b,.then 
a  positive  integer  n  can  be  found  such  that  na  >b.  ^  * 

This  is  the  arithmetical  analogue  of  the  so-called  principle  of  Ai'oliiiiiuiiLo. 

If  any  real  numbers  existed  which  are  ordinally  greater  than  all  the  numbers 
a,  2a,  3«, . . . ,  then  a  section  of  the  aggregate  of  real  numbers  would  be  defined 
by  considering  all  numbers  greater  than  all  the  numbers  a,  2a,  3a,  ...  to  be  in 
one  class,  and  all  the  remaining  real  numbers  to  be  in  the  other  class ;  and 
this  section  would  define  a  real  number  N.  If  now  e  be  an  arbitrarily  chosen 
positive  number  less  than  a,  then  N  —  e  is  a  number  whiph  is  less  than  some 
of  the  numbers  a,  2a,  3a, . . . ;  and  there  must  be  a  first  of  this  set  of  numbers 
such  that  N  —  e  is  less  than  it.  Let  this  be  pa ;  thus  N  —  e  <  pa,  hence 
N< pa  +  e<  (p  +  l)a;  which  is  contrary  to  the  hypothesis  that  no  number 
na  is  in  the  class  of  numbers  which  are  >  N. 

The  property  of  the  aggregate  of  real  numbers  which  has  been  established 
may  be  denoted  by  the  statement  that  the  aggregate  of  real  numbers  forms 
an  Archimedean  system]  and  this  property  of  the  aggregate  is  essentially 
equivalent  to  the  property  that  every  section  of  the  aggregate  defines  a  single 
number  of  the  aggregate. 

A  consequence  of  the  fact  that  the  aggregate  of  real  numbers  forms  an 
Archimedean  system  is  that  so-called  infinitesimal  numbers  do  not  exist  within 
the  aggregate.  Every  positive  number  e,  being  such  that  an  integer  n  can  be 
found  such  that  ne>l,  is  a  finite  number,  in  the  sense  in  which  finite  numbers 
were  distinguished  from  infinitesimals  in  the  older  forms  of  the  Infinitesimal 
Calculus.  In  Arithmetical  Analysis  the  conception  of  the  actually  infinitesimal 
has  no  place.  When  the  expression  "infinitesimal"  is  used  at  all,  it  is  to  describe 
the  process  by  which  a  variable  to  which  the  numbers  of  a  sequence  converging 
to  zero  are  successively  ascribed,  as  values,  approaches  the  limit  zero ;  thus  an 
infinitesimal  is  a  variable  in  a  state  of  flux,  never  a  number.  Such  a  form  of 
expression,  appealing  as  it  does  to  a  mode  of  thinking  which  is  essentially 
non-arithmetical,  is  better  avoided. 
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THE  THEORY  OF  INDICES. 

36.  When  m  is  a  positive  integer,  and  x  a  rational  number,  xm  was  denned 
to  denote  x  x  x  x  x . . .  x  x  (m  factors) ;  and  this  definition  may  be  extended 
to  the  case  in  which  x  is  any  number  defined  by  a  convergent  sequence ;  so 
that  if  x  is  defined  by  {#„},  xm  is  defined  by  {xnm}.  It  thus  appears  that 
for  any  real  numbers  x,  we  have,  provided  m  and  n  are  positive  integers, 

Xm  X  Xn  =  /rm+n 

If  we  assume  x°  and  x~m  to  be  defined  as  having  such  a  meaning  that  this 
law  of  indices  holds  when  m  or  n  is  zero,  or  a  negative  integer,  we  can  at 
once  interpret  x°  and  x~m\  for 

of  x  xn  =  xn+0  =  xn,  thus  x°  =  I, 
and  x~n  x  xn  =  x°  =  1,  thus  arn  =  - 


.7;' 


When  p/q  is  a  rational  fraction,  we  shall  define  xq  to  have  such  a  meaning 
that  the  above  law  of  indices  holds  when  either  or  both  of  m,  n  may  be 
rational  fractions.  With  this  assumption 

p_        p        '!L  P. 

xq  x  xq  x  xq  . . .  x  xq  (q  factors)  =  x'p ; 

p  P 

hence  (xq)q  =  x?',  or  xq  is,  if  it  exists,  a  number  whose  qth  power  is  xp.    The 

problem  of  determining,  if  possible,  a  number  xq,  is  that  of  finding  a  number 
whose  <?th  power  is  a  given  number ;  and  it  has  been  already  shewn  that  this 
is  not  always  a  possible  operation  within  the  domain  of  rational  number. 

It  will  now  be  shewn  that,  in  the  domain  of  real  numbers,  the  operation 

of  finding  xq  is  always  a  possible  one  when  x  is  positive;  and  also  when  x  is 
negative,  provided  however  that,  in  this  latter  case,  q  is  an  odd  number,  or  if  it 
is  even,  p  is  not  odd. 

The  following  lemma  will  be  required : — If  a.  is  any  real  positive  number 
less  than  unity,  a  positive  integer  m  can  be  found  such  that  am  <  e,  where  e  is  an 
arbitrarily  prescribed  positive  number,  or  in  other  words,  lim  an  =  0. 

M~  00 

Since  an  >  an+l,  the  sequence  (a,  or, ...  an, ...)  is  convergent. 

Suppose,  if  possible,  that  the  sequence  represents  a  positive  number  A' 
different  from  zero ;  then  m  may  be  so  chosen  that  am,  a'"^1, ...  all  differ  from  A-  by 
less  than  the  arbitrarily  prescribed  number  8,  say  or"  =  k  +  n,  where  77  <  8.  We 
have  therefore  am+1  =  (k  +  77)  a.  <  (k  +  8)  a. ',  now  8  can  be  chosen  to  be  equal  to 

,  then  o"l+1  <  k  —  8 :  and  this  is  contrary  to  the  condition  imposed  in 
1  +  a 

the  choice  of  m. 
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It  follows  that  k  cannot  be  different  from  zero  ;  and  thus  the  lemma  is 
established. 

Suppose  now  that  a  is  any  positive  number,  rational  or  not,  which  lies 
between  N?  and  (N  +  1)9,  where  N  is  a  positive  integer;  we  shall  first  shew 
that  a  number  N  +  h,  where  h  <  1,  can  always  be  found  such  that  a  — 
is  positive,  and  less  than  a  —  NI.  We  find  by  division 


hence,  if  h  is  positive  and  less  than  unity,  (N  +  ft)q  —  N<*  lies  between 

qhN*-1  and  qh(N+l)r-\ 
Since  a  -  (N  +  h}9  =  (a  -  M)  -  {(N  +  h)q  -  N«], 

a  _  ffq  • 

we  must  take  h  not  greater  than  —^  —     _t  ,  in  order  that  a  —  (N  +  h)?  may 
certainly  be  positive  ;  and  the  difference  a  —  (N  +  h)v  is  then  less  than 


then  a  -  (N  +  A)«  <  (a  -  N*)    l  -        - 

Let  N!  =  N  +  h>  then  JV",  is  such  that 


and  ^i  >  N. 

In  a  similar  manner,  we  can  shew  that  a  number  iY2  exists  which  is  >  Nlt 
and  such  that 


Proceeding  in  this  manner,  we  obtain  a  series  of  numbers  JV,  Nl}  , 
Nr, ...  such  that  Nr  >  Nr-lt  and  that  a  —  Nrv  is  positive,  and  less  than 

(a -#,_,«)  Tl- 

We  shall  now  shew  that  (N,  Nlf  N2, ...  Nr, ...)  is  a  convergent  sequence 
which  defines  a  number  whose  qih  power  is  a. 

The  sequence  {Nr}  is  convergent,  since  Nr  >Nr_l,  and  every  ATr  is  less  than 
N  +  1.  The  #th  power  of  the  number  defined  by  this  convergent  sequence  is 
{AV}»  and  we  shall  shew  that  this  defines  the  number  a  or  {a}. 

We  have 

^  -  GvTl 
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N  N 

for 

and  hence  1  - 


/     JV"     \9~x. 

Now  1  —  (  -jr:  —  j  is  a  proper  fraction,  hence  from  the  lemma  proved  above 
we  infer  that  a  power  r  of  the  expression  can  be  found  which  is  less  than  an 
arbitrarily  chosen  positive  number  ;  which  number  we  may  take  to  be  --  ^-  . 

Hence,  corresponding  to  every  e,  a  number  r  can  be  found  such  that 
a  —  N9r+s  <  e,  for  s=  0,  1,  2,  ...,  and  therefore  the  sequence  {Nrv\  defines  the 
number  {a}  or  a. 

If  a  is  a  positive  proper  fraction,  we  have  (a2)?  <  a  ;  hence  we  may  take  N 
to  be  equal  to  a2,  instead  of  to  a  positive  integer.  Then  a  <  (N+l)i  ;  thus  this 
value  of  N  will  play  the  same  part  as  the  integral  value  in  the  above  proof, 
and  the  reasoning  is  the  same  as  before. 

37.  It  has  now  been  shewn  that  in  every  case  a  real  number  can  be  found 

P 
of  which  the  qi,h  power  is  a  given  positive  number  a.    It  thus  appears  that  xq 

has  an  interpretation  within  the  domain  of  real  numbers,  when  x  is  any  positive 

t) 
number,  und  -  is  a  positive  rational  fraction. 

2 

_p 
We  interpret  x  t  to  be  such  that 

_?        P 

X    2   X 
_P 


or  x  (/ 

If  #  is  a  negative  number  —  x',  we  have  (—  x')q,  defined  as  a  number  whose 
qih.  power  is  (—  x'}p  ;  and  (—  x')p  is  x'v  or  —x'v,  according  as  p  is  even 
or  odd. 

p  P 

If  p  is  even,  (—  a/)9  can  be  interpreted  as  the  value  of  x'i.    If  p  is  odd, 

P  p 

and  q  is  odd,  (-x')i  may  be  interpreted  as  —x'q.     When  p  is  odd,  and  q  is 

P 
even,  we  have  obtained  no  interpretation  of  (—  x')^. 

2H-1 

To  complete  the  theory  of  indices  in  such  a  way  that  (—  x)  -s  may  have 
an  interpretation,  we  should  require  a  further  extension  of  the  conception  of 
number.  This  further  extension  takes  place  by  the  introduction  of  complex 
number,  which  is  however  outside  the  limits  imposed  upon  this  work  as  a 
treatise  dealing  only  with  real  number. 
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38.    The  only  case  in  which  xn,  for  a  positive  x,  has  mt  been  defined,  is 
that  in  which  n  is  ••*  a  rational  number.     To  extend  the  definition  to  thif» 
case",  we  suppose  n  to  be  defined  by  a  convergent  sequence  {nr},  in  which  all  the 
numbers  nr  are  rational.  We  shall  shew  that  the  aggregate  {*>nr}  is  convergent, 
and  the  number  which  it  defines  we  shall  denote  by  xn. 

We  have  x"*  —  xnr+t  =  acnr  jl  —  #nr+i-»rj  ;  now,  since  \nr]  is  a  convergent 
aggregate,  all  the  numbers  nr  are  numerically  less  than  some  fixed  number, 
and  therefore  \xn+\  <  A,  where  A  is  some  fixed  number. 

First  suppose  x>\,  then 

#«r  _  xnr+t  =  xnr+»  (#"»•-  nr+«  —  1)  =  #">•  (1  —  xnr+*~nr), 

hence  x^  -#"«•+•  |  <  A  |#lfl»—  nr+«i  —  1  1. 

Now  let  r  be  so  chosen  that,  for  all  values  of  s, 

1 
|  nf  -  nr+t  |  <  -  , 

where  q  is  a  positive  integer  ;  then 

-  x-l 

x\nr-nri.i\_l  <  xq_  I  <_     __  -  __  —  —  j, 

~~ 


X  —  1 

hence  «l«r^«r+«i  —  1|<- 

2 

or—  1 

or  !  xnr  —  #nH-8  I  <  A  -     —  , 

? 

and  if  q  is  chosen  so  thai  -  <  -r-,  ,  where  e  is  a  fixed  number,  we  see 

q     A  (x  —  1) 

that  r  may  be  so  chosen  that  |  #"r  —  xnr+»  j  <  e,  for  all  values  of  s  ;  therefore 

{x1^}  is  a  convergent  sequence. 

i  \  * 

If  x  <  1,  then  j  —  [•  is  a  convergent  sequence,  and  therefore  {x'lr}  is  also 

convergent,  since  it  is  the  quotient  of  {1}  and  [x"*}.    If  x  =  1,  then  [xnr]  =  1. 
Thus  in  every  case  {xUr\  is  a  convergent  sequence  if  [nr]  is  convergent. 

Since  {x1*]  x  {xmr}  =  ja;n»i+T"^},  we  see  that  the  definition  of  xn,  when  n  is 
not  rational,  is  such  that  the  relation  xm  xxn  =  xm+n  is  satisfied. 


THE  REPRESENTATION  OF  REAL  NUMBERS. 

39.  The  ordinary  mode  of  representation  of  a  real  number  is  by  means  of 
a  decimal,  or  more  generally  by  a  radix -fraction.  When  the  decimal  is  non- 
terrninating,  this  mode  of  representation  is  a  case  of  the  representation  by  a 
convergent  sequence  of  rational  numbers,  in  accordance  with  Cantor's  theory. 
For  example,  the  number  TT  is  represented  by  the  sequence 
(3,  3-1,  3-141,  3-1415,  3'14159,  3-141592, ...), 
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where,  by  known  processes,  any  prescribed  element  can  be  found  as  the  result 
of  a  definite  number  of  arithmetical  operations. 

The  general  theorem,  will  be  established  that  every  positive  non-rational 
real  number  N  is  uniquely  representable  by  means  of  a  non -terminating  series 
of  radix-fractions,  of  which  r,  the  radix,  is  any  integer  ^  2. 

Of  the  numbers  0,  r,  2r,  3r,  . . . ,  there  is  (see  §  35),  of  all  those  which  are 
less  than  rN,  a  greatest  one  c0r,  which  may  be  zero  ;  thus 

rN  >  c0r,  and  <  (c0  +  1)  r ; 

jy 

it  follows  that  N  =  cfl  H , 

r 

where  Nt  is  a  positive  number  less  than  r. 
In  a  similar  manner  we  obtain 


where  N2,  Ns,  ...  Nn+i  are  all  <  r ;  therefore 

N~  =  r  4-  —  4-  -2  4-         -4-  —  4-       n+1 

where   c0,  Cj,   c2,    ...   cn   are   each   of  them    positive   integral,   or   zero,   and 
0  <  Nn+l  <  r. 

Since  JV 


and  it  has  been  shewn  that  —  has  the  limit  zero  as  n  is  indefinitely  increased, 
we  see  that  the  sequence,  of  which  the  nth  element  is 


is  convergent,  and  represents  the  real  number  N.     This  is  expressed  by 

HT  ^1  £"2  Gti 

N  =  c0  +  -+-2  +  .  ..  +  ->  +  ..., 

^»        .y*&  r^n, 

in  which  N  is  represented  by  a  non-terminating  radix-fraction. 

Let  us  now  consider  the  case  in  which  N  is  a  rational  number  -r  ,  in  its 

b 

lowest  term.  We  have  a  =  a0b  +  @0,  where  yS0  <  b  ;  and  r/30  =  ^b  +  filt  where 
&<&;  r^  =  ct2&  +  Ai,  ...,r^n_i  =  «,,6  -f  /3n,  where  /8T>  &,  ...  @n  are  all  less 
than  6. 

If  one  of  the  numbers  ft,  say  /3n,  is  zero,  we  have 

,,     a  oil      cto  an 

N  =  r  =a0  +  -1  +  ~  +  ...  +  ^; 
b  r      r*  rn 
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and  thus  N  is  expressed  in  terminating  radix-fractions ;  this  case  can  only 
arise  when  6  contains  only  prime  factors  of  r.  The  terminating  series  of 
radix-fractions  can  be  replaced  by  a  periodic  one  which  does  not  terminate. 
For  if  we  use  ow  —  1  instead  of  an,  as  the  numerator  of  rn,  we  have 

»•£„_!  =  (an  -  1)  b  +  b  ; 
thus  @n  becomes  6  instead  of  zero,  and 

rb  =  (r  -  1)6  +  6; 

thus  fin,  fin+\, ...  are  all  equal  to  6;  and  an+1,  On+z, ...  are  all  equal  to  r  —  1. 
Thus  N  is  represented  by 

r  —  1 


It  thus  appears  that  a  rational  number,  which  in  its  lowest  terms  has  a 
denominator  which  contains  only  prime  factors  of  r,  is  capable  of  a  double 
representation;  (1)  by  a  terminating  series  of  radix-fractions  ;  (2)  by  a  non- 
terminating  series  of  radix-fractions,  of  which  the  numerators  after  some  fixed 
one  are  all  r  —  I. 

In  case  none  of  the  numbers  filt  /32,  ...  fin,  ...  vanishes,  it  is  clear  that 
since  all  these  numbers  are  either  1,  2,  3,  ...  b  —  1,  they  cannot  be  all  unequal. 
Suppose  fin  is  the  first  which  is  repeated,  and  let  fin  =  fin+m  ;  it  is  then  clear 
that  fin+i  =  fin+m+i,  fin+t  =  fin+m+2,  •  •  •  ',  and  therefore  the  number  is  repre- 
sented by  a  recurring  series  of  radix-fractions. 

40.  When  a  number  is  defined  by  means  of  a  convergent  sequence  of  some 
special  form,  it  is  in  general  not  immediately  obvious  whether  the  number  is 
rational  or  irrational.  Many  special  investigations  relating  to  particular  cases, 
and  various  general  criteria,  have  been  given  by  well-known  mathematicians. 

One  of  the  most  important  modes  of  such  representation  of  a  number  is 
that  by  an  endless  continued  fraction.  This  fraction  may  be  regarded  as  an 
aggregate,  each  element  of  which  is  a  finite  continued  fraction.  Legendre 
established  the  fundamental  theorem  that  a  number  represented  by  an  end- 
less continued  fraction 

«]     a2     a3          an 
bi  ±  &2  ±  63  ±  '"  b^±  " 
that  is,  by  an  aggregate  of  which  the  nth  element  is 


a 


bi  ±  b2±  63  ±  '  "  bu  ' 

is  irrational*,  provided  the  positive  integers  an,  bn  are  such  that  for  every 
value  of  n,  bn  —  an  ^  1  ;  except  that  when  bn  —  an  =  1,  for  every  value  of  n  ^  m, 

*  A  proof  of  this  theorem  is  given  by  Pringsheim,  "Ueber  die  Convergenz  unendlicher  Ketten- 
briiche,"  Sitzungsberichte  d.  layer.  Akud.  vol.  xxvn,  1897,  p.  318. 
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where  m  is  some  fixed  number,  and  when  at  the  same  time  the  signs  before 
all  the  fractions  -^ ,  for  n  >  m,  are  negative,  then   the   continued   fraction 

t>n 

converges  to  unity,  or  to  a  rational  fraction,  according  as  m=  1,  or  ra>l. 

This  theorem  contains  as  special  cases  the  theorems  previously  established 
by  Lambert,  that  e?,  tan  x,  logg  x,  tan"1  x,  TT  are  irrational  for  rational  values 
of  x.  The  irrationality  of  e  and  e2  was  first  proved  by  Euler*.  Legendref 
himself  applied  the  general  theorem  to  prove  the  irrationality  of  7r2,  although 
his  proof  was  lacking  in  rigour. 

The  following  general  theorem  has  been  proved J  by  Cantor: 

If  b,  b',  b",  ...  is  a  set  of  positive  integers  such  that,  q  being  any 
arbitrarily  chosen  integer,  all  the  numbers  1,  b,  bb',  bb'b",  . ..,  from  and  after 
some  fixed  number  of  the  sequence,  are  divisible  by  q;  then  any  number  N 
can  be  uniquely  represented  by 

j-        \          /J,  V 

+  b+bb'  +  bW'  +  "• 

where  /  is  an  integer,  and  X,  //.,  v, ...  are  integers  (including  0)  such  that 


Further,  in  order  that  the  number  N  may  be  rational,  it  is  necessary  that, 
from  and  after  some  fixed  term  of  the  series,  all  the  numbers  A,,  p,  v,  ... 
have  their  highest  possible  values.  If  this  condition  is  not  satisfied,  N  is 
irrational. 

As  an  example  of  this  theorem,  the  number  e  represented  by 

1    —     -L 

99    39    *}    A 

—  *J    .   O  A  •    O   •     *? 

is  seen  to  be  irrational. 

Another  mode  of  representation  is  that  in  which  the  sequence  of  integers 
b,  b',  b",  ...,  from  a  particular  element  onwards,  is  periodic.  In  this  case,  the 
necessary  and  sufficient  condition  that  the  number  represented  by 

l)+lb'  +  bb'b"  +'" 

should  be  rational,  is  that  the  sequence  (3,  ft',  8",  ...  be,  from  and  after  some 
fixed  number  of  the  sequence,  periodic.  This  is  a  generalization  of  the 
theorem  relating  to  a  number  represented  by  radix-fractions. 

*  On  the  history  of  these  theorems  see  Pringsheim's  article  "Ueber  die  ersten  Beweise  der 
Irratiorialitat  von  e  und  IT,"  Sitzungsberichte  d.  bayer.  Akad.  vol.  xxvn. 

t  See  his  Elements  de  Geometric,  Note  4;  see  also  Kudio's  work,  "Archimedes,  Huygens, 
Lambert,  Legendre,"  1892,  p.  166. 

J  Schlomilch's  Zeitschrift,  vol.  xiv  (1869),  "  Ueber  die  einfachen  Zahlensysterne. " 
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If  6  =  2,  b'  =  3,  b"  =  4,  ...  we  obtain  the  theorem*  that  the  number  re- 
presented by 

c»    i   c»   i   c»  i        i  Cn  _L 

where  cn  ^  n  —  1,  is  rational,  only  if,  from  and  after  some  particular  value  of  n, 

—  «  1 

A  mode  of  representation  of  numbers  by  sequence  of  products  has  been 
given f  by  Cantor.  He  shews  that  every  number  N>1,  can  be  uniquely 
represented  in  the  form 

fl  +  i)(l  +  J)(l+i)(l  +  J).... 

V        tt/V        6/V        cj\        d) 
where  a,  b,  c,...  are  integers  such  that 

N              Na 
The  number  a  is  determined  as  the  integral  part  of  -^ — r  .    If =  B, 

J\  —  1  a .  +  1 

D  DJ. 

6  is  the  integral  part  of    JT — - '   if  ^ — -  =  (7,    c   is   the    integral    part   of 

C 

^ — r- ;  and  so  on. 
0  —~  1 

As  an  example,  V2  is  represented  by 


6658577  " 
where       17  =  2.32-1,     577  =  2.17--!,     665857  =  2.577s -1,.... 

The  criterion  for  determining  whether  N  is  rational  or  irrational  is  the 
following : 

The  number  represented  by 


where  b  ^  a-,     c  ^  62, . . . , 

all  the  numbers  a,  b,  c, ...  being  positive  integers,  is  rational  if,  from  and  after 
some  fixed  number  of  the  sequence  a,b,c,...,  each  number  is  the  square  of 
the  preceding  number  of  the  sequence ;  but  the  number  is  irrational  it' 
this  condition  is  not  satisfied. 

*  See  Stephanos,  Bulletin  de  la  soc.  math,  dc  France,  vol.  vn  (1879).  For  further  information 
on  the  history  of  this  subject  see  Pringsheim's  article  I.  A.  3,  in  the  Encydopiidie  der  Math. 
Wissenschaften. 

t  Schlomilclrs  Zeitxclirift,  vol.  xiv  (1869),  "  Ueber  zwei  Siitze...." 
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THE   CONTINUUM   OF   REAL   NUMBERS. 

41.  If  alt  &i  are  any  two  real  numbers  such  that  al<  61}  then  two  real 
numbers  a2,  62,  (a2  <  62),  can  be  found  both  lying  between  ax,  61?  and  such  that 
the  difference  between  a2,  62  is  as  small  as  we  please,  i.e.  62  —  a2<  e,  where  e  is 
an  arbitrarily  prescribed  number.  Between  a2,  62,  two  more  numbers  a3,  63, 
(a3  <  63),  can  be  found  whose  difference  is  again  as  small  as  we  please;  and 
this  process  may  be  carried  on  indefinitely.  This  property  of  the  aggregate 
of  real  numbers  may  be  expressed,  to  use  the  term  introduced  by  G.  Cantor, 
by  saying  that  the  aggregate  of  real  numbers  is  connex;  it  arises  from  the 
fact  that  an  indefinite  series  of  numbers  can  be  found  which  lie  between  any 
two  given  numbers.  If  we  anticipate  a  term  which  will  be  introduced  when 
we  come  to  the  general  theory  of  aggregates,  the  property  of  connexity  may 
be  expressed  by  saying  that  the  aggregate  of  real  numbers  is  everywhere  dense. 

It  will  further  be  observed  that  the  aggregate  of  rational  numbers  is  also 
connex,  or  everywhere  dense;  so  that,  so  far  as  this  property  is  concerned, 
there  is  nothing  to  differentiate  the  one  aggregate  from  the  other. 

If  the  difference  of  an  and  bn  is  denoted  by  en,  and  the  sequence 
61,  e2,  ...  en,  ...  satisfies  the  condition  that,  corresponding  to  any  fixed 
arbitrarily  small  positive  number  77,  a  value  of  n  can  be  found  such  that 
e»>  €n+i,  •  •  •  are  all  less  than  rj,  then  there  exists  a  single  real  number  x  which  is 
greater  than  all  the  numbers  a1}  a.2,  ...,  and  less  than  all  the  numbers  61;  62, 
This  number  x  is  the  limit  of  either  of  the  sequences  (alt  a2,  ...  an,  ...)  and 
(&i,  b2,  •••  bn,  ...),  and  is  defined  by  a  section  of  all  the  real  numbers. 

If  we  confine  ourselves  to  the  domain  of  rational  numbers,  there  subsists 
in  that  domain  no  such  property;  that  is,  the  above  numbers  a,  b  being  all 
rational,  no  such  rational  number  as  x  necessarily  exists. 

In  the  domain  of  Real  Number,  (a)  every  convergent  sequence  has  a  limit 
which  is  a  number  belonging  to  the  domain,  and  (6)  every  number  is  the  limit 
of  properly  chosen  sequences  of  numbers  belonging  to  the  domain.  The 
possession  by  the  domain  of  real  numbers  of  these  properties  (a)  and  (b)  is 
expressed  by  saying  that  the  aggregate  of  real  numbers  is  perfect. 

The  domain  of  Rational  Number  possesses  the  property  (b)  but  not  the 
property  («);  consequently  the  aggregate  of  rational  numbers  is  not  perfect. 

From  the  point  of  view  of  Dedekind's  theory,  the  property  that  the 
aggregate  of  real  numbers  is  perfect  expresses  the  fact  that  every  section  of 
the  real  numbers  corresponds  to  a  single  real  number,  and  the  converse. 
A  section  of  the  rational  numbers  does  not  always  correspond  to  a  rational 
number;  consequently  the  aggregate  of  rational  numbers  is  not  perfect. 

We  here  give  the  name  continuum*  to  an  aggregate  which  possesses  the 
two  properties  of  being  connex,  and  of  being  perfect.     This  is  in  the  first 
*  See  Cantor,  Math.  Annalen,  vol.  xxi,  p.  576. 
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instance  taken  to  be  the  definition  of  the  meaning  of  the  word  continuum,  as 
it  is  frequently  used  in  Analysis.  Thus  the  aggregate  of  real  numbers  forms 
a  continuum;  whereas  the  aggregate  of  rational  numbers  is  essentially  discrete, 
and  does  not  form  a  continuum,  since  one  of  the  two  essential  properties  of  a 
continuum  is  absent. 

The  aggregate  of  real  numbers  is  spoken  of  as  the  continuum  of  real 
numbers,  or  the  arithmetic  continuum. 

The  real  numbers  which  lie  between  two  numbers  a,  b  do  not  form  a 
continuum  in  accordance  with  the  above  definition;  but  if  the  two  numbers 
a,  b  themselves  are  considered  to  be  included  in  the  total  aggregate,  then  this 
completed  aggregate  does  form  a  continuum. 

It  should  be  remarked  that,  in  accordance  with  a  somewhat  different 
definition  of  the  term  continuum,  employed  by  Weierstrass,  and  which  will 
be  referred  to  in  Chapter  II,  the  real  numbers  between  a  and  b  form  a 
continuum. 

All  the  real  numbers  x  such  that  a  £  x  £  b,  in  the  ordinal  sense  of  the 
symbols  <,=,>,  are  said  to  form  an  interval  (a,  b);  and  such  an  interval 
is  frequently  described  as  a  closed  interval. 

The  real  numbers  x  which  are  such  that  a  <  x<  b,  are  frequently  said  to 
form  an  open  interval  (a,  b). 

The  closed  interval  (a,  b}  is  a  continuum,  since  it  satisfies  the  two  necessary 
conditions  for  the  applicability  of  the  term;  but  the  open  interval  (a,  b)  is  not 
a  continuum  in  this  sense  of  the  term,  as  it  contains  convergent  sequences 
which  have  no  limit  belonging  to  the  open  interval.  Such  an  open  interval 
has  been  termed  by  Cantor  a  semi-continuum,  but,  in  accordance  with  the 
observation  made  above,  it  may  be  termed  a  Weierstrassian  continuum. 

Of  the  two  essential  properties  of  the  arithmetic  continuum,  that  of 
connexity,  and  that  denoted  by  the  term  perfect,  the  latter  is  absolutely 
indispensable,  in  order  that  the  arithmetic  continuum  may  be  suitable  to  be 
the  field  of  operations  in  analysis.  It  will  appear,  when  we  come  to  the  con- 
sideration of  the  theory  of  functions  of  a  real  variable,  that  many  of  the  most 
important  properties  of  a  function  may  still  subsist  even  if  the  domain  of  the 
variable  lacks  the  property  of  connexity;  but  that  such  properties  would  not 
belong  to  functions  of  a  variable  which  is  defined  for  a  domain  such  that 
convergent  sequences  of  numbers  in  it  possess  no  limit  within  that  domain, 
and  which  therefore  lacks  the  property  of  being  perfect.  This  is  the  more 
remarkable  on  account  of  the  fact  that,  in  the  older  traditional  notion  of  a 
continuum,  the  property  of  connexity  was  the  one  which  was  regarded  as  all 
important;  the  more  essential  property  of  being  perfect  has  only  been  ex- 
plicitly formulated  in  the  course  of  the  construction  of  the  modern  arithmetical 
theory. 

4—2 
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42.  The  term  arithmetic  continuum  is  used  to  denote  the  aggregate  of 
real  numbers,  because  it  is  held  that  the  system  of  numbers  of  this  aggregate 
is  adequate  for  the  complete  analytical  representation  of  what  is  known  as 
continuous  magnitude.  The  theory  of  the  arithmetic  continuum  has  been 
criticized  on  the  ground  that  it  is  an  attempt  to  find  the  continuous  within 
the  domain  of  number,  whereas  number  is  essentially  discrete.  Such  an 
objection  presupposes  the  existence  of  some  independent  conception  of  the 
continuum,  with  which  that  of  the  aggregate  of  real  numbers  can  be  com- 
pared. At  the  time  when  the  theory  of  the  arithmetic  continuum  was 
developed,  the  only  conception  of  the  continuum  which  was  extant  was  that 
of  the  continuum  as  given  by  intuition;  but  this,  as  we  shall  shew,  is  too 
vague  a  conception  to  be  fitted  for  an  object  of  exact  mathematical  thought, 
until  its  character  as  a  pure  intuitional  datum  has  been  clarified  by  exact 
definitions  and  axioms.  The  discussions  connected  with  arithmetization  have 
led  to  the  construction  of  abstract  theories*  of  measurable  quantity;  and 
these  all  involve  the  use  of  some  system  of  arithmetic,  as  providing  the 
necessary  language  for  the  description  of  the  relations  of  magnitudes  and 
quantities.  It  would  thus  appear  to  be  highly  probable  that,  whatever 
abstract  conception  of  the  intuitional  continuum  of  quantity  and  magnitude 
may  be  developed,  a  parallel  conception  of  the  arithmetic  continuum,  though 
not  necessarily  identical  with  the  one  which  we  have  discussed,  will  be 
required.  To  any  such  scheme  of  numbers,  the  same  objection  might  be 
raised  as  has  been  referred  to  above;  but  if  the  objection  were  a  valid  one, 
the  complete  representation  of  continuous  magnitudes  by  numbers  would, 
under  any  theory  of  such  magnitudes,  be  impossible.  It  is  clear  that  it 
is  only  in  connection  with  an  exact  abstract  theory  of  magnitude,  that  any 
question  as  to  the  adequacy  of  the  continuum  of  real  numbers  for  the 
measurement  of  magnitudes  can  arise.  For  actual  measurement  of  physical, 
or  of  spatial, -or  temporal  magnitudes,  the  rational  numbers  are  sufficient; 
such  measurement  being  essentially  of  an  approximate  character  only,  the 
degree  of  error  depending  upon  the  accuracy  of  the  instruments  employed. 

The  purely  ordinal  nature  of  the  conception  of  the  arithmetic  continuum, 
including  the  ordinal  character  of  an  interval,  has  been  pointed  out  in  the 
course  of  the  development  of  the  theory.  This  will  be  further  elucidated 
in  connection  with  the  abstract  theory  of  order-types,  to  be  discussed  in 
Chapter  iv. 

*  See  O.  H.  Holder,  Die  Axiome  der  Quantitat  und  die  Lchre  roin  Mans,  Leipziger  Berichte, 
vol.  un  (1901) ;  also  Veronese's  work,  Fondamenti  di  Geometria,  1891 ;  and  Bettazzi's  work,  Teoria 
delle  grandezze,  1890. 
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THE  CONTINUUM   GIVEN   BY   INTUITION. 

43.  Before  the  development  of  analysis  was  made  to  rest  upon  a  purely 
arithmetical  basis,  it  was  usually  considered  that  the  field  of  operations  was 
the  continuum  given  by  our  intuition  of  extensive  magnitude,  especially  of 
spatial  or  temporal  magnitude,  and  of  the  motion  of  bodies  through  space. 

The  intuitive  idea  of  continuous  motion  implies  that,  in  order  that  a  body 
may  pass  from  one  position  A  to  another  position  B,  it  must  pass  through 
every  intermediate  position  in  its  path.  An  attempt  to  answer  the  question, 
what  is  meant  by  every  intermediate  position,  reveals  the  essential  difficulties 
of  this  conception,  and  gives  rise  to  a  demand  for  an  exact  theoretical  treat- 
ment of  continuous  magnitude. 

The  implication  contained  in  the  idea  of  continuous  motion  shews  that, 
between  A  and  B,  other  positions  A't  B'  exist,  which  the  body  must  occupy 
at  definite  times;  that  between  A',  B',  other  such  positions  exist,  and  SQ  on. 
The  intuitive  notion  of  the  continuum,  and  that  of  continuous  motion,  negate 
the  idea  that  such  a  process  of  subdivision  can  be  conceived  of  as  having  a 
definite  termination.  The  view  is  prevalent  that  the  intuitional  notions  of 
continuity  and  of  continuous  motion  are  fundamental  and  sui  generis;  and 
that  they  are  incapable  of  being  exhaustively  described  by  a  scheme  of 
specification  of  positions.  Nevertheless,  the  aspect  of  the  continuum  as  a  field 
of  possible  positions  is  the  one  which  is  accessible  to  Arithmetic  Analysis,  and 
with-  which  alone  Mathematical  Analysis  is  directly  concerned.  That  property 
of  the  intuitional  continuum,  which  may  be  described  as  unlimited  divisibility, 
is  the  only  one  that  is  immediately  available  for  use  in  Mathematical  thought; 
and  this  property  is  not  sufficient  for  the  purposes  in  view,  until  it  has  been 
supplemented  by  a  system  of  axioms  and  definitions  which  shall  suffice  to 
provide  a  complete  and  exact  description  of  the  possible  positions  of  points 
and  other  geometrical  objects  which  can  be  determined  in  space.  Such  a 
scheme  constitutes  an  abstract  theory  of  spatial  magnitude. 

The  exact  theory  of  magnitude  was  developed  to  a  considerable  extent 
by  Euclid;  but  not  until  recently,  under  the  influence  of  the  ideas  of  the 
arithmetical  theory,  has  it  been  perfected  in  a  form  which  exhibits  the  exact 
system  of  axioms  and  definitions  necessary  for  a  characterisation  of  continuity, 
that  is  adequate  for  mathematical  analysis.  Besides  the  arithmetic  theory  of 
number,  there  exists  at  the  present  time  a  theory  of  magnitude  which  runs 
to  a  certain  extent  parallel  with  the  former  theory.  Some  mathematicians* 
still  prefer  to  regard  number  as  primarily  representing  the  ratio  of  two 
magnitudes;  but  they  nevertheless  to  a  large  extent  employ  the  methods  of 
arithmetical  analysis. 

*  P.  Du  Bois  Beymond  in  his  Allgemeine  Functiotientheorie  strongly  advocates  the  view  tha 
linear  magnitude  forms  the  basis  of   the  conception  of   Number.      See  also  Stolz,  Allgemeine 
Arithmetik,  where  both  views  of  Number  are  developed.    See  also  G.  Ascoli,  Rend.  Int.  Lomb.  (2) 
28  (1895). 
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THE  STRAIGHT  LINE  AS  A  CONTINUUM. 

44.  Although  it  is  no  part  of  the  plan  of  the  present  work  to  enter  fully 
into  the  general  theory  of  Magnitude,  it  is  necessary  briefly  to  consider  the 
case  of  those  magnitudes  which  are  segments  of  a  straight  line ;  that  straight 
line  which  is  the  ideal  object  of  geometry,  and  which  is  the  ideal  counterpart 
of  the  physical  straight  line  of  perception. 

The  length  of  the  segment  between  two  points  A,  B,  of  a  straight  line, 
is  a  particular  case  of  a  magnitude;  and  we  shall  take  this  conception  as  a 
datum,  subject  to  a  set  of  axioms*  relating  to  the  notions  of  congruency,  and 
to  the  notions  greater  and  less  as  applied  to  magnitudes. 

We  assume  that  any  number  of  congruent  segments  OA,  AB,  EG,  ...  can 
be  constructed  on  the  straight  line;  and  that  any  segment  OA  can  be  divided 
into  any  number  of  segments  which  are  all  congruent  to  one  another. 

Any  segment  OA  may  be  taken  as  the  unit  of  length,  so  that  its 
magnitude  is  represented  by  the  number  1;  its  multiples  OB,  00,  ...  are 

denoted  by  the  numbers  2,3, If  each  one  of  the  segments  OA,  AB,  BC, ... 

be  divided  into  the  same  number  q  of  equal  parts,  then,  if  P  is  a  point  of 
division,  OP  is  denoted  by  a  fractional  number  p/q,  where  p  is  the  number  of 
the  sub-segments  in  OP.  Thus  when  p,  q  are  any  positive  integral  numbers, 
p/q  represents  a  definite  magnitude  OP,  the  unit  magnitude  OA  having  been 
fixed  upon  beforehand. 

Further,  the  number  p/q  may  also  be  regarded  as  representing  the  position 
of  the  point  P  itself.  In  order  to  represent  points  of  the  straight  line  on  both 
sides  of  0,  the  convention  is  made,  that  points  on  one  side  of  0  shall  be  re- 
presented by  positive  numbers,  and  those  on  the  other  side  by  negative  numbers; 
thus  if  P  is  on  the  right  of  0,  and  P'  on  the  left  of  0,  and  if  OP  =  OP',  the 
point  P'  is  represented  by  the  number  —p/q.  The  length  of  any  segment  of 
the  straight  line,  whose  ends  are  points  to  which  rational  numbers  have  been 
assigned  in  the  manner  explained  above,  is  the  difference  of  the  above  two 
numbers.  In  this  manner,  we  have  a  correspondence  established  between  the 
aggregate  of  rational  numbers  and  an  aggregate  of  points  on  the  straight  line, 
the  relation  of  order  being  conserved  in  the  correspondence,  so  that  the  two 
aggregates  are  similar. 

The  set  of  points,  thus  represented  by  rational  numbers,  we  may  speak  of 
as  the  rational  points  of  the  straight  line ;  but  it  must  be  remembered  that  a 
definite  origin  0,  and  a  definite  unit  of  length  OA,  are  supposed  to  have  been 
fixed  upon  beforehand ;  and  if  these  be  altered,  the  set  of  rational  points  will 
in  general  be  altered  also. 

*  These  axioms  are  discussed  by  0.  Holder,  Leipziger  Berichte,  vol.  LIII,  1901. 
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It  has  been  assumed  as  an  axiom  that,  if  P^,  is  any  segment  of  the  straight 
line,  it  may  be  divided  into  any  number  w,  of  equal  parts  :  of  these,  if  PyQ2  be 
taken  as  one,  the  same  axiom  asserts  that  P2Q2  may  be  similarly  divided  into 
any  number,  n.,,  of  equal  parts,  P3Q3  being  one  of  the  parts ;  and  that  this 
process  may  be  repeated  an  unlimited  number  of  times.  The  axiom  is  equivalent 
to  an  assumption  that  the  straight  line  is  capable  of  unlimited  divisibility ; 
and  this,  being  a  characteristic  property  of  the  intuitional  linear  continuum, 
must  also  hold  for  its  ideal  counterpart,  the  straight  line  which  we  are  here 
considering. 

We  proceed  to  assume  as  another  axiom  that,  P,Qi,  P2Q2,  PsQs,  •  •  •  being  the 
segments  constructed  as  above,  there  exists  in  the  straight  line  one  point  X, 
and  one  only,  which  separates  all  the  points  Pl}  P2,  P3,  ...  from  all  the  points 
Qi,  Q*>  Qs,  —  If  y  be  any  point  other  than  X,  then  points  belonging  to  the 
sequence  P,,  P2,  P3, ...  and  points  belonging  to  the  sequence  Qlt  Q.,,  Qs,  ...  can 
be  found  which  are  both  on  the  same  side  of  Y. 

The  point  X  may  be  regarded  as  the  limit  of  either  sequence  of  points ; 
and  the  property  corresponds  to  that  property  of  the  arithmetic  continuum 
which  is  expressed  by  saying  that  it  is  perfect. 

In  accordance  with  this  axiom  there  is  one  single  point  on  the  straight  line 
which  corresponds  to  any  given  real  number ;  and  this  point,  or  the  magnitude 
of  the  corresponding  segment,  may  be  represented  by  the  real  number. 

This  axiom  has  been  stated,  by  Dedekind,  in  a  form  corresponding  to  his 
definition  of  an  irrational  number :  that  a  section  of  the  rational  points,  in 
which  they  are  divided  into  two  classes,  is  made  by  a  single  point. 

Another  form  of  the  axiom  is  that  known  as  the  Axiom  of  Archimedes* : — 
that  if  AB,  A'B'  are  any  two  segments  of  the  straight  line,  of  which  AB  is 
the  smaller  one,  an  integer  n  can  always  be  found  such  that  n.AB>A'B'. 
As  in  the  case  of  the  arithmetic  continuum,  this  is  equivalent  to  the  negation 
of  the  existence  of  infinitesimal  segments  of  the  straight  line. 

This  axiom  being  assumed,  there  is  a  complete  correspondence  between 
the  points  of  the  straight  line  and  the  aggregate  of  real  numbers.  Thus  the 
nature  of  the  linear  continuum,  that  is,  so  far  as  its  possible  parts,  and  the 
possible  positions  in  it,  are  concerned,  is  completely  represented  and  described 
by  means  of  the  arithmetic  continuum,  the  axioms  relating  to  the  straight  line 
having  been  so  chosen  that  this  may  be  the  case.  It  will  be  observed  that 
there  is  no  real  disparity  between  the  rational  points  and  the  irrational  points 
of  the  straight  line ;  a  point,  which  with  one  origin  and  one  unit  of  length,  is 
a  rational  point,  may  be  an  irrational  point  if  another  origin,  or  another  unit 
of  length,  be  chosen. 

*  The  importance  of  the  Axiom  of  Archimedes  in  this  connection  was  pointed  out  and. discussed 
by  Stolz,  Math.  Annalen,  vols.  xxn  and  xxxix. 
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45.  The  mode  which  has  been  adopted  above,  of  establishing  a  complete 
correspondence  between  the  aggregate  of  real  numbers  and  the  aggregate  of 
points  in  a  straight  line,  though  the  most  convenient  mode,  is  not  the  only 
possible  one.  All  that  is  really  necessary  for  the  correspondence  is  that,  in 
accordance  with  some  systematic  scheme,  the  points  in  the  straight  line  shall 
be  made  to  correspond  with  the  numbers  of  the  arithmetic  continuum,  in  such 
a  way  that  the  relation  of  order  is  conserved  in  the  correspondence.  It  is  not 
necessary  that  the  difference  of  two  numbers  should  represent  the  length  of 
the  segment  of  the  straight  line  which  is  terminated  by  the  points  that 
correspond  to  the  two  numbers.  The  mode  of  correspondence  given  above  is 
however  the  simplest  one,  and  will  therefore  be  adopted  for  the  purpose  of 
enabling  us  to  use  the  language  of  geometry  in  analytical  discussion. 

In  the  case  of  space  of  two  or  of  three  dimensions,  it  will  be  assumed  as 
axiomatic  that  one  point  of  the  space,  and  one  only,  corresponds  to  each  pair 
or  triplet  of  real  numbers  which  represent  Cartesian  coordinates.  This 
axiom  may  be  considered  as  fundamental  in  the  Cartesian  system  of  analytical 
geometry. 

The  disputable  idea  that  the  theory  here  explained  necessarily  implies  that 
a  continuum  is  to  be  regarded  as  made  up  of  points,  which  are  elements  not 
possessing  magnitude,  has  frequently  been  a  stumbling-block  in  the  way  of 
the  acceptance  of  the  view  of  the  spatial  continuum  which  has  been  indicated 
above.  It  has  been  held  that,  if  space  is  to  be  regarded  as  made  up  of  elements, 
these  elements  must  themselves  possess  spatial  character ;  and  this  view  has 
given  rise  to  various  theories  of  infinitesimals  or  of  indivisibles,  as  components 
of  spatial  magnitude.  The  most  modern  and  complete  theory  of  this  kind  has 
been  developed  by  Veronese*,  and  is  based  upon  a  denial  of  the  principle  of 
Archimedes  which  has  been  already  referred  to.  In  Veronese's  system,  when 
a  unit  segment  of  a  straight  line  has  been  chosen,  there  exist  segments  which 
are  too  large,  and  others  that  are  too  small,  to  be  capable  of  representation  by 
finite  numbers  ;  and  these  segments  are  respectively  infinite,  and  infinitesimal, 
relatively  to  the  unit  segment  chosen.  Under  this  scheme,  a  section  of  the 
rational  points,  or  a  section  of  the  points  represented  by  real  numbers,  is  made, 
not  by  a  single  point,  but  by  an  infinitesimal  segment.  Veronese  has  con- ' 
sequently  introduced  systems  of  infinite  and  of  infinitesimal  numbers,  each 
of  an  unlimited  number  of  orders,  for  the  measurement  of  segments  which, 
relatively  to  a  given  scale,  are  infinite  or  infinitesimal.  From  his  point  of  view, 
the  points  on  a  straight  line  which  represent  the  real  numbers  form  only  a 
relative  continuum,  i.e.  one  which  is  relative  to  the  particular  scale  of  measure- 
ment employed;  and  he  contemplates  the  conception  of  an  absolute  continuum, 
for  the  representation  of  which  his  series  of  sets  of  infinite  and  infinitesimal 

*  See  his  Fondamenti  di  Gcometria,  Pisa,  1894 ;   a  German  translation  by  Schepp  has  been 
published  in  Leipzig. 
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numbers  are  requisite.    A  segment,  which  in  a  given  scale  is  finite,  may  be 
infinitesimal,  or  infinite  of  any  order,  when  measured  relatively  to  another  scale. 

The  validity  of  Veronese's  system  has  been  criticized  by  Cantor  and  others, 
on  the  ground  that  the  definitions  contained  in  it,  relating  to  equality  and 
inequality,  lead  to  contradiction ;  it  is  however  unnecessary  for  our  purpose 
to  enter  into  the  controversy  on  this  point.  The  straight  line  of  geometry  is 
an  ideal  object  of  which  any  properties  whatever  may  be  postulated,  provided 
that  they  satisfy  the  conditions,  (1)  that  they  form  a  valid  scheme,  i.e.  one 
which  does  not  lead  to  contradiction,  and  (2)  that  the  object  defined  is  such 
that  it  is  not  in  contradiction  with  empirical  straightness  and  linearity. 
There  is  no  a  priori  objection  to  the  existence  of  two  or  more  such  adequate 
conceptual  systems,  each  self-consistent,  even  if  they  be  incompatible  with  one 
another;  but  of  such  rival  schemes  the  simplest  will  naturally  be  chosen  for 
actual  use.  Assuming  then  the  possibility  of  setting  up  a  valid  non-Archimedean 
system  for  the  straight  line,  still  the  simpler  system,  in  which  the  principle  of 
Archimedes  is  assumed,  is  to  be  preferred,  because  it  gives  a  simpler  conception 
of  the  nature  of  the  straight  line,  and  is  adequate  for  the  purposes  for  which 
it  was  devised.  The  case  of  the  non-Euclidean  systems  of  geometry  is  an 
instance  of  the  existence  of  valid  geometrical  schemes  divergent  from  one 
another,  which  nevertheless  all  afford  a  sufficient  representation  of  physical 
space -percepts. 

An  answer  to  the  difficult  question,  in  what  sense  the  straight  line,  or  a 
space  of  two  or  of  three  dimensions,  admits  of  being  regarded  as  an  aggregate 
of  points,  can  only  be  discussed  after  a  full  treatment  of  the  nature  and 
properties  of  infinite  aggregates  has  been  developed.  The  discussions  in 
Chapters  n,  ill  and  iv,  of  infinite  aggregates,  and  especially  of  the  notion 
of  the  power  or  cardinal  number  of  such  an  aggregate,  will  throw  light  upon 
this  subject. 
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46.  AN  aggregate  of  real  numbers,  each  element  of  which  consists  of  a 
single  real  number,  is  defined  by  any  prescribed  set  of  rules  or  specifications 
which  are  of  such  a  nature  that,  when  any  real  number  whatever  is  arbitrarily 
assigned,  they  theoretically  suffice  to  determine  whether  such  real  number 
does  or  does  not  belong  to  the  aggregate.  The  difficulty  of  regarding  an 
aggregate,  so  defined,  as  a  definite  object,  is  bound  up  with  the  difficulties 
connected  with  the  notion  of  the  linear  continuum,  i.e.  the  aggregate  of  all 
real  numbers,  out  of  which  the  defined  aggregate  is  to  be  obtained  by  a  pro- 
cess of  selection  which,  except  in  the  case  of  a  finite  aggregate,  can  never  be 
actually  carried  out  in  its  entirety,  but  which  is  determined  by  a  rule  or  set 
of  rules.  The  precise  scope  of  the  definition  will  be  rendered  clearer  by  the 
consideration  of  various  classes  of  actually  defined  aggregates  which  will  be 
considered  in  the  present  Chapter;  moreover,  the  theoretical  difficulties  of  the 
notion  of  such  an  aggregate,  in  general,  will  be  in  some  measure  elucidated 
by  the  discussions  in  the  present  and  the  following  Chapters,  of  the  notion  of 
the  power,  or  cardinal  number,  of  an  aggregate. 

In  accordance  with  the  principle  explained  in  §  44,  each  number  of  a  given 
aggregate  may  be  represented  by  a  single  point  on  a  fixed  straight  line;  thus 
to  an  aggregate  of  numbers  there  corresponds  an  aggregate  of  points  on  the 
straight  line.  An  aggregate  of  single  numbers,  or  of  their  equivalent  points, 
we  shall  speak  of  as  a  linear  set  of  points. 

The  theory  of  linear  sets  of  points,  of  which  the  present  Chapter  contains 
an  account,  arose  historically  from  the  discussion  of  questions  connected  with 
the  theory  of  Fourier's  series  and  of  the  functions  which  can  be  represented 
by  such  series.  A  consideration  of  the  properties  and  peculiarities  of  the 
sets  of  points  at  which  infinities  or  other  discontinuities  of  such  functions 
exist,  led  to  a  study  of  the  properties  of  linear  sets  in  general,  and  to  the 
development  by  G.  Cantor,  P.  Du  Bois  Reymoncl,  Bendixson,  Harnack,  and 
others,  of  a  general  theory  which  has  lately  received  wide  applications  both 
in  Analysis  and  in  Geometry. 

Corresponding  to  the  theory  of  linear  sets  of  points,  there  exist  theories 
of  plane,  solid,  or  p-dimensional  sets  of  points.  Each  element  of  an  aggregate 
in  ^-dimensions  consists  of  an  association  of  p  real  numbers  (xw,  .T('->,  ...xip}), 
and  such  an  element  is  spoken  of  as  a  point  in  jt?-dimensional  space.  The 
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p-dimensional  continuum  consists  of  the  aggregate  of  all  such  points,  when 
each  of  the  numbers  a?'1',  #(>), ...  x^  may  have  any  value  in  the  linear  arith- 
metic continuum. 

A  considerable  part  of  the  theory  of  sets  of  points  in  two  or  more  dimensions 
is  completely  parallel  to  the  theory  of  linear  sets  of  points;  the  proofs  of  many 
theorems  for  linear  sets  of  points  being  applicable  without  essential  modifica- 
tion to  the  case  of  sets  in  any  number  of  dimensions.  This  remark  applies 
both  to  the  descriptive  properties  of  sets  considered  in  the  present  Chapter 
and  to  the  metric  properties  which  will  be  discussed  in  Chapter  III.  At  a 
certain  stage  of  the  descriptive  theory,  however,  the  properties  of  linear  sets 
of  points  exhibit  a  simplicity  which  the  properties  of  sets  in  two  or  more 
dimensions  do  not  possess.  The  greater  complexity  in  the  case  of  two  or 
more  dimensions,  than  in  one  dimension,  corresponds  to  the  greater  com- 
plexity of  the  Analysis  Situs  for  space  of  two  or  more  dimensions  than  in  the 
case  of  linear  space. 

In  the  present  Chapter  an  account  will  be  given  both  of  those  descriptive 
properties  which  are  practically  independent  of  the  number  of  dimensions  of 
the  sets,  and  of  those  properties  in  which  essential  divergence  exists  between 
the  case  of  linear  sets  and  that  of  sets  in  two  or  more  dimensions. 

The  whole  theory  of  sets  of  points  is  essentially  an  arithmetical  theory ; 
the  geometrical  nomenclature  and  representation  is  a  matter  of  convenience, 
not  of  necessity. 

THE  UPPER  AND  LOWER  BOUNDARIES  OF  A  LINEAR  SET  OF  POINTS. 

47.  A  simple  case  of  a  linear  set  of  points  is  that  in  which  the  set  consists 
of  all  the  points  of  a  linear  interval  (a,  6)  either  closed  or  open,  in  accordance 
with  the  definition  of  such  an  interval  given  in  §  41. 

Thus  the  set  of  points  which  form  a  closed  interval  (a,  b)  consists  of  all 
points  x  such  that  a  ^  x  ^  b ;  and  the  set  of  points  of  an  open  interval  (a,  b) 
consists  of  all  points  x  such  that  a  <  x  <  b.  Either  of  the  sets  of  points  x  for 
which  a  £  a;  <  b,  or  a  <  x  £  b,  may  be  said  to  be  the  points  of  a  semi-closed 
interval  (a,  b),  open  at  b  or  at  a. 

A  point  x  of  the  set  forming  a  closed  interval  (a,  b)  is  said  to  be  in  the 
interval  or  segment  (a,  b).  A  point  x  of  the  set  forming  an  open  interval  (a,  b) 
is  said  to  be  within,  or  interior  to,  the  interval  or  segment  (a,  b),  or  is  said  to 
be.  in  the  open  interval  (a,  b). 

Let  a  set  of  points  be  such  that  every  point  of  the  set  lies  upon  a  straight 
line,  the  position  of  each  point  being  determined  by  its  distance  from  a  fixed 
origin  upon  the  straight  line,  in  the  manner  explained  in  §  44.  If  a  point  ft 
exists,  such  that  no  number  of  the  set  is  greater  than  ft,  the  set  is  said  to  be 
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bounded  on  the  right.  In  this  case  it  will  be  shewn  that  there  is  a  definite 
point  b,  such  that  no  point  of  the  set  is  on  the  right  of  b,  and  such  that  either 
(1)  b  is  itself  a  point  of  the  set,  or  else  (2)  that  points  of  the  set  are  within  the 
interval  (b  —  e,  b),  however  small  the  positive  number  e  may  be  taken  to  be  ; 
or  that  both  the  conditions  (1)  and  (2)  are  satisfied. 

The  point  b  may  or  may  not  itself  be  a  point  of  the  given  set.  In  either 
case  it  is  said  to  be  the  upper  boundary  of  the  given  set.  If  b  is  itself  a  point 
of  the  given  set,  it  is  said  to  be  the  upper  extreme  point  of  the  set. 

When  there  are  points  of  the  given  set  interior  to  the  interval  (b  —  e,  6) 
for  every  value  of  e  (<  b),  the  point  b  is  said  to  be  the  upper  limit  of  the  set. 

In  case  b  is  both  the  upper  limit,  and  the  upper  extreme  point,  of  the  set, 
the  upper  limit  is  said  to  be  attained;  and  b  may  then  be  called  the  maximum 
point  of  the  set. 

To  prove  the  existence*,  under  the  condition  stated,  of  an  upper  boundary, 
as  above  defined,  it  may  be  observed  that  all  the  numbers  of  the  continuum 
of  real  numbers  can  be  divided  into  two  classes,  one  of  which  contains  every 
number  which  is  greater  than  all  the  numbers  of  the  set,  and  the  other  of 
which  contains  every  number  which  either  belongs  to  the  set  or  is  less  than 
some  or  all  of  the  numbers  of  the  set.  The  section  thus  specified  defines  a 
number  b  which  is  the  upper  boundary  of  the  set. 

In  a  similar  manner,  it  may  be  shewn  that,  if  the  set  is  bounded  on  the 
left,  i.e.  if  a  point  can  be  found  such  that  all  the  points  of  the  set  are  on  the 
right  of  such  point,  then  a  point  a  exists,  which  is  such  that  no  points  of  the 
set  are  on  the  left  of  a,  and  such  that  either  a  is  a  point  of  the  set,  or  else 
points  of  the  set  are  within  every  interval  (a,  a  4-  e),  where  e  is  an  arbitrary 
positive  number,  or  else  that  both  conditions  are  satisfied  simultaneously. 

In  case  points  of  the  set  lie  within  every  interval  (a,  a  +  e),  then  a  is  called 
the  lower  limit  of  the  set ;  and  the  lower  limit  is  said  to  be  attained  if  a  be 
itself  a  point  of  the  set.  In  any  case  in  which  a  is  a  point  of  the  set,  it  is  then 
said  to  be  the  lower  extreme  point  of  the  set.  The  term  lower  boundary  may 
in  all  cases  be  applied  to  a. 

A  set  of  points  which  has  both  an  upper  and  a  lower  boundary  is  said  to 
be  a  bounded  set.  Thus  a  set  is  bounded  if  every  point  x  in  it  is  such  that 
!  x  I  <  A,  where  A  is  some  fixed  positive  number. 

48.  If  no  point  /3  exists,  which  is  such  that  no  point  of  the  set  is  on  the 
right  of  /3,  then  the  set  is  said  to  be  unbounded  on  the  right ;  or  it  is  said 
that  the  upper  limit  of  the  set  is  +  oo  ;  the  two  statements  being  regarded 
as  tautological.  Similarly,  if  no  lower  boundary  a  exists,  the  set  is  said  to  be 
unbounded  on  the  left ;  or  it  is  said  that  the  lower  limit  is  —  oo . 

*  The  existence  of  upper  and  lower  boundaries  was  proved  by  Weierstrass,  in  his  lectures.    See 
also  Bolzano,  Abh.  d.  Bohmischen  Gesellach.  d.  Wiss.,  vol.  v,  Prag,  1817. 
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The  symbols  +  oo  ,  —  oo  do  not  really  represent  numbers ;  they  must  be 
taken  to  represent  what  is  sometimes  spoken  of  as  the  improperly  infinite, 
i.e.  the  mere  absence  of  an  upper  or  a  lower  boundary  respectively.  In  order, 
however,  to  avoid  circumlocution  in  the  statement  of  theorems  concerning  sets, 
it  is  usually  convenient  to  speak  of  +  oo  ,  -  x  ,  used  in  the  above  sense,  as  if 
they  were  numbers  which  correspond  to  upper  and  lower  limits  respectively. 

The  statement  that  +  oo  is  the  upper  limit  of  a  given  set  is  thus  taken 
to  be  equivalent  to  the  statement  that,  if  A  is  an  arbitrarily  chosen  positive 
number,  there  exist  points  #  of  the  set  such  that  x  >  A.  Similarly  if  —  oo  is 
the  lower  limit  of  a  set,  there  are  points  x  of  the  set  such  that  x  <  —  A. 

In  the  present  Chapter,  it  will  frequently  be  assumed  that  the  sets  treated 
of  are  bounded  ;  and  the  interval  (a,  6)  will  be  said  to  be  the  interval  in  which 
the  set  exists.  This  restriction  is  not  so  great  a  one  as  might  at  first  sight 
appear;  for  an  unbounded  set  can  be  placed  into  correspondence  with  a  bounded 
one,  in  such  a  manner  that  the  relative  order  of  any  two  points  in  the  one 
set  is  the  same  as  that  of  the  corresponding  points  in  the  other  set.  If 
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x'  =  — r= ,  where  the  radical  is  taken  to  have  always  the  positive  sign,  then 

to  a  point  x,  in  the  unlimited  interval  (—  oo  ,  +  oo  ),  there  corresponds  a  point  x, 
in  the  open  interval  (—1,4-1);  and  also  #/  =  x2',  according  as  o^  =  x*  The 
same  object  might  have  been  attained  by  using  the  transformation 
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x  =  —  tan"1  x. 
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There  is  no  real  loss  of  generality  in  considering  only  such  sets  as  lie  in 

//*  ""-*  Q 

a  given  interval,  say  (0, 1);  for  the  relation  #'/  =  7j —  establishes  a  complete 

P  M 

correspondence  between  sets  in  the  interval  (a,  /3)  and  sets  in  the  interval 
(0,  1),  the  relative  order  of  points  being  preserved  in  the  correspondence. 

The  points  of  the  interval  (a,  /3)  may  be  made  to  correspond  in  order  with 
the  points  of  the  interval  (0,  1),  in  such  a  manner  that  an  arbitrarily  chosen 
point  7  within  (a,  /3),  corresponds  to  an  arbitrarily  chosen  point  within  (0, 1); 
for  example  the  point  -|.  This  correspondence  can  be  effected  by  the  trans- 
formation 

x  x  —  a  j  —  /3 

of  —  \         x  —  (3'  y  —  a 

NON-LINEAR  SETS  OF  POINTS. 

49.  A  plane,  or  two-dimensional,  set  of  points  is  an  aggregate  of  which 
each  element  is  constituted  by  a  pair  of  real  numbers  (#U),  #(2)).  Such  an 
element  may  be  spoken  of  as  a  point  P  in  a  plane  ;  the  rectangular  coordinates 
of  P  being  x(l>,  #I2).  Corresponding  to  intervals,  in  the  case  of  linear  sets  of 
points,  we  now  consider  rectangles,  or  cells,  of  which  the  sides  are  parallel  to 
the  coordinate  axes. 
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A  closed  cell  (a(1),  a(2)  ;  b^,  6<2')  is  regarded  as  the  set  of  all  points  (a;'1', 
such  that  a'1'  ^  x™  <  &a>,  a<2>  ^  x®  £  6(2'. 

An  open  cell  (a(1),  a(2)  ;  6(1),  6(2))  is  regarded  as  the  set  of  all  points  (a;(1), 
such  that  a'1'  <  xm  <  JID,  a<«  <  ^w  <  &<». 

The  set  of  all  points  (ar(1>,  x^)  such  that  a(1)  ^  x(»  <  bw,  a<2)  £  x®  <  6'2>,  may 
be  spoken  of  as  the  semi-closed  cell  (a(1),  a(2)  ;  6(1),  6(2)). 

A  plane  set  of  points  (x(l},  a;(2))  is  said  to  be  bounded,  in  case  two  positive 
numbers  Aw,  J.(2)  exist,  such  that  j  xw  £AW,  \  x®  \  £  A®,  for  every  point 
(xV,x®)  of  the  set. 

When  all  the  points  of  a  given  set  are  in  a  closed  cell  (a(1),  a(2);  6(1),  6(2)), 
this  cell  may  be  spoken  of  as  a  fundamental  cell  for  the  given  set  of  points. 

More  generally,  a  p-dimensional  set  of  points  consists  of  elements  each 
of  which  is  constituted  by  an  association  (#(1),  x®,...xip))  of  p  real  numbers. 

The  ^-dimensional  closed  cell  (a(1),  aw,...a<»  ;  bw,  6(a»,...6'«)  is  the  set  of  all 
points  O(1>,  x®,  .  .  .«<«)  such  that  an>  ^  xw  £  6'1',  a(2>  ^  x®  <  6'2'  ,  ......  a<^>  ^  ««*•)  ^  &<«. 

The  p-dimensional  open  cell  consists  of  all  points  such  that  a(l)  <  #(1)  <  6(1), 
a(2)<a,(2)<^)(2))  ......  a(p)  <a;(p)  <  ^(P)  •  an(i  the  semi-closed  cell  consists  of  all  points 

such  that  a(l)  ^  a;'1'  <  6(I),  a(2)  ^  «(2)  <  b(~\  ......  a(*}  ^  x^  <  b(v\ 

All  points  of  the  closed  cell  such  that  x(q)  =  a(q\  or  x(i}  =  b(q)  for  some  value 
of  q  (=  1,  2,  3,  ...p)  are  said  to  be  on  the  boundary  of  the  cell. 

If  positive  numbers  A(l\A("\...A(P}  exist,  such  that  for  all  points  (rf^x^,.  .  .X(P]) 
belonging  to  a  given  set  of  points  the  conditions 


are  satisfied,  then  the  p-dimensional  set  is  said  to  be  bounded,  and  any  closed 
cell  in  which  all  the  points  of  the  set  are  contained  may  be  spoken  of  as  a 
fundamental  cell  for  the  given  set.  If  no  such  fundamental  cell  exists,  for  a 
given  set  of  points,  that  set  is  said  to  be  unbounded. 

An  unbounded  set  of  points  (x(i\  x(-\  .  .  .d&)  may  be  correlated  with  a  bounded 
set  (f(l),  |(2),...f(p))  for  which  a  fundamental  cell  is  (-1,  -1....-1;  1,  !,...!)  by 
means  of  the  relations 

£<<?>  =  a.(«)/{i  +  (x^ff,   (q  =  1,  2,  3,  .  .  .p}  ; 
or  by  means  of  the  relations' 

£W  =  2  tan-1  a/«',    (q  -  1  ,  2,  3,  .  .  .«). 

7T 

Any  cell  will  be  correlated  with  another  cell  by  means  of  either  of  these 
relations.  It  is  clear  that  these  relations  are  only  examples  of  an  indefinite 
number  of  other  relations  by  which  a  similar  correlation  may  be  obtained. 

In  order  that  every  point  on  the  boundary  of  the  cell  (-1,  —1,...;  1,  1...) 
may  correspond  to  a  point  in  the  ar-space,  it  is  convenient  to  adjoin  a  set  of 
points  at  infinity  to  that  space.  For  example,  in  the  case  p  =  2,  points  (a^1*,  GO  ) 


49,  so]  Convergent  sequences  of  Intervals  or  Cells  63 


for  all  values  of  OLM,  points  (a^0,  —  oo  )  and  points  (oo  ,  a^1),  (—  oo  ,  XM)  will  be 
taken  to  correspond  to  the  points  (gl},  1),  (gl),  —  1  ),  (  1,  £'2)),  (—  1,  p')  respectively. 
Also  the  four  points  (±00,  ±00)  will  be  adjoined  so  as  to  correspond  to  the 
four  points  (  +  1,  ±  1)  in  the  finite  rectangle.  In  this  manner  it  can  be  assumed 
as  a  matter  of  convenience  that  the  boundary  of  the  rectangle  (—!,  —  !;  1,  1) 
corresponds  to  a  boundary  adjoined  to  the  #-space.  The  advantage  of  this 
procedure  is  that  no  exception  need  be  made  as  regards  the  correspondence 
of  an  £-set  with  an  a>set,  when  the  f-set  has  points  on  the  boundary  of  the  cell 
in  which  it  is  contained. 

In  stating  properties  of  sets  of  points  which  are  independent  of  the  number 
of  dimensions  it  is  frequently  convenient  to  employ  a  notation  in  which  the 
number  p  of  dimensions  does  not  appear.  To  effect  this  the  point  (atl),  aP\  .  .  .a^) 
of  a  p-dimensional  set  may  be  denoted  by  a  single  letter  x,  as  in  the  case  of 
a  linear  set.  The  cell  (a(1),  a(2),  .  .  .a(?};  bM,  6(2),  .  .  .b(P})  may  then  be  denoted  by  \a,  b). 
Thus  (a,  b)  denotes  an  interval  when  p=l,  and  a  cell  when  p  >  1. 


LIMITING  POINT  OF  A  CONVERGENT  SEQUENCE  OF  INTERVALS,  OR  CELLS. 

50.  Let  (ttj,  6^,  (0.3,  b2), ...  (an,  bn) ...  be  an  unending  sequence  of  closed 
intervals  which  are  such  that  any  one  of  them  (an,  bn)  lies  entirely  in  the 
preceding  one  (an_!,  &„_,),  the  two  having  at  most  one  end-point  common;  thus 
an  ^  «n-i,  bn  £  &„_,;  moreover,  suppose  that  the  lengths  b±  —  aly  b2  —  a2, ... 
bn  —  an,...  form  a  sequence  which  converges  to  zero,  the  condition  for  which 
is  that,  corresponding  to  any  arbitrarily  small  e,  n  can  be  so  chosen  that  bm  —  am, 
for  all  values  of  m  which  are  ^  n,  is  <  e.  It  will  be  seen  that,  in  accordance 
with  the  axioms  explained  in  §  44,  there  exists  one  point  and  one  only  which  is 
in  every  interval  of  the  sequence.  This  point  may  be  called  the  limiting  point 
of  the  convergent  sequence  of  closed  intervals. 

Each  of  the  aggregates  (alt  «2,  a3,...an,...),  (6,,  &2,...&n,...)  being  con- 
vergent, defines  a  number ;  and  in  fact,  in  virtue  of  the  definition  of  equality 
in  §  26,  they  define  the  same  number  x.  This  number  x  is  not  less  than  an 
and  not  greater  than  bn,  whatever  n  may  be ;  the  point  x  therefore  lies  in  all 
the  intervals,  and  is  the  limiting  point  whose  existence  was  to  be  remarked. 
If  y  be  any  number  greater  (or  less)  than  a;  we  can  find  n  so  great  that 
bn  —  'f<y-x,  if  y  >  x ;  or  that  x  —  an  <  x  —  y,  if  .r  >  \j  :  thus  y  does  not  lie  in 
(an,  bn).  Hence  there  is  only  one  point  which  satisfies  the  prescribed  conditions. 

If  for  every  n,  from  and  after  some  fixed  value,  the  inequalities  an  >  an_, , 
bn<bn-i  both  hold,  then  the  limiting  point  #  is  in  the  interior  of  all  the 
intervals  of  the  sequence.  If,  from  and  after  some  fixed  value  of  n,  say  /;, , 
we  have  an  =  an_1;  bn<  bn-lt  the  limiting  point  x  coincides  with  the  common 
end-points  a,,^,  artl,  a,,1+lj.... 
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If  the  intervals  (aj,  b}\  (a2,  b^),...(an,  &„)...  are  all  open  intervals,  there  is 
not  necessarily  any  point  which  is  in  all  the  open  intervals.    Consider,  for 

example,  the  set  of  open  intervals  (0, 1),  (0, •= } ,  (0, 5) ...  (0,-), In  this 

\    2/     \     o/        \     n) 

case  the  point  x  defined  by  either  of  the  sequences 


(0,0,0,..), 

is  the  point  0,  which  is  not  in  any  of  the  open  intervals  of  the  given  set.  It 
thus  appears  that  the  theorem  does  not  hold  for  the  set  of  open  intervals  if, 
from  and  after  some  fixed  value  of  n,  we  have  an  =  an+1,  or  if,  from  and  after 
some  fixed  value  of  n,  we  have  bn=bn+l.  If  however,  from  and  after  some 
fixed  value  of  n,  both  the  conditions  an<an+l,  bn>bn+1  are  satisfied,  the 
theorem  is  valid ;  for  the  point  x,  defined  as  above,  is  a  point  in  every  open 
interval  (an,  bn)  from  and  after  some  fixed  value  of  n. 

If  the  intervals  of  the  given  set  are  semi-closed  ;  being  all  open  at  the 
right-hand  end-points,  the  theorem  holds  good  unless  bn  =  bn+1,  from  and  after 
some  fixed  value  of  ??. 

In  the  case  of  a  sequence  of  ^-dimensional  closed  cells 

(n  (1)  a  (2)        n  W  •  b  <!>  />  '2)        h  (p)\ 
\**n    >  <%    >  •  •  •  un      •>  VH    )  Vn    >••  -Un     )> 

where  n  =  1,  2, 3, . . . ,  each  of  which  is  in  the  preceding  one,  so  that  an+l(q]  =  an(q\ 
and  bn+1{q}  ^  bn  {q},  for  all  values  of  n,  and  for  the  values  1,  2,  3, . . . ,  p,  of  q,  and  if 
bn(q)  —  an(q\  for  each  value  of  q,  forms  a  sequence  of  numbers  which  converges  to 
zero,  for  n  —  1,2,3,...,  there  is  a  unique  point  which  is  in  all  the  cells  of  the  set, 
and  which  is  called  the  limiting  point  of  the  set.  For  the  sequence  of  linear 
intervals  (an(q\  bn(q))  defines,  for  each  value  of  q,  a  single  point  atq)  in  all  the 
intervals.  Thus  the  point  (#(l),  a/2', . .  .x(q)},  uniquely  determined,  is  in  all  the 
cells  of  the  sequence,  and  it  is  easily  seen  that  it  is  the  only  point  that  satisfies 
this  condition. 

If  the  cells  are  open,  the  corresponding  theorem  does  not  hold,  unless  there 
exists  some  value  of  n  from  and  after  which  an(q}  <  an+i(q\  bn(q]  >  bn+liq),  for  each 
value  of  q. 

If  the  cells  are  semi-closed,  for  the  validity  of  the  theorem  the  condition 
bn(q)  >  bn+l(q}  must  be  satisfied,  from  and  after  some  fixed  value  of  n,  for  every 
value  of  q. 

(<I=P  )i 

The  number  <  ^    (b(q)  —  a(9))H    may  be  called  the  span,  or  the  length  of  the 

U=i  J 

diagonal,  of  the  cell  (a(l),  «.(a),  ...a'^;  b(l\b(2\...b{^).  This  number  may  also  be 
spoken  of  as  the  distance  between  the  two  points  («(l),  a<2), . .  .a'^);  (b(l\  b(*\ .  ..b(p}), 
and  may  be  denoted  by  PQ,  when  the  points  are  denoted  by  P,  Q  respectively. 

A  convergent  sequence  of  closed  cells  is  one  in  which  each  cell  is  in  the 
preceding  one,  and  such  that  the  sequence  of  spans  of  the  cells  converges  to 
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zero;  such  a  convergent  sequence  defines  a  unique  point  in  all  the  cells  of 
the  sequence. 

In  the  case  of  such  a  convergent  sequence  of  cells,  the  convergence  of  the 
sequence  of  spans  to  the  limit  zero  ensures  that,  for  each  value  of  q,  where 
l£q£p,  bn(9]  —  <'n<9)  converges  to  zero,  as  n  is  indefinitely  increased.  A  sequence 
of  closed  cells,  each  of  which  contains  the  next,  may  however  be  such  that 
6n(9'  —  anW  converges  to  zero  for  some  of  the  values  of  q,  but  not  for  all  these 
values.  In  that  case  the  sequence  of  cells  may  be  said  to  be  incompletely  con- 
vergent ;  the  points  of  a  cell  of  dimensions  lower  than  p  will  all  be  contained 
in  each  of  the  cells  of  the  given  sequence.  This  cell  may  be  termed  the  limiting 
cell  (or  interval)  of  the  given  sequence. 

For  example,  if  p  =  3,  we  may  have  6n(1)  -  an(l)  and  bn®  -  an(2)  converging 
to  zero,  but  6n(3)  —  an(3)  converging  to  some  number  greater  than  zero  ;  in  that 
case  the  sequence  converges  to  a  limiting  interval.  Again,  if  6n(l)—  «n(1)  converges 
to  zero,  but  6n(2)  —  an(2),  bn{3)  —  an(3)  converge  to  numbers  greater  than  zero,  the 
limiting  cell  is  two-dimensional. 


SYSTEMS   OF   NETS. 

51.  Let  the  linear  interval  (a,  b)  be  divided  into  a  number  raj  of  parts,  such 
that  the  lengths  of  all  the  parts  do  not  exceed  a  positive  number  6^  let  Z)j 
denote  the  finite  set  of  intervals  so  obtained.  By  introducing  further  points 
of  division  of  (a,  $),  let  a  new  finite  set  of  intervals  D2  be  determined,  of  which 
the  number  is  m.2(>  raj),  and  let  the  length  of  each  interval  of  D2  not  exceed  a 
positive  number  82  (<  Sj).  Proceeding  in  this  manner,  there  can  be  defined,  in 
accordance  with  some  specified  set  of  rules,  a  sequence  Dly  D2,  ...  Dn,  ...  of 
finite  sets  of  intervals,  such  that  the  intervals  of  Dn  are  all  not  greater  than 
an  assigned  number  8n.  Let  it  be  supposed  that  the  numbers  81}  82,  ...  8n,  ... 
form  a  sequence  which  converges  to  zero. 

If  we  regard  the  intervals  of  each  set  Dn  as  semi-  closed,  i.e.  closed  on  the 
left  and  open  on  the  right,  except  that  the  extreme  interval  on  the  right  is 
taken  to  be  closed,  each  point  of  the  closed  interval  is  in  one  and  only  one  of 
the  intervals  of  the  set  Dn,  for  each  value  of  n. 

The  set  Dn  of  semi-closed  intervals  (except  that  the  extreme  one  on  the 
right  is  closed)  may  be  spoken  of  as  a  net,  and  the  nets  corresponding  to 
n=  1,  2,  3,...  may  be  called  a  system  of  nets  fitted  on  to  the  interval  (a,  b). 
The  particular  net  Dn  may  be  called  the  net  of  order  n,  or  the  nth  net,  of  the 
system.  The  mn  separate  intervals  of  the  net  Dn  may  be  spoken  of  as  the 
meshes  of  the  net  Dn. 

The  fundamental  property  of  such  a  system  of  nets  is  that  each  point  x  of 

the  interval  (a,  b)  is  in  one  and  only  one  mesh  of  each  of  the  nets  Dn. 

These    mashes,  for   n=l,    2,   3,   ...,   constitute   a   convergent   sequence  of 

intervals  of  which  the  point  x  is  the  limiting  point.    In  each  net  of  the 

H.  5 
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system  the  rank,  or  order,  of  the  meshes  may  be  taken  as  their  order  from  left 
to  right  in  the  interval  (a,  b). 

In  case  D,  consists  of  m  meshes  of  equal  breadth,  Z)2  of  m-  meshes  of 
equal  breadth,  and  in  general  Dn  consists  of  mn  meshes  of  equal  breadth,  the 
system  of  nets  will  be  said  to  be  symmetrical. 

In  some  cases  it  is  convenient  to  employ  nets  in  which  all  the  meshes  are 
closed  at  both  ends.  In  that  case  a  point  x  of  (a,  b)  that  is  an  end-point  of  a 
mesh  of  Dn  will,  in  general,  belong  to  two  adjacent  meshes  of  Dn,  and  of  all 
the  subsequent  nets  of  the  system.  Such  a  point  will  consequently  be  the 
limiting  point  of  more  than  one  sequence  of  meshes.  Such  a  system  of  nets 
may  be  spoken  of  as  a  system  of  nets  with  closed  meshes.  It  may  or  may  not 
be  a  symmetrical  system. 

In  case  the  interval  (a,  b)  is  replaced  by  an  indefinite  interval  (-00,00), 
(a,  oo  ),  or  (—  oo  ,  6),  the  number  of  meshes  of  a  net  fitted  on  to  such  indefinite 
interval  will  not  be  finite,  but  as  before,  the  set  of  numbers  81}  82,  ...  8n,  ... 
will  be  taken  to  converge  to  zero. 

The  definition  of  a  system  of  nets  can  be  extended  to  the  case  of  cells  of 
two,  or  of  any  number  p,  dimensions.  In  the  case  of  the  two-dimensional  cell 
(a(1),  a(l!);  6ll),  i(2)),  this  rectangle  is  divided  by  straight  lines  parallel  to  the  axes 
into  a  set  D-i  of  rectangular  cells,  the  spans  of  all  of  which  are  ^  8^  Some  or 
all  of  these  cells  are  again  divided  into  smaller  cells,  by  the  introduction  of 
j'resh  straight  lines  parallel  to  the  axes,  so  that  the  new  set  of  cells  _D2  is  such 
that  their  spans  are  all  ^  £>.  Proceeding  in  this  manner,  in  accordance  with 
a  specified  set  of  rules,  and  the  sequence  [8n]  being  taken  to  converge  to 
zero,  we  obtain  a  sequence  Dlt  D2,  ...  Dn,  ...  of  sets  of  cells.  We  take  each  of 
the  cells  of  Dn  to  be  semi-open,  except  that  those  which  have  a  side  in 
common  with  one  of  those  sides  of  *(a(l),  a<2);  6(l),  6(2))  which  do  not  contain  the 
point  (tt(1),  tt(2))  are  closed  along  that  side;  that  side  itself  being  taken  to  be 
semi-closed,  unless  it  has  (a(2),  b('2))  for  an  end-point,  in  which  case  it  is 
regarded  as  closed.  Each  of  the  sets  Dn  of  rectangular  cells  is  then  termed  a 
net;  each  of  the  cells  of  which  Dn  is  composed  is  termed  a  mesh  of  the  net  Dn\ 
and  the  totality  of  the  nets  {!)„}  is  called  a  system  of  nets  fitted  on  to  the 
cell  (a(l),  a<";  &<",  6<2)). 

Any  point  x  in  the  fundamental  cell  is  now  in  one  and  one  only  of  the 
meshes  of  Dn,  for  each  value  of  n;  and  consequently  x  is  the  limiting  point  of 
a  unique  sequence  of  meshes  of  the  cells  of  the  system. 

It  is  easy  to  arrange  the  meshes  of  a  net  Dn  in  ascending  order  or  rank. 
Thus  we  may  take  those  cells  which  have  a  side  in  common  with  a  part  of 
(a(1),  a(2) ;  6(1),  a(2))  in  ascending  order  from  left  to  right;  after  these  we  may  take 
in  order  from  left  to  right  those  cells  which  have  a  side  in  common  with  one  of 
riiose  just  referred  to;  then  we  take  in  order  Irom  left  to  right  those  cells 
which  have  a  side  in  common  with  one  of  these  latter,  and  so  on. 


51,52]  Systems  of  Nets  67 

If  the  nets  of  Dn  all  have  the  same  span  8n  for  each  value  of  n,  and  if  the 
ratio  of  8n+,  to  8n  is  independent  of  n,  the  system  of  nets  is  said  to  be 
symmetrical. 

As  before,  a  system  of  nets  may  be  fitted  on  to  the  unbounded  plane  of 


In  case  all  the  meshes  of  Dn,  for  every  value  of  n,  are  closed,  the  system 
is  said  to  be  a  system  of  nets  with  closed  meshes.  In  this  case  the  sequence 
of  meshes  of  which  the  limit  is  a  prescribed  point  (j?(1),  a^2))  is  in  general  not 
unique. 

It  is  unnecessary  to  point  out  in  detail  the  corresponding  definition  and 
convention  for  the  case  of  a  ^-dimensional  cell,  when  p  >  2.  It  is  clear  that 
the  number  of  dimensions  makes  no  difference  as  regards  the  properties  of  a 
system  of  nets. 

It  will  appear  that  a  system  of  nets  provides  an  apparatus  which  is  of 
considerable  utility  as  an  instrument  of  investigation  in  the  theory  of  sets  of 
points,  and  in  that  of  functions  of  one  or  more  variables.  The  employment  of 
this  nomenclature  has  the  advantages  of  preventing  repetition  in  the  proofs 
of  various  important  theorems,  and  of  providing  the  means  of  giving  those 
proofs  in  a  form  applicable  not  only  to  linear  sets  of  points  or  to  functions  of 
one  variable,  but  also  to  sets  of  any  number  of  dimensions,  or  to  functions  of 
any  number  of  variables. 

Although  the  method,  equivalent  to  the  employment  of  systems  of  nets,  has  been  in 
use  for  a  considerable  time  past,  the  first  formulation  of  the  idea  of  a  system  of  nets 
appears  to  be  due  to  de  la  Vallee  Poussin*.  His  terminology  is  however  not  identical  with 
that  here  employed.  He  speaks  of  what  is  here  termed  a  system  of  nets,  as  simply  a  net 
(reseau),  and  what  is  here  called  a  net  is  termed  by  him  a  lattice  (grillage).  Moreover  he 
considers  only  what  are  here  described  as  symmetrical  systems  of  nets,  and  this  is 
sufficient  for  the  special  purpose  for  which  he  employs  them.  For  the  more  general 
purposes  of  investigation,  it  is  however  convenient  not  to  make  this  restriction  ;  ac- 
cordingly the  wider  definition  given  above  has  been  here  adopted. 

THE  LIMITING  POINTS  AND  THE  DERIVATIVES  OF  A  SET. 

52.  If  a  point  cc  be  taken  in  the  interval  (a,  b),  an  interval  (x  —  e,,  x  +  e,) 
which  lies  entirely  in  (a,  6)  is  called  a  neighbourhood  of  the  point  x  ;  and  this 
neighbourhood  may  be  made  as  small  as  we  please  by  proper  choice  of  ex  and  e.,. 
An  interval  (x,  x  +  e2)  is  called  a  neighbourhood  of  x  on  the  right,  and  (x  —  e,,  ./•) 
is  called  a  neighbourhood  of  x  on  the  left.  The  end-points  a  and  b  can  only 
have  neighbourhoods  on  the  right  and  the  left  respectively.  A  neighbourhood 
of  a  point  may  be  open  or  closed. 

In  the  case  of  a  point.  (#"',  #(2))  in  a  plane,  the  cell 

(a-d)  _  ei<n  d*  _  6j(2).  ^D  +  €MI  xw  +  €M) 

*  See  his  treatise  Inteijrales  de  Lcbesyite,  Paris,  1916,  p.  16,  et  *e<i. 
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is  called  a  neighbourhood  of  (#(1),  a^2')  ;  it  may  be  closed  or  open.  Corre- 
sponding to  neighbourhoods  on  the  right  or  left,  of  a  point  x  of  a  linear 
interval,  there  are  in  the  plane  four  partial  neighbourhoods  of  the  point 
),  viz.  the  four  rectangles 


In  p  dimensions,  similar  definitions  hold  good;  a  point  has  2p  partial  neigh- 
bourhoods. The  neighbourhoods  can  always  be  supposed  to  be  in  a  funda- 
mental cell,  to  which  the  point  is  confined. 

If   a    set    of  points    G,    in    any    number    of   dimensions,    have    been 
defined,  and  a  point  P  be  such  that  every  neighbourhood  of  it  contains  a  point     JL 
of  G,  other  than  P  itself,  the  point  P  is  called  a  limiting  point  of  the  set  G, 
whether  P  belongs  to  G  or  not. 

It  should  be  observed  that,  if  every  neighbourhood  of  P  contains  a  point 
of  G  ,  other  than  P,  it  must  contain  an  infinite  number  of  points  of  G.  For 
let  us  suppose  that  a  certain  neighbourhood  H  of  P  contains  only  a  finite  set 
of  points  PJ,  p2,  ...  pm  belonging  to  G.  We  may  suppose  P  to  be  the  limiting 
point  of  the  sequence  \dn\  of  meshes  belonging  to  the  nets  {/)„}  of  a  system  of 
nets  fitted  on  to  H,  and  such  that  P  is  interior  to  all  the  meshes  dn.  Each  of 
the  points  pl,  p2,  ...  pm  can  be  in  only  a  finite  number  of  the  meshes  dn  of  the 
sequence;  hence,  from  and  after  some  fixed  value  of  n,  dn  contains  no  point  of 
G;  but  dn  is  a  neighbourhood  of  P  containing  no  points  of  G,  other  than  P 
itself,  which  is  contrary  to  the  hypothesis  made  above. 

A  limiting  point  of  G,  defined  as  above,  is  also  called  a  point  of  accumulation. 
Another  definition  of  a  limiting  point  is  that  it  is  a  point  P  such  that  G 
contains  a  sequence  of  points  [Pn]  for  which  the  distance  PPM  is  less  than 
an  arbitrarily  chosen  number  e,  for  all  sufficiently  large  values  of  n.  A  point 
which  satisfies  this  condition  is  certainly  a  limiting  point,  in  accordance  with 
the  definition  here  adopted,  but  the  question  whether  the  converse  holds  will 
be  discussed  in  Chap,  iv,  in  connection  with  the  multiplicative  axiom. 

The  fundamental  theorem  will  now  be  established,  that:  Every  bounded 
set  of  points  G  which  contains  an  infinite  number  of  points  possesses  at  least  one 
limiting  point. 

We  may  suppose  a  system  of  nets  to  be  fitted  on  to  the  fundamental  cell, 
or  interval,  in  which  G  is  contained.  Of  the  meshes  of  D1  at  least  one  must 
contain  an  infinite  set  of  points  belonging  to  G',  let  c?,  be  that  one  of  such 
meshes  which  is  of  lowest  rank.  Of  the  meshes  of  D.2  which  are  contained  in 
dl  at  least  one  must  contain  an  infinite  set  of  points  belonging  to  G',  let  d2  be 
that  one  of  such  meshes  which  is  of  lowest  rank.  Proceeding  in  this  manner 
a  definite  sequence  of  meshes  dlt  d.^,  ...  dn,  ...  is  defined,  each  of  which 
contains  an  infinite  set  of  points  belonging  to  G.  The  limiting  point  P  of 
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this  sequence  {dn}  of  meshes  is  a  limiting  point  of  G.  For  any  assigned 
neighbourhood  of  G  contains  dn>  from  and  after  some  fixed  value  of  n,  and 
therefore  contains  an  infinite  number  of  points  of  G. 

It  has  thus  been  shown  that  G  has  at  least  one  limiting  point.  It  may 
have  a  finite  number,  or  an  indefinitely  great  number,  of  limiting  points.  It 
should  be  observed  that  a  limiting  point  of  G  may  or  may  not  itself  be  a 
point  of  G.  In  the  case  of  a  linear  set  of  points,  if  either  boundary  of  the  set 
be  not  a  point  of  G,  it  is  certainly  a  limiting  point  of  G;  it  may  however  be 
both  a  point  of  G  and  a  limiting  point  of  G. 

A  limiting  point  P  of  a  linear  set  G1  is  a  limiting  point  on  both  sides,  if 
an  indefinitely  great  number  of  points  of  G  lie  in  every  neighbourhood  of  P 
on  the  right,  and  also  in  every  neighbourhood  of  P  on  the  left.  Otherwise  P 
is  a  limiting  point  of  G  on  one  side  only.  This  distinction  may  be  extended 
to  the  case  of  a  limiting  point  P  of  a  set  G  in  p  dimensions,  the  2P  partial 
neighbourhoods  of  P  corresponding  to  neighbourhoods  on  the  right  and  left 
of  a  point  of  a  linear  set. 

53.  The  theorem  of  §  52  does  not  hold  for  the  case  of  unbounded  sets  of 
points.  For  example,  the  linear  set  which  consists  of  the  points  1,  2,  3,  . . . ,  n,  . . . 
has  no  limiting  point,  in  accordance  with  the  definition  given  in  §  52.  The 

same  remark  applies  to  the  set  of  points  —  1,  —  2,  —  3,  . . . ,  —  n, If  we  apply 

to  either  of  these  sets  the  transformation  !;  =  x/*>/x*+l,  where  the  positive 
value  of  the  radical  is  always  taken,  the  sets  on  the  interval  of  £  which  cor- 
respond to  the  above  sets  are 

1/V2,         2/\/5,        3/VlO,          n/\/n*^l,    ...; 
and  -1/V2,     -  2/V5,     -3/VfO,       -  n/Vn»Ti,    ..., 

which  have  as  limiting  points  the  points  1,  —  1  respectively;  these  points  n, 
1,  —  1  being  the  end-points  of  the  interval  (—  1,  1)  of  £.  To  the  indefinitely 
great  interval  of  x,  which  is  an  open  interval,  there  corresponds  the  open 
interval  (— 1,  1)  of  £,  and  in  the  former  interval  there  are  no  points  which 
correspond  to  the  points  —  1,1  of  the  interval  of  x.  If  we  agree  to  adjoin  to 
the  set  of  real  numbers,  two  improper  numbers  —  oc  ,  x  ,  which  are  taken  to 
correspond  to  the  numbers  —1,1  in  the  closed  interval  (—  1,  1)  of  £,  we  now 
regard  (—  x  ,  x  )  as  the  closed  interval  of  x  corresponding  to  the  closed  interval 
(—  1, 1)  of  £.  To  a  neighbourhood  (1  —  6,  1)  on  the  left  of  the  point  £  =  1,  there 

will  correspond  the  interval  (    7-~— — \  ,  oo  ]  on  the  left  of  the  adjoined  point 

\6  y  —        € )  / 

x  =  x  in  the  linear  interval  of  x.  The  point  x  =  x  may  be  regarded,  in  an 
extended  sense  of  the  term,  as  the  limiting  point  of  the  set  of  points  1,  2,  3,  — 
It  corresponds  to  the  limiting  point  1  of  the  corresponding  set  1/V2,  2/Vo, 
3/VlO in  the  segment  (—1, 1). 
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In  this  extended  sense  of  the  term  "  limiting  point,"  the  point  oo  will  be 
a  limiting  point  of  any  set  G  which  is  such  that,  corresponding  to  any  arbitrarily 
chosen  positive  number  A,  there  are  points  x  of  G  for  which  x  >  A.  Similarly, 
the  point  -  oo  may  be  regarded  as  a  limiting  point  of  the  set  G,  if  there  are 
points  of  G  for  which  x  <  —  A,  for  all  values  of  the  positive  number  A.  When 
oo  or  —  oo  ,  is,  in  this  extended  sense  of  the  term,  a  limiting  point  of  G,  the 
point  1,  or  —  1,  is  a  limiting  point  of  the  set  of  points  which  corresponds  to  G 
in  the  £-  segment. 

To  a  neighbourhood  (x  —  e1,a;  +  e2)  of  a  point  x,  not  oo  or  —  oo  ,  in  the  interval 
(—00,00),  there  corresponds  a  neighbourhood  (£  —  e/,  f  +  e./)  of  the  point  £ 
which  corresponds  to  x.  It  is  easily  seen  that  sequences  of  values  of  en  e2  which 
converge  to  zero,  correspond  to  sequences  of  e/,  e/  which  converge  to  zero.  Thus 
a  finite  limiting  point  of  a  set  G  in  the  interval  (—  oo  ,  oo  )  of  x  corresponds  to 
a  limiting  point  of  the  corresponding  set  in  (—1,  1)  which  is  in  the  interval. 

It  thus  appears  that,  when  the  two  points  4-  oo  ,  —  oo  are  adjoined  to  the 
indefinite  interval  of  x,  so  that  it  becomes  a  closed  interval,  the  theorem  as  to 
the  existence  of  a  limiting  point  holds  of  the  closed  interval.  The  difference 
in  the  form  of  the  condition  that  either  of  the  points  oo  ,  —  oo  should  be  a 
limiting  point,  from  the  condition  applicable  to  a  finite  point,  is  seen  to  be 
unessential,  as  it  disappears  when  the  set  is  transformed  into  a  set  in  the  finite 
closed  interval  (—  1,  1). 

The  intervals  (A,  oo  ),  (—  oo  ,  —  A)  may  be  termed  neighbourhoods  of  the 
points  oo  ,  —  oo  respectively. 

The  case  of  a  plane  set  may  be  considered  in  a  similar  manner.  Employing 
the  transformation 


by  which  points  (x(l},  x®}  of  the  unclosed  infinite  cell  (—  oo  ,  —  oo  ;  00,00)  are 
placed  in  correspondence  with  points  of  the  unclosed  cell  (—!,  —  !;  1,  1)  in  the 
plane  of  (£(1),  £(2));  we  see  that,  corresponding  to  a  set  of  points  in  the  latter 
plane  which  has  a  limiting  point  on  the  boundary  of  the  rectangle  (—  1,  —  1  ;  1,  1), 
there  will  be  a  set  of  points  in  the  plane  of  (#(I),  #(2))  which  has  a  limiting  point, 
in  the  extended  sense  of  the  term,  on  a  boundary  adjoined  to  the  rectangle 
(—  oc  ,  —  oc  ;  oo  ,  oc  ).  In  this  case  also,  any  limiting  point  of  a  set  in  the  rect- 
angle (—],  —  !;  1,  1)  in  which  the  points  (£"',  £  ('->))  are  contained,  will  correspond 
to  a  limiting  point  of  the  corresponding  set  in  the  rectangle  (—  oo  ,  —  oo  ;  00,00) 
in  which  the  points  (#(1),  a-(2))  are  contained.  It  is  clear  that  there  is  an  arbi- 
trary element  in  the  particular  transformations  employed. 

The  set  of  points  O'1',  #'a>)  such  that  «(1>  -  e  ^  xw  ^  a<"  +  e',  and  x(z)  S=  A, 
where  e,  e  ,  A  are  positive  numbers,  may  be  called  a  closed  neighbourhood  of 
the  point  (a(l),  oo  ).  A  similar  definition  may  be  given  for  an  open  neighbour- 
hood of  the  point.  The  set  of  points  for  which  x(l}  ^  A,  x^  ^  -  B,  where  A  and  B 
are  positive  numbers,  may  be  termed  a  closed  neighbourhood  of  the  point 
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(oo  ,  —  oo  );  if  #"'  >  A,ocw  <  —  B,  the  neighbourhood  is  open.  Similar  defini- 
tions will  apply  to  points  at  infinity  of  the  other  types. 

54.  Returning  to  the  case  of  a  set  Q  in  a  finite  interval  or  cell,  we  observe 
that  the  limiting  points  of  G  form  a  set  of  points  which  may  be  finite  or 
infinite;  this  set  is  called  the  derived  set*,  or  first  derivative  of  G,  and  niay 
be  denoted  by  G'.  In  case  the  set  G'  contains  an  infinite  number  of  points, 
it  possesses  itself  a  derivative  set  G",  which  is  called  the  second  derivative 
of  G.  If  we  proceed  in  this  manner,  we  may  obtain  a  series 

G',  G",G'",  ...  G"» 

of  derivatives  of  G.  If  the  nth  derivative  G(n)  contains  a  finite  number  only  of 
points,  then  these  have  no  limiting  point,  and  we  mav  say  that  6r(n+1)  =  0. 
It  may  however  happen  that,  however  large  the  integerV  may  be,  the  deriva- 
tive G(n}  contains  an  indefinitely  great  number  of  points;  and  thus  a  next 
derivative  exists. 

A  set  G  which  possesses  only  a  finite  number  of  derivatives  is  said  to  be  of 
the  first  species. 

In  this  case,  if  G(s)  contains  only  a  finite  number  of  points,  the  set  G  is 
said  to  be  of  orderfs.  Thus,  for  example,  a  set  of  the  first  species  and  order 
zero  contains  only  a  finite  number  of  points  ;  and  a  set  of  the  first  species 
and  order  1  has  a  first  derivative  which  contains  only  a  finite  number  of  points. 
It  will  be  observed  that  the  order  of  each  derivative  of  G  is  less  by  unity  than 
that  of  the  one  which  precedes  it. 

A  set  G  which  possesses  an  indefinite  number  of  derivatives  is  said  to  be  of 
the  second  species. 

As  an  example,  we  may  consider  the  set  of  rational  numbers  in  the  interval 
(0,  1).  The  first  derivative  of  this  set  contains  every  real  number  in  (0,  1), 
and  all  subsequent  derivatives  are  identical  with  the  first. 

EXAMPLES. 


1.     LetJ  0  =  (l,  L  \,  -  ...  -  ..-) 


We  see  that  G'  consists  of  the  single  point  0,  which  does  not  belong  to  G  ;  thus  G  is  of  the 
first  species  and  of  order  1. 

2.     Let§  the  points  of  G  be  given  by 

1+1  +  IJ.JL 

3§-      5'*      78>      11*' 

*  The  notion  of  the  derivative  of  a  set  was  introduced  by  Cantor,  Math.  Annalen,  vol.  v  (1872), 
p.  128.  Du  Bois  Reymond  contemplated  the  existence  of  limiting  points  of  various  orders,  Crelle's 
Journal,  vol.  LXXIX  (1874),  p.  30;  in  Math.  Annalen,  vol.  xvi,  p.  128,  Du  Bois  Reymond  denned 
a  limiting  point  of  infinite  order. 

t  Cantor,  Math.  Annalen,  vol.  v,  p.  129.  J  Cantor,  Math.  Annalen,  vol.  v  (1872). 

§  Ascoli,  Ann.  di  Mat.,  Series  n,  vol.  vi,  p.  56,  1875. 
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where  «j,  «2»  *»»  8t  each  have  all  positive  integral  values.  Here  G'  consists  of  the  four  sets 
of  points  given  by 

1       _1       JL     J_J_      _L      l,ljj_      1+1     _L 
3«,      5'.      7-3'    3"'      5'2+HV    3*'      78'      11"*'    5*     7*      11*J 
and  of  the  six  sets  of  points 

1+1    1+1    !j.J_    1+L    1,J_    1,  J_ 

3''     5'«'    3''      78''    38'      llf<'    58°     78''    5*«     ll'«'    7*'     ll'*' 

and  of  the  four  sets  of  points 

111         1 

3^'    5V    7^'    11s*  5 

together  with  the  single  point  0.  G"  .  consists  of  the  last  ten  of  these  sets,  and  of  the 
point  0.  The  second  derivative  G'"  consists  of  the  last  four  sets,  and  of  the  point  0  ;  G"" 
consists  of  the  point  0  only.  The  set  G  is  of  the  first  species  and  of  the  fourth  order. 

3.     Let*  the  points  of  G  be  given  by 


where  n  is  a  fixed  number,  and  each  of  the  numbers  «j,  «2,  ...  an  takes  every  positive 
integral  value.     In  this  case  G  is  of  order  n. 

4.     The   zeros  f    of  the   function   sin  -  form  a   set   si  milar  to   that   in   Example    1  . 

The    zeros    of    the    function   sin  /  —  —  -  \    form   a   set   of    the   second  order,   those   of 

I  sin-   I 
V       *J 

form  a  set  of  the  third  order,  and  so  on. 


5.     LetJ  the  points  of  G  be  given  by 

where  mll  mz,  ...  mn  have  all  positive  integral  values,  including  zero,  and  n  is  a  fixed 
integer.     It  can  be  seen  that  £<">  consists  of  the  point  zero  only. 


DESCRIPTIVE  TERMINOLOGY. 

55.  If  6rj,  Gz,  G3,  •••  Gn  denote  a  number  of  sets  of  points  (either  linear 
or  in  any  number  of  dimensions),  the  set  which  contains  every  point  that 
belongs  to  one  or  more  of  the  given  sets  is  called  their  greatest  common 
measure,  and  is  denoted  by  M  (G1}  G^, . . .  Gn).  In  case  no  two  of  the  given  sets 
have  a  point  in  common,  the  common  measure  of  the  sets  may  be  denoted  by 
GI  +  G2  +  •  ••  +  Gn,  and  it  may  be  spoken  of  as  their  sum.  By  some  writers  the 
term  "sum"  is  employed  for  the  greatest  common  measure. 

*  H.  J.  S.  Smith,  Proc.  Lond.  Math.  Soc.,  vol.  vi,  p.  145,  1875. 
t  P.  Du  Bois  Raymond,  Journ.f.  Math.,  vol.  LXXIX,  p.  36. 
£  Mittag-Leffler,  Acta  Math.,  vol.  iv,  p.  58. 
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That  set  which  contains  all  those  points  which  belong  to  every  one  of  the 
given  sets  is  called  their  greatest  common  divisor,  and  it  may  be  denoted  by 
D(Gi,  (ra,  ...  Gn).  These  definitions  are  still  applicable  in  the  case  of  an 
unending  sequence  of  sets  Glt  Gz,  03,  .... 

If  all  the  points  of  a  set  H  are  points  of  a  set  G,  H  is  said  to  be  contained 
in  G,  or  to  be  a  part,  or  component,  of  G.  The  set  G  is  said  to  contain  H. 
Those  points  of  G  that  do  not  belong  to  //  form  a  set  which  may  be  denoted 
by  G-  H.  The  set  G  —  H  is  said  to  be  the  complement  of  H  with  respect 
to  G,  and  is  sometimes  denoted  by  CG  (H)  If  the  set  G  consists  of  all  the 
points  of  the  fundamental  interval,  or  cell  (open  or  closed),  which  contains  H, 
the  set  G  —  H  is  called  the  complement  of  H,  and  is  denoted  by  C  (H). 

A  set,  all  of  whose  limiting  points  belong  to  the  set  itself,  is  said  to  be  closed. 
Thus,  in  a  closed  set,  the  derivative  G'  is  a  component  of  G. 

Although  the  definition  of  a  closed  set  would  be  applicable  to  an  unbounded 
set,  in  case  limiting  points  in  the  extended  sense  of  the  term  are  admitted,  as 
explained  in  §  53,  it  is  usually  convenient  to  restrict  statements  as  to  closed 
sets  to  the  case  in  which  they  are  bounded.  Accordingly,  it  will  in  general  be 
assumed  that  a  closed  set  is  a  bounded  one. 

A  set  of  points  G  is  said  to  be  an  isolated  set  when  no  point  of  the  set  is  a 
limiting  point  of  the  set. 

Thus,  we  have,  for  an  isolated  set  G,  the  condition  D(G,  G')=  0. 

If,  from  any  set  G,  we  remove  those  points  which  also  belong  to  its 
derivative,  the  remainder  is  an  isolated  set;  thus  G  —  D(G,  G')  forms  an 
isolated  set.  Any  set  G  may  be  regarded  as  the  sum  of  an  isolated  set  and  of 
a  component  of  the  derivative  G'. 

If  a  component  H  of  the  set  G  is  such  that  every  point  of  G  is  a  limiting 
point  of  H,  the  set  H  is  said  to  be  dense  in  G. 

If  we  consider  the  case  in  which  H  is  identical  with  G,  we  obtain  the 
definition: 

If  every  point  of  G  is  a  limiting  point  of  the  set,  G  is  said  to  be  dense 
in  itself. 

For  a  set  G,  dense  in  itself,  G  is  a  component  of  the  derivative  G'.  The 
rational  numbers  of  the  interval  (0,  1)  form  a  set  that  is  dense  in  itself. 

A  set  G  which  is  both  closed  and  dense  in  itself  is  said  to  be  perfect*. 
Thus  a  perfect  set  G  is  identical  with  its  derivative.    It  follows  that  every 
perfect  set  is  of  the  second  species. 

By  some  writers  f  the  term  perfect  is  applied  to  sets  which,  in  accordance 
with  the  terminology  of  Cantor  here  adopted,  are  only  closed,  without 

*  Cantor,  Math.  Annalen,  vol.  xxi. 

•f  For  example  Jordan,  see  Cours  d' Analyse,  vol.  i,  p.  19. 
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necessarily  being  dense  in  themselves;  what  is  here  called  a  perfect  set  is  then 
spoken  of  as  an  absolutely*  perfect  set. 

In  case  the  set  G  consists  of  all  the  points  of  a  closed  interval,  or  cell, 
a  set  H  which  is  dense  in  G  is  said  to  be  everywhere  dense f  in  the  given 
interval,  or  cell.  This  is  equivalent  to  the  statement  that: 

A  set  H  is  said  to  be  everywhere  dense,  or  dense  in  an  interval,  or  in  a 
cell,  provided  that  no  interval,  or  cell,  contained  in  the  given  one,  exists  which 
contains  no  points  of  H. 

Similarly,  if  a  set  H  is  dense  in  a  set  G,  no  interval,  or  cell,  can  be 
determined  such  as  to  contain  points  of  G  and  no  points  of  H. 

If  H  is  dense  in  G,  the  derivative  H'  of  H  contains  every  point  of  G. 
The  derivative  G  of  a  set  G  which  is  everywhere  dense  in  an  interval,  or  cell, 
must  contain  every  point  of  the  closed,  interval,  or  cell.  This  property  may  be 
used  as  a  definition |  of  the  term  "everywhere  dense." 

By  Du  Bois  Reymond§  the  term  pantachisch  was  used  with  the  same 
meaning  as  everywhere  dense. 

Any  interval,  or  cell,  contained  in  a  fundamental  interval,  or  cell,  may  be 
spoken  of  as  a  sub-interval,  or  sub-cell. 

If,  in  every  sub-interval,  or  sub-cell  (a,  b'),  of  the  fundamental  interval,  or 
cell  (a,  b),  in  which  a  set  of  points  G  is  contained,  another  sub-interval,  or 
sub-cell  (a",  b"),  can  be  determined  which  contains  no  points  of  G,  the  set  G  is 
said  to  be  nowhere  dense,  or  non-dense  in  (a,  b). 

Thus  a  non-dense  set  is  one  such  that  no  interval  or  cell  exists  in  which 
the  set  is  everywhere  dense. 

A  component  H  of  a  set  G  is  said  to  be  non-dense  in,  or  relatively  to  G,  if, 
in  any  interval,  or  cell,  that  contains  points  of  G,  a  sub-interval,  or  sub-cell,  can 
be  determined  which  contains  a  point  of  G  but  no  point  of  H. 

A  point  P  of  a  set  G  is  said  to  be  an  interior  point  of  G,  if  a  neighbourhood 
of  P  can  be  determined  all  the  points  of  which  are  points  of  G. 

If,  however,  the  set  (rbe  bounded,  and  contained  in  a  fundamental  interval, 
or  cell,  a  point  of  G  on  the  boundary  of  the  interval,  or  cell,  may  be  regarded 
as  an  interior  point^gf  G  relatively  to  the  interval  or  cell,  if  a  neighbourhood 
of  the  point  exists  such  that  every  point  of  that  neighbourhood  which  is  in 
the  fundamental  interval,  or  cell,  is  a  point  of  G.  Such  a  point  is  however  not 
an  interior  point  of  G,  relatively  to  unbounded  space,  or  to  an  interval,  or  cell, 
which  contains  the  fundamental  interval  or  cell  in  its  interior. 

^ 

*  Bor|l,  Leqons  nur  la  theorle  des  fonctions,  p.  36. 
+  Cantor,  Math.  Annalen,  vol.  xv,  p.  2. 

*  Baire,  Annali  (L  Mat.,  ser.  3,  vol.  in,  p.  29. 
§  Math.  Annalen,  vol.  xv,  p.  287. 
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Those  points  of  a  set  0  which  are  not  interior  points  of  Q,  together  with 
those  points  of  C(G)  which  are  not  interior  points  ofC(G),form  a  set  which  is 
called  the  frontier  of  0  or  of  C(G);  the  set  C(G)  denoting  the  complement  of  G 
in  the  unbounded  interval  or  space  in  which  G  is  contained. 

If,  however,  G  is  contained  in  an  interval,  or  cell,  and  C(G)  denotes  the 
complement  of  G  with  respect  to  such  interval,  or  cell,  the  set  of  points  of  G  not 
interior  to  G,  relatively  to  the  interval  or  cell,  together  with  those  points  of  0(0} 
which  are  not  interior  to  G  (G),  relatively  to  the  interval  or  cell,  forms  the 
boundary  of  G  or  of  0  (G)  relatively  to  the  fundamental  interval,  or  cell. 

To  illustrate  the  distinction  here  made,  we  consider  for  example  the 
linear  set  consisting  of  all  the  points  of  the  closed  interval  (a,  b);  the  boundary 
of  G  consists  of  the  two  points  a,  b,  which  are  not  interior  points  either  of  G 
or  of  C(G),  the  complement  of  G  relatively  to  (—  oo  ,  x  ).  But,  relatively  to 
the  interval  (a,  b),  the  points  a,  b  are  interior  points  of  the  closed  set  (a,  6), 
in  accordance  with  the  convention  made  above;  and  thus  the  set  has  no 
boundary  relatively  to  the  interval  (a,  b). 

Every  point  of  an  open  interval,  or  cell,  is  an  interior  point  of  the  open 
interval,  or  cell,  regarded  as  a  set  of  points.  For  example,  every  point  of  an 
unbounded  space  is  an  interior  point  of  the  unbounded  space  considered  as  a 
set  of  points. 

An  open  set*  is  one  in  which  every  point  is  an  interior  point.  A  set  is 
open  relatively  to  a  fundamental  interval,  or  cell,  when  every  point  is  an 
interior  point  relatively  to  the  interval,  or  cell. 

The  term  open  set  is  however  employed  by  some  writers  to  denote  any  set 
which  is  not  closed. 

A  non-dense  set  has  no  interior  points,  but  an  everywhere  dense  set  may 
also  have  no  interior  points.  An  open  set  contains  none  of  the  points  of  its 
frontier. 

It  is  frequently  of  importance  to  consider  the  properties  of  sets  which  are 
contained  in  a  given  perfect  set  G,  or  which  have  a  part  in  common  with  G. 

A  point  P  of  a  set  H  is  said  to  be  an  interior  point  of  H  relatively  to  G,  if 
it  is  a  point  of  G  and  is  such  that  a  neighbourhood  of  J?  exists  for  which  all 
the  points  of  G  in  that  neighbourhood  are  also  points  of  Sr 

Those  points  of  G  which  are  limiting  points  both  of  H  and  of  CG(H),  where 
H  is  a  component  of  G,  are  said  to  form  the  frontier  of  H  and  of  CG(H) 
relatively  to  G. 

A  set  H,  all  the  points  of  ivhich  are  interior  points  of  H  relatively  to  G,  is 
said  to  be  open  relatively  to  G. 

*  de  la  Vallee  Poussin,  Integrates  de  Lebesgue,  p.  10. 
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A  set  H,  such  that  no  point  which  it  has  in  common  with  G  is  an  interior 
point  of  H  relatively  to  0,  is  said  to  be  diffuse  relatively  to  G,  or  to  be  diffused 
in  G. 

If  H  has  points*  in  common  with  G,  and  is  not  diffuse  in  G,  it  is  said  to  be 
compact  in  G.  . 

This  property  of  compactness  was  formulated  by  de  la  Vallee  Poussin, 
whose  definition  is  equivalent  to  that  here  given. 


PROPERTIES  OF  CLOSED  AND  OPEN  SETS. 

56.  The  complement  C(G)  of  a  closed  set  G  with  respect  to  a  closed  interval, 
or  cell,  in  which  G  is  contained  is  an  open  set,  relatively  to  the  interval  or  cell. 
Conversely,  the  complement  of  an  open  set  contained  in  a  closed  interval,  or  cell, 
is  a  closed  set. 

For  any  point  P  of  C  (G)  which  is  not  a  limiting  point  of  the  closed  set  G 
is  such  that  a  neighbourhood  of  P  can  be  determined  which  contains  no  points 
of  G.  All  the  points  of  such  neighbourhood  (or  of  the  part  of  it  in  the 
fundamental  interval,  or  cell)  are  points  of  C  (G).  Therefore  P  is  an  interior 
point  of  C(G).  It  follows  that  C  (G)  is  an  open  set. 

If  H  be  an  open  set,  a  limiting  point  of  C  (H)  cannot  belong  to  H,  because 
every  point  of  H  has  a  neighbourhood  none  of  the  points  of  which  belong  to 
C(H).  Since  every  limiting  point  of  C(H)  belongs  to  C  (H),  the  set  C(H) 
is  closed. 

A  closed  set  G  being  essentially  bounded,  the  complementary  set  C(G)  with 
respect  to  the  unlimited  interval  or  space  in  which  G  is  contained  is  open  in 
the  absolute  sense. 

An  unbounded  open  set  will  only  necessarily  have  a  closed  set  as  its 
complement,  provided  the  meaning  of  the  term  closed  set  is  extended  by 
admitting  adjoined  points  at  infinity  as  points  of  the  closed  set  (see  §  53). 
For  example,  consider  the  linear  set  of  all  points  x  such  that  0  <  x\  the 
complementary  set  with  respect  to  the  indefinite  interval  (—  oo  ,  oc  )  is  the  set 
for  which  —  oo  <  x  ^  0;  and  this  can  only  be  regarded  as  a  closed  set  if  the 
improper  point  —  oo  is  admitted  as  part  of  it. 

The  complement  CQ  (H)  of  a  cloved  set  with  respect  to  a  perfect  set  G  which 
contains  H  is  open  relatively  to  G.  Conversely  the  complement  with  respect  to 
the  perfect  set  G  of  a  set  H  contained  in  it,  and  open  with  respect  to  it,  is  a 
closed  set. 

The  proof  of  this  theorem  is  precisely  similar  to  that  of  the  last  theorem. 

IfG1}  G2  be  closed  sets,  both  the  sets  M(Gly  G2),  D(G^  G2)  are  closed. 

*  Integrities  de  Lebesgue,  p.  106. 
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Let  P  be  a  limiting  point  of  M  ((?,,  02) ;  then  P  must  be  a  limiting  point 
of  one  at  least  of  the  sets  6r,,  G3.  For,  if  this  were  not  the  case,  a  neighbour- 
hood of  P  could  be  determined  so  as  to  contain  no  points  of  (7,  or  of  6r2,  other 
than  P,  and  therefore  none  of  M  (G^G^).  The  point  P  consequently  belongs  to 
one  at  least  of  the  closed  sets,  and  therefore  to  M(Glt  6r2),  which  is  therefore 
closed.  A  limiting  point  P,  of  D(Gl}  (r2),  is  clearly  a  limiting  point  both  of 
Gl  and  of  (r2;  therefore  P  belongs  to  both  these  sets,  and  thus  to  D  ((?1(  (r2), 
which  is  consequently  closed. 

The  theorem  can  be  readily  extended  to  the  case  of  any  finite  number  of 
closed  sets.  Thus: 

IfG^Gz,...  Gn  be  dosed  sets,  both  the  sets  M(Gl,G2,...  Gn),  D(G,,G,2>...  Gn) 
are  closed. 

If  the  number  of  closed  sets  is  indefinitely  great,  so  that  they  form  a 
sequence  G1}  G2,  ...  Gn,  ...  of  such  sets,  the  set  M(Glt  G2, ...)  is  not  necessarily 
closed.  For  if  P1  belong  to  Glt  P2  to  G2,  P3  to  G3, . . ,  ,  the  point  P,  the  limiting 
point  of  the  sequence  Plt  P2,  •  ••  ,•  does  not  necessarily  belong  to  any  of  the  sets, 
and  thus  does  not  necessarily  belong  to  M(Gl,  G.2,  ...).  On  the  other  hand  the 
set  D(Glt  G2,...),  if  it  exists,  is  necessarily  closed,  for  every  limiting  point  of  it 
is  also  a  limiting  point  of  Gm,  for  each  value  of  m,  and  is  therefore  a  point  of 
Gm  for  each  value  of  m. 

//"(>!,  02,  03,...be  a  finite  number,  or  a  sequence,  of  open  sets,  M(01}  02)  03,  ...) 
is  also  an  open  set. 

The  sets  Oi,02, . . .  all  being  open  sets,  the  complementary  sets  0(0^),  0(0^), . . . 
are  all  closed  sets:  and  consequently!)  {0(0^,  C(02), ...}  is  a  closed  set,  whether 
the  number  of  the  given  sets  is  finite  or  not.  But  the  set  D  {^(Oj),  C*(02), ...} 
is  clearly  the  complement  of  M(Q1}  02, ...),  which  is  consequently  an  open  set. 

In  case  the  sets  0^  02,  ...  are  all  contained  in  a  finite  closed  interval  or  cell, 
the  complementary  sets  C^Oj),  (7(0.2),  ...  are  taken  relatively  to  this  interval 
or  cell.  If  this  is  not  the  case,  the  sets  0(0^,  (7(02), ...  will  only  be  closed,  in 
an  extended  sense  of  the  term,  when  limiting  points  at  infinity  are  admitted. 
This  being  taken  into  account,  the  proof  applies  to  this  case. 

//O],  02, .. .  Om  be  a  finite  number  of  open  sets,  the  set  D(0lt  02,  ...  Om),  if  it 
exists,  is  also  an  open  set. 

For  the  set  D  (Oi,02,...0TO)  is  the  complement  of  the  set  J/{C(0,),  C(Q»\... 
^(OMI)',,  which  is  a  closed  set. 

The  theorem  does  not  hold  for  the  case  of  an  infinite  number  of  open  sets. 

It  has  been  seen  that  the  properties*  of  a  set  of  being  perfect,  closed,  open, 

dense,  or  non-dense,  are  invariant  for  a  wide  class  of  transformations  of  which 

examples  have  been  given  in  §  53  ;    special  account  being  taken  of  the  cases 

in  which  boundaries  at  infinity  must  be  adjoined  to  the  space  in  which  a  set 

*  See  a  paper  by  E.  H.  Neville,  Ada  Math.,  vol.  XLII,  p.  63. 
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exists.  These  properties  of  a  set  are  accordingly  said  to  be  descriptive  properties, 
as  distinct  from  the  non-invariant  metric  properties  which  will  be  considered 
in  Chapter  in. 

PROPERTY  OF  THE  SUCCESSIVE  DERIVATIVES  OF  A  SET. 

57.  The  following  fundamental  property  of  the  successive  derivatives  of  a 
set  of  points,  in  any  number  of  dimensions,  will  now  be  established. 

All  the  derivatives  G',  G",  G", ...  G(n], ...  of  a  given  set  are  closed  sets,  and 
each  of  these  derivatives,  after  the  first,  consists  of  points  belonging  to  the 
preceding  one,  and  therefore  to  G'. 

This  theorem,  usually  stated  for  a  bounded  set,  holds  also  for  an  unbounded 
set,  provided  the  extended  meaning  be  given  to  the  terms,  limiting  point, 
closed  set,  which  has  been  formulated  in  §  53. 

If  a  point  P  of  G(n},  where  n  ^  2,  existed,  which  did  not  belong  to  G',  then 
a  neighbourhood  of  P  could  be  determined,  so  as  to  contain  only  a  finite  set  of 
points  of  G,  or  no  such  points;  and  this  neighbourhood  would  therefore  contain 
no  points  of  G',  and  consequently  none  of  G",  G"',  ...  6r(n);  which  would  be 
contrary  to  the  hypothesis  that  P  belongs  to  G(n}.  Therefore  every  point  of 
G(n}  (n  =  2)  belongs  to  G'.  By  considering  the  case  n  —  2,  we  see  that  G'  is  a 
closed  set. 

If  we  take  G(n~z]  to  be  the  original  set,  it  follows  from  the  above  that  every 
point  of  G(n),  the  second  derivative,  belongs  to  G(n~1},  the  first  derivative.  We 
have  thus  shewn  that 

G(n]  =  D(G',  G",...  G<»>). 

The  derivative  G'  of  a  set  G  which  is  dense  in  itself  is  perfect. 

For  G'  is  closed,  and  every  point  of  G  belongs  to  G' ;  thus  G'  contains  no 
point  which  is  not  a  limiting  point  of  G'.  Therefore  G',  being  both  closed  and 
dense  in  itself,  is  perfect. 

ENUMERABLE  AGGREGATES. 

58.  An  aggregate  which  contains  an  indefinitely  great  number  of  elements 
is  said  to  be  enumerable*,  or  countable  (abzalbar,  denombrable),  when  the  aggre- 
gate is  such  that  a  (1,  1)  correspondence  can  be  established  bettveen  the  elements 
and  the  set  of  integral  numbers  1,  '2,  3,  ...  . 

An  aggregate  of  objects  is  therefore  enumerable  if  the  objects  can  be 
arranged  in  a  series  which  has  a  first  term  and  in  which  any  assigned  object 
belonging  to  the  aggregate  has  a  definite  place  assigned  by  a  definite  ordinal 
number  n.  Thus  the  elements  of  an  enumerable  aggregate  can  be  represented 
by  a  sequence  of  symbols 


Cantor,  Crelle's  Journal,  vol.  LXXVII  (1873),  p.  258. 


56-58]  Enumerable  Aggregates  79 

It  follows  from  this  definition  that  the  elements  of  two  enumerable 
aggregates  are  such  that  a  (1,  1)  correspondence  can  be  established  between 
them. 

If  a  new  aggregate  be  defined  by  selecting,  in  accordance  with  a  rule  or  finite 
set  of  rules,  elements  from  those  which  belong  to  an  enumerable  aggregate,  an 
indefinitely  great  number  of  such  elements  being  taken,  then  the  new  aggre- 
gate is  also  enumerable.  For  such  an  aggregate  selected  from  ut ,  u.z,  ...  un,  ... 
is  ur,  ns,  ut,  ...  (r<  s<t ...),  which  satisfies  the  conditions  of  having  a  first 
term,  and  of  having  each  element  of  the  aggregate  in  a  definite  place  in  the 
series.  It  thus  appears  that  a  (1, 1)  correspondence  can  be  established  between 
an  enumerable  aggregate  and  one  which  is  a  part  of  that  aggregate,  provided  this 
part  be  not  finite.  This  is  the  characteristic  property  which  distinguishes  an 
aggregate  containing  an  indefinitely  great  number  of  elements  from  one  con- 
taining only  a  finite  number  of  elements.  For  example,  a  (1,  1)  correspon- 
dence exists  between  all  the  integral  numbers  and  all  the  odd  numbers,  or 
between  all  the  integral  numbers  and  all  the  prime  numbers. 

If  a  finite  number  of  enumerable  aggregates  be  given,  or  even  if  the  number 
of  such  aggregates  be  indefinitely  great,  but  enumerable,  then  the  new  aggregate 
formed  by  combining  these  aggregates  into  a  single  one  is  itself  enumerable*. 

We  may  denote  such  a  composite  aggregate  by  the  letters 

U\\,     1{12,     U13 ,    ...    Uln,    ... 


and  we  shall  shew  that  the  double  sequence  so  formed  represents  an  enumer- 
able aggregate.    To  see  this,  it  is  sufficient  to  write  the  sequence  in  the  form 


U12, 


where  the  sum  of  the  indices  is  the  same  for  all  the  terms  which  are  written 
in  one  horizontal  line.  It  is  now  clear  that  each  number  upq  has  a  definite 
place  in  a  sequence  in  which  un  has  the  first  place  ;  the  double  sequence  is 
therefore  enumerable. 

*  Cantor,  Crelle's  Journal,  vol.  IAXXIV  (1875). 
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An  important  particular  case  of  the  above  result  is  the  following 
theorem  : 

The  aggregate  of  all  the  rational  numbers  is  enumerable. 

A  rational  number  pfq  may  be  denoted  by  up>q:  therefore  the  aggregate 
is  enumerable.  It  makes  no  difference  that  any  particular  number  p/q  occurs 
an  indefinite  number  of  times  as  urpt  rq  ',  since,  if  all  such  terms  except  those 
for  which  r  =  l,  and  p/q  is  in  its  lowest  terms,  be  removed,  the  aggregate  left 
is  still  enumerable. 

For  example,  the  aggregate  of  rational  numbers  in  the  open  interval 
(0,  1)  may  be  arranged  in  the  order  ^,  £,  f  ,  \,  f  ,  i,  f  ,  f  ,  |  ,  £,  f  ,  ...  ;  in  which 
each  fraction  that  occurs  is  in  its  lowest  terms. 

A  method  of  placing  the  rational  fractions  between  0  and  1  into  corre-  . 
spondence  with  the  integers  1,  2,  3,  ...  has  been  given  by  Faber*.    By  this 
method  the  integer  corresponding  to  a  given  fraction  can  be  calculated  by 
means  of  formulae.    Faber  shews  that  there  exists  a  unique  representation  of 
a  given  fraction  p/q,  in  the  finite  form 

Oj      aa     «  3  an 

2!     3!     4!          +(w  +  l)!' 

where  ar  is  an  integer  (or  zero)  <  r  +  1  ;  whereas  every  integer  is  uniquely 
representable  in  the  form 

b,  +  b,  .  2  +  63  .  2  .  3  +  .  .  .  +  bn  .  2  .  3  .  4  .  .  .  n, 
where  br  is  an  integer  (or  zero)  <  r  +  I. 

To  the  fraction  p/q  there  corresponds  the  unique  integer 

.  +  an  .  n  I 


59.  A  more  general  theorem  has  also  been  established  by  Can  tor  f. 
An  algebraical  number  is  one  which  is  a  root  of  an  algebraical  equation  in 
which  the  coefficients  are  all  rational  numbers,  so  that  the  coefficients  may 
without  loss  of  generality  be  taken  to  be  integers.  Cantor's  theorem  is,  that 
all  the  algebraical  numbers  form  an  enumerable  aggregate. 

To  prove  this  theorem,  let 

p0xn  +p1x11-1  +  ...+/>„=  0 

be  an  equation  in  which  p0,p!,  ...pn  are  all  positive  or  negative  integers; 
and  let 

bo!  +  LPi!  +  .-.  +  pn  +n  =  N; 

then  N  is  a  positive  integer  which  may  be  called  the  rank  of  the  equation. 
It  is  clear  that  there  are  only  a  finite  number  of  equations  of  any  given 
rank,  these  equations  having  only  a  finite  number  of  roots.  If  then  we 
let  N  =  3,  4,  5,  ...  successively,  all  the  algebraical  numbers  can  be  arranged 

*  Math.  Annalen,  vol.  LX,  p.  1%. 
j"  Crelle't  Journal,  vol.  LXXVJI. 
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in  a  simple  sequence;  and  thus  they  form  an  enumerable  aggregate.  The 
aggregate  which  is  formed  of  all  the  real  algebraical  numbers  is  consequently 
also  itself  enumerable. 

A  number  which  is  not  an  algebraical  number  is  said  to  be  transcen- 
dental. The  existence  of  transcendental  numbers  was  first  established*  by 
Liouville,  who  shewed  how  examples  of  such  numbers  could  be  formed.  No 
general  criterion  is  known  by  which  it  can  be  decided  whether  a  number, 
defined  by  a  given  analytical  procedure,  is  algebraical  or  transcendental. 
The  first  case  in  which  such  a  number,  well  known  in  Analysis,  was  shewn  to 
be  transcendental  was  that  of  the  number  e,  the  base  of  the  natural  system  of 
logarithms ;  and  the  first  proof  that  e  is  transcendental  was  given  by  Herrnite. 
The  next  case  in  which  a  number,  defined  analytically,  was  shewn  to  be 
transcendental  was  that  of  the  number  TT.  The  first  demonstration  of  this 
important  fact  is  due  to  Lindemann  f,  who  proved  the  more  general  theorem 
that,  if  ex  =  y,  the  two  numbers  x,  y  cannot  both  be  algebraical,  except  in  the 
case  #=0,  y=l.  It  follows  that  the  natural  logarithms  of  all  algebraical 
numbers  are  transcendental,  as  also  are  all  numbers  of  which  the  natural 
logarithms  are  algebraical. 

60.    The  following  fundamental  theorem  will  now  be  established  j  : 

The  aggregate  which  consists  of  the  continuum  of  numbers  in  a  given 
interval  is  not  enumerable. 

Suppose  that  <alt  a>2,  w3>  ...  denote  the  numbers  in  an  enumerable  aggre- 
gate ;  it  will  then  be  shewn  that,  between  any  two  real  numbers  a,  ft  whose 
difference  is  as  small  as  we  please,  a  number  occurs  which  does  not  belong  to 
the  enumerable  aggregate.  It  will  then  follow  that,  in  the  given  interval 
(a,  ft),  there  are  an  unlimited  number  of  points  which  do  not  belong  to  the 
enumerable  aggregate,  and  thus  that  the  latter  cannot  contain  all  the  points 
of  the  continuum.  If  the  enumerable  set  of  points  is  not  everywhere  dense 
in  (a,  ft),  then  smaller  sub-intervals  inside  (a,  ft)  can  be  taken  which  contain 
no  points  of  the  aggregate ;  and  thus  we  have  only  to  consider  the  case  in 
which  the  given  aggregate  is  everywhere  dense  in  (a,  ft).  Let  <wKi  be  the  first 
of  the  points  a>i,  a>2,  •••  which  lies  within  (a,  ft),  and  <aKt  be  the  next  of  these 
points  which  lies  within  (a,  ft),  so  that  ^  <  *2.  Let  a'  be  the  smaller,  and 
ft'  the  greater  of  the  numbers  wKi,  &)<2,  then  a  <  a'  <  ft'  <  ft,  and  x^  <  /c2 ;  and 
if  p.  <  *2,  then  &>M  does  not  lie  within  the  interval  («',  ft').  Considering  this 
latter  interval,  let  wKt,  WK<  be  the  first  two  of  the  numbers  of  the  enumerable 
aggregate  which  lie  within  (a',  ft'),  and  let  a"  be  the  smaller  and  /3"  the 
greater  of  these  ;  then  a'  <  a"  <  ft "  <  ft' ,  and  K.2  <  /c3  <  K4.  Proceeding  in  this 
manner,  we  obtain  a  whole  series  of  sub-intervals  each  one  of  which  is  entirely 

*  Liouville's  Journal,  vol.  xvi,  1851. 

t  See  Math.  Annalen,  vol.  xx. 

J  Cantor    Crelle's  Journal,  vol.  LXXVII. 
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within  the  preceding  one;  thus  (o.M  ,  /3(l/))  lies  within  (a1""1',  ^(v~1));  and  if 
A*  <  KZV,  then  &>M  does  not  lie  within  (aM  ,  ft(v))\  also 


and  2i>  ^  /c2l,;  and  thus  &>„  lies  outside  (aM  ,  /8(v)).  Since  the  numbers  a'.,  a", 
a'"  ...  are  in  ascending  order,  and  all  lie  within  (a,  /3),  they  have  a  limit  A  ; 
similarly  £',  £",  ff"  ...  have  a  limit  B;  and  a<">  <  4  £  B</3{v].  If  A  <  B, 
then,  since  all  the  numbers  &>„  are  outside  the  interval  (A,  B),  the  given 
aggregate  is  not  everywhere  dense  in  (a,  /3)  ;  which  is  contrary  to  hypothesis. 
Hence  we  have  A  =  B',  and  the  number  A,  or  B,  is  a  number  which  does  not 
occur  in  the  aggregate  o)1}  to2,  ...  ;  thus  the  assumption  that  all  the  real 
numbers  in  a  given  interval  can  be  effectively  arranged  in  a  simple  sequence 
has  been  shewn  to  lead  to  a  contradiction. 

It  will  be  observed  that  the  point  of  the  foregoing  proof  consists  in  the 
fact  that  an  everywhere  dense  enumerable  aggregate  necessarily  has  limiting 
points  which  do  not  belong  to  the  aggregate. 

A  second  proof*,  also  due  to  Cantor,  that  the  continuum  is  not  enumerable 
is  the  following  :  —  Without  loss  of  generality,  the  interval  may  be  taken  to 
be  (0,  1).  Suppose  it  to  be  possible  to  state  a  set  of  rules  by  which  all  the 
numbers  within  this  interval  are  arranged  in  a  sequence,  so  that  there  is  a  first, 
a  second,  a  third,  and  so  on  ;  and  so  that  every  number  occurs  somewhere  in 
the  arrangement.  Since  certain  rational  numbers  are  capable  of  double 
representation,  viz.  by  means  of  a  decimal  in  which,  from  and  after  some  fixed 
place,  all  the  digits  are  zero,  and  also  by  a  decimal  in  which,  from  and  after 
some  fixed  place,  all  the  digits  are  9,  we  shall  suppose  this  last  mode  of 
representation  excluded,  so  that  each  number  is  represented  uniquely. 
Subject  to  this  convention,  let  the  numbers,  in  order,  be  exhibited  as  decimals 


'P21  P&  P'23 
"P«l  P32  Ps3 


where  each  p  stands  for  one  of  the  digits  0,  1,  2,  ...  9. 

It  is  assumed  that  we  are  in  possession  of  a  set  of  rules  by  means  of 
which  the  ?nth  digit  of  the  nth  number  can  be  determined,  for  each  pair  of 
values  of  m  and  n,  by  means  of  a  finite  number  of  applications  of  the  given 
set  of  rules.  Let  it  be  assumed  that,  if  possible,  all  the  real  numbers  in  the 
open  interval  (0,  1)  occur  in  the  above  sequence.  If  now  a  number  in  the 
open  interval  can  be  determined  which  does  not  occur  in  the  above  sequence, 
a  contradiction  will  have  been  shewn  to  be  involved  in  the  assumption  that 
all  the  numbers  in  the  interval  occur  in  the  above  sequence.  Now  such  a 

*  Jahresbericht  der  deutsclien  Math.  Vereinig.,  vol.  i,  p.  77. 
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number  can  be  defined,  for  example,  by  the  following  rules.  Let  the  nth 
digit  of  the  number  be  pnn  =  pnn  +  1 ,  unless  pnn  =  8  or  9 ;  if  pnn  =  8  or  9,  let 
pnn  =  Q.  The  number  •  pn  p^  p^  ...  pnn  ...  differs  in  respect  of  at  least 
one  digit  from  every  number  in  the  above  sequence,  and  therefore  cannot 
occur  in  the  sequence.  The  contradiction  in  the  original  assumption  is  thus 
established. 

It  will  be  observed  that,  in  general,  when  any  sequence  of  numbers  in  the 
interval  a  <  x  <  b  has  been  defined,  the  existence  of  the  sequence  provides  the 
means  of  defining  other  numbers  in  the  interval  thatdo  not  occur  in  thesequence. 

THE  POWER,  OR  CARDINAL  NUMBER,  OF  AN  AGGREGATE. 

61.  A  notion  of  fundamental  importance  in  the  theory  of  aggregates  is 
that  of  the  power,  or  cardinal  number,  of  an  aggregate.  This  notion  will  be 
considered  more  generally  and  fully  in  Chapter  iv,  where  it  will  be  shewn 
that  the  power  of  an  aggregate  is  the  generalization  of  the  notion  contained 
in  the  cardinal  number  of  a  finite  aggregate.  At  present,  an  account  of  the 
notion  of  the  power  of  an  aggregate  will  be  given,  so  far  as  it  is  necessary  for 
the  application  to  the  case  of  sets  of  points. 

Two  aggregates  of  objects  are  said  to  have  tlie  same  power,  or  cardinal 
number,  when  a  (1,  1)  correspondence  can  be  established  between  them,  so  that 
each  element  of  either  of  the  aggregates  corresponds  to  one  single  element  of 
the  other. 

Finite  aggregates  have  the  same  power  when  they  consist  of  the  same 
number  of  elements,  i.e.  when  they  have  the  same  cardinal  number.  Of 
aggregates  which  are  not  finite  we  consider  first  enumerable  aggregates. 
Every  enumerable  aggregate  has  the  power  of  the  aggregate  of  integral 
numbers ;  and  this  we  may  denote  by  a.  It  has  been  shewn  above  that,  if 
from  an  aggregate  of  power  a  any  elements  be  removed,  then  the  remaining 
aggregate,  provided  it  contains  a  non-finite  number  of  elements,  has  still  the 
same  power  a.  It  has  further  been  shewn  that  the  composite  aggregate 
formed  of  a  finite,  or  enumerable,  set  of  enumerable  aggregates  has  the  same 
power  a.  It  follows,  as  an  interesting  case,  that  the  set  of  all  those  points  of 
an  n-dimensional  space  whose  coordinates  are  rational  numbers  has  the  power 
a  of  the  set  of  integral  numbers,  or  of  the  rational  numbers  in  a  given  linear 
interval. 

It  is  easily  shewn  that  the  power  of  the  set  of  all  the  points  in  an  interval 

OC  —~  (J- 

(a,  6)  is  the  same  as  that  in  any  other  finite  interval,  say  (0,  1);  for  ,  -  -  =  £ 
establishes  a  (1,  1)  correspondence  between  the  points  x  of  (a,  b)  and  the 

points   £  of  (0,    1).    Again,  the    relation  -  - .  =  £  establishes    a    (1,    1) 

•\/x2  +  h~ 

correspondence  between  all  real  numbers,  and  those  in  the  interval  (  —  1,  1) ; 

6—2 
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and  thus  the  power  of  all  real  numbers  is  the  same  as  that  of  all  those  in  a 
finite  interval.  This  power  is  called  the  cardinal  number  of  the  continuum, 
and  may  be  denoted  by  c. 

As  regards  unenumerable  aggregates  in  general,  it  can  be  shewn  that  the 
power  of  such  an  aggregate  is  unaltered  by  removing  from  the  aggregate 
any  elements  which  form  an  enumerable  aggregate.  Let  A  denote  the  given 
aggregate,  and  a  the  enumerable  aggregate  which  is  removed;  and  let  B  denote 
the  remaining  aggregate,  which  cannot  be  enumerable,  for  otherwise  (a,  B),  or 
A,  would  be  so  also.  From  B,  suppose  an  enumerable  aggregate  a  to  be 
removed,  leaving  the  aggregate  C,  thus  A  =  (a,  a',  C),  B  =  (a,  C).  Now  (a,  a') 
and  a',  being  both  enumerable,  have  the  same  power;  and  a  (1,  1)  correspon- 
dence therefore  exists  between  their  elements ;  and  since  A  and  B  have  the 
aggregate  C  in  common,  it  therefore  follows  that  A  and  B  have  the  same 
power.  As  an  example  of  this  theorem,  we  see  that  the  set  of  irrational  points 
in  a  given  interval  has  the  power  c  of  the  set  of  all  numbers  in  the  interval. 
Again  the  set  of  transcendental  numbers  in  a  given  interval  has  the  power  c 
of  the  continuum  ;  whereas  the  set  of  algebraical  numbers  in  the  same  interval 
has  the  power  a.  In  this  proof,  the  assumption  has  been  made  that  the 
unenumerable  aggregate  B  necessarily  contains  an  enumerable  part  a'. 

The  known  infinite  sets  of  points  defined  in  accordance  with  the  methods 
usual  in  the  theory  of  sets  of  points,  in  a  line  or  in  a  continuum  of  any 
number  of  dimensions,  have  either  the  power  a  or  the  power  c ;  but  it  has  not 
yet  been  established  that  the  assumption  of  the  existence  of  an  infinite  set 
of  points  which  has  neither  the  power  a  nor  the  power  c  leads  to  a  contradiction. 
Other  aggregates  have  been  contemplated  which  have  a  power  higher  than 
c ;  these  will  be  referred  to  later,  in  dealing  with  the  theory  of  functions. 

62.  The  p-dimensional  continuum  has  the  power  c  of  the  one-dimensional 
continuum*. 

To  prove  this  theorem,  we  use  the  fact  that  any  irrational  number  in  the 
interval  (0,  1)  can  be  exhibited  as  an  infinite  continued  fraction 

11  1 


where  a1}  «2,  •  •  •  ap>  •  •  •  are  determinate  integers  for  any  given  irrational  number 
x  in  the  interval  (0,  1).    Let 

1         1  1 

CC  i  — 

«i  +  ctp+i  +  a->p+i  +  ..., 
1         1  1 

5/2  — 

Q2  +  ctp+2  +  a2p+2  +  ..., 


p        .2p        3p      ..., 
Cantor,  Crelle's  Journal,  vol.  LXXXIV,  p.  242. 
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thus,  corresponding  to  any  value  of  a?,  a  set  of  irrational  numbers  #1(  #2»  •••  xp 
is  uniquely  determined,  and  conversely,  corresponding  to  any  set  of  irrational 
numbers  #,,  a?2,  ...  xp,  an  irrational  number  #  is  uniquely  determined. 

It  has  thus  been  shewn  that  the  irrational  points  of  the  linear  continuum 
(0,  1)  correspond  uniquely  to  those  points  of  the  p-dimensional  continuum  in 
which  each  coordinate  is  in  the  interval  (0,  1),  and  is  irrational.  It  has  been 
shewn  in  §  Gl  that  the  set  of  irrational  values  of  xly  in  the  interval  (0,  1),  has 
the  same  power  as  the  set  of  all  the  numbers  in  this  interval.  Since  this  holds 
also  for  x2,  xy,  ...  xp,  it  follows  that  a  (1, 1)  correspondence  exists  between  that 
set  of  points  in  the  n-dimensional  continuum,  for  which  xl}  xz,  ...  xp  all  have 
irrational  values,  and  the  set  in  which  x1,  x2,  ...  xp  have  all  values  rational  or 
irrational;  thus  these  sets  have  the  same  power.  Hence  the  set  of  all  points 
of  the  jt»-dimensional  continuum,  in  which  each  coordinate  is  in  the  interval 
(0,  1),  has  the  same  power  as  the  set  of  all  points  in  the  linear  interval  (0,  1). 
It  has  thus  been  shewn  that  the  ^-dimensional  continuum  has  the  same 
power  c  as  that  of  one  dimension. 

THE  ARITHMETIC  CONTINUUM. 

63.  The  arithmetic  continuum  being  regarded  as  obtained  by  adjoining  to 
the  set  of  rational  numbers  the  set  of  all  their  limiting  points,  the  question 
arises  how  far  it  is  legitimate  to  consider  the  complete  set  so  obtained  as 
constituting  a  single  object,  determined  by  means  of  the  elements  of  which 
it  is  composed.  A  finite  set  of  numbers,  or  points,  constitutes  a  single  object 
determined  by  means  of  its  parts,  in  the  sense,  that  those  parts  can  be 
exhaustively  exhibited  by  means  of  a  finite  number  of  specifications  repre- 
sentable  by  a  finite  number  of  symbols.  An  enumerable  set  of  numbers,  or  of 
points,  in  particular  the  set  of  rational  numbers,  is  not  determinate  in  the 
sense  that  the  elements  of  the  set  can  be  exhaustively  exhibited;  but  it  is 
determinate  in  the  sense  that  a  rule  of  tabulation  can  be  given,  such  that  each 
particular  number  of  the  set  occupies  a  determinate  place  in  the  table;  and 
each  particular  number  can  be  represented  by  means  of  a  finite  number  of 
symbols.  Such  a  set  may  be  regarded  as  an  aggregate,  or  single  object,  in 
the  same  sense  in  which  the  natural  numbers  1,  2,  3, ...  may  be  regarded  as 
forming  an  aggregate.  When  we  come,  however,  to  the  case  of  the  continuum, 
or  aggregate  of  all  real  numbers,  the  fact  that  this  aggregate  is  unenumerable 
introduces  a  new  element  into  the  question  of  the  legitimacy  of  considering 
the  set  of  these  numbers  as  forming  a  determinate  whole,  or  as  constituting 
a  single  object  of  thought.  The  set  of  real  numbers  cannot  be  tabulated  in 
such  a  manner  that  no  number  fails  to  occur  at  some  definite  place  in  the 
table.  In  fact  it  has  been  shewn,  in  §  60,  that  the  assumption  of  the  existence 
of  such  a  table  leads  to  contradiction.  It  thus  appears  that  no  set  of  rules  or 
specifications  can  be  given  which  suffice  to  determine  successively  all  the 
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numbers  of  the  set;  and  no  finite  set  of  symbols  can  exhaustively  exhibit  the 
numbers.  The  only  sense  in  which  the  numbers  of  the  set  are  determinate  is 
that  each  such  number  is  the  limit  of  a  convergent  sequence  of  numbers, 
taken  from  the  unending  table  formed  by  the  rational  numbers.  It  may  fairly 
be  doubted  whether  such  a  negative  specification  of  elements  amounts  to  a 
valid  synthetical  definition  of  a  determinate  aggregate;  this  point  will  however 
be  further  discussed  in  Chapter  iv,  in  connection  with  the  general  theory  of 
aggregates.  It  will  there  be  shewn  that  the  arithmetic  continuum  has  an 
order-type  possessing  definite  characteristics  which,  in  their  totality,  uniquely 
characterise  it.  This  expresses  the  only  kind  of  unity  which  can  appertain  to 
the  continuum,  considered  as  a  synthetic  arithmetic  construction.  From  the 
point  of  view  of  Arithmetical  Analysis  the  existence  of  the  aggregate  of  real 
numbers,  as  a  single  definite  object,  possessing  assigned  properties,  may  be 
regarded  as  a  fundamental  postulate ;  the  validity  of  such  postulation  being 
subject  to  the  law  of  contradiction.  If  it  be  held  that  we  possess  an  in- 
dependent knowledge  of  the  existence  of  the  geometrical  continuum,  derived 
by  a  process  of  idealization  from  our  intuition  of  space,  we  may  regard  the 
function  of  the  set  of  real  numbers  to  consist,  not  in  a  synthetical  formation  of 
the  concept  of  the  continuum,  but  inversely  in  an  analysis  of  the  contents  of 
the  continuum.  It  is  difficult  to  see  how  precision  can  be  introduced  into  the 
intuitional  notion  of  the  spatial  continuum  apart  from  some  theory  relating 
either  to  points  or  to  infinitesimals;  and  the  language  employed  in  such  a 
theory  must  be  of  a  symbolical  character  amounting  to  the  use  of  some  kind 
of  arithmetical  notation.  Regarding  the  geometrical  continuum  in  this  way 
as  a  single  object  of  which  we  have  a  direct  knowledge  obtained  from  our 
intuitions  of  space  and  time,  the  reduction  to  a  precise  abstract  form  may  be 
regarded  as  being  made  upon  the  assumption  that  the  s^tem  of  rational 
numbers,  with  their  limits  adjoined,  is  adequate  to  the  analytical  description 
of  the  continuum,  in  the  sense  that  each  point  in  the  continuum  is  represented 
uniquely  by  a  single  real  number,  and  that  there  is  no  point  in  the  continuum 
which  is  not  so  represented.  This  amounts  to  a  definition,  in  a  certain  sense, 
of  the  contents  of  the  geometrical  continuum.  Such  definition  is  not  the  only 
possible  definition,  but  it  is  a  legitimate  one,  provided  it  suffices  for  the 
purposes  we  have  in  view  in  Analysis  and  Geometry,  and  provided  it  does 
not  conflict  with  the  concept  of  Continuity  as  derived  from  intuition.  The 
generic  distinction  between  a  continuous  geometrical  object,  and  a  point, 
or  set  of  points,  situated  in  that  object,  is  not  capable  of  direct  arithmetic 
representation.  This  does  not,  however,  impair  the  efficiency  of  Arithmetical 
Analysis  in  dealing  with  geometrical  objects.  In  Cartesian  geometry,  for 
example,  Analysis  is  really  concerned  only  with  the  points  that  can  be 
determined  in  the  geometrical  objects  with  which  it  deals.  This  does  not 
mean  that  a  continuous  geometrical  object  is  analysed  into  points  which  are 
of  necessity  to  be  regarded  as  its  "parts,"  but  it  does  mean  that,  for  the 
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particular  purposes  of  Analytical  Geometry,  an  adequate  treatment  of  it  is  to 
regard  it  as  a  set  of  points. 

Taking  as  data  the  set  of  rational  numbers,  an  irrational  number  of  the 
continuum  is  defined  in  some  manner  which  involves  the  use  of  words  and 
symbols.  Such  a  symbol  may  have  a  unique  meaning,  or  it  may  be  of  the 
nature  of  a  variable  having  as  its  field  some  enumerable  set  of  rational,  or  of 
integral,  numbers  already  defined;  for  example  the  symbol  n  may  be  taken  to 
denote  any  integer  of  the  sequence  of  natural  numbers.  By  means  of  a  given 
stock  of  such  words  and  symbols  it  is  possible  to  define  only  an  enumerable 
set  of  elements  of  the  continuum,  each  word,  or  symbol,  being  employed  a  finite 
number  of  times  only.  On  this  ground  it  has  been  maintained*  by  Kdnig  that 
the  elements  of  the  arithmetic  continuum  fall  into  two  classes;  the  first  class 
consisting  of  numbers  capable  of  finite  definition,  and  those  of  the  second  class 
being  inherently  incapable  of  finite  definition.  If  this  view  were  justifiable, 
grave  difficulties  would  arise  in  the  whole  theory  of  the  arithmetic  continuum, 
which  is  the  basis  upon  which  Arithmetic  Analysis  rests.  For  an  object  that 
is  inherently  incapable  of  being  defined  finitely  may  be  held  not  to  be  definable 
at  all,  and  such  an  object  is  regarded  by  many,  if  not  most,  mathematicians  as 
not  being  an  existent  object  for  mathematical  thought.  The  fallacy  involved 
in  the  introduction  of  this  distinction  appears  clearly  when  the  method  and 
implications  of  Cantor's  second  proof,  given  in  §  60,  that  the  continuum  is 
unenumerable,  are  fully  scrutinized.  It  was  there  shewn  that  if,  by  means  of 
a  given  stock  of  words  and  symbols,  a  set  of  numbers  forming  an  enumerable 
aggregate  is  defined,  the  existence  of  such  aggregate  enables  us  to  introduce 
a  new  word  or  symbol,  or  to  give  a  new  meaning  to  an  existing  symbol,  which 
will  represent  the  enumerable  aggregate  itself,  and  can  then  be  employed  for 
the  purpose  of  defining  new  elements  of  the  continuum  which  do  not  occur  in 
the  aggregate.  Such  a  new  element  has  a  finite  definition  in  the  same  sense 
as  that  in  which  the  elements  of  the  enumerable  aggregate  in  question  have 
finite  definitions.  It  must  be  remembered  that  an  enumerable  aggregate  may 
itself  be  denoted  by  a  single  word  or  symbol  that  can  be  created  or  assigned 
ad  hoc,  whenever  the  enumerable  aggregate  has  been  defined. 

The  essence  of  the  proof  in  §  60  is  then  that  there  exists,  and  can  exist,  at 
any  time,  no  stock  of  words  and  symbols  which  cannot  be  increased  for  the 
purpose  of  defining  new  elements  of  the  continuum.  The  theorem  that  the 
continuum  is  unenumerable  is  equivalent  to  the  fact  that  the  assumption  of 
the  existence  of  a  final  and  complete  stock  of  words  and  symbols,  by  means  of 
which  alone  elements  of  the  continuum  can  be  defined  by  finite  definitions, 
leads  to  a  contradiction.  Those  elements  of  the  continuum  which  are  at  the 
present  time,  or  will  be  at  a  future  time,  capable  of  definition  by  means  of  the 

*  See  KSnig,  Math.  Annalcn,  vol.  i,xi,  p.  157,  also  vol.  LXIII,  p.  217.  See  also  Richard,  Acta 
Math.,  vol.  xxx,  p.  295,  and  Hobson,  Proc.  Land.  Math.  Soc.  (2),  vol.  iv,  p.  170,  and  Whitehead 
and  iiussell,  Principia  Mathematica,  vol.  i,  p.  C4. 
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words  and  symbols  in  existence,  may  form  an  enumerable  set;  but  this  does 
not  prove  that  there  exists  any  element  of  the  continuum  that  is  inherently 
incapable  of  finite  definition.  In  fact  it  can  be  proved  that  those  elements  of 
the  continuum  that  remain  when  the  hypothetical  elements  incapable  of  finite 
definition  are  removed,  form  an  aggregate  which  has  all  the  properties  of  the 
continuum,  and  can  therefore  be  identified  with  it. 

Let  us  consider  the  continuum  of  real  numbers  in  the  interval  (0,  1),  and 
let  0  denote  the  set  of  those  numbers  in  it  that  are  capable  of  finite  definition. 
In  the  first  place,  the  set  G  is  dense  in  itself,  for  any  element  of  it  is  the  limit 
of  a  sequence  of  rational  numbers,  all  of  which  belong  to  G.  Next,  the  set  G 
is  closed;  for  if  every  number  of  a  convergent  sequence  {xn}  belong  to  G,  it 
can  be  proved  that  the  limit  of  the  sequence  belongs  to  G.  Denoting  the  set 
\xn]  by  E,  the  number  which  is  the  limit  of  the  sequence  can  be  finitely  defined 
as  follows:  Let  the  numbers  of  E  be  represented  by  non-terminating  decimals 
so  that  in  none  of  them  are  all  its  digits,  from  and  after  a  fixed  one,  equal  to  9. 
Let  the  mth  digit  of  a  number  x  be  defined  as  that  digit  which  is  identical 
with  the  mth  digit  of  an  infinite  number  of  the  elements  of  E.  The  number  x 
so  defined  is  the  limit  of  the  sequence  of  numbers  all  of  which  belong  to  G, 
and  x,  being  thus  finitely  defined,  itself  belongs  to  G. 

The  set  G  is  thus  dense  in  itself  and  closed,  and  is  therefore  perfect;  it  is 
clearly  everywhere  dense  in  the  interval  (0,  1).  It  has  therefore  all  the 
properties  of  the  continuum,  and  therefore  the  hypothetical  non-definable 
elements  can  be  disregarded. 

It  will  be  observed  that  the  elements  of  the  continuum  cannot  be  exhaus- 
tively represented  by  any  finite  set  of  symbols,  each  used  a  finite  number  of 
times.  Thus  there  can  exist  no  systematic  notation  which  suffices  to  represent 
all  the  numbers  of  the  continuum. 

TRANSFINITE  ORDINAL  NUMBERS. 

64.  The  theory  of  transfinite  ordinal  numbers  had  its  origin*  in  the 
investigation  of  the  theory  of  sets  of  points.  The  general  abstract  theory 
of  such  numbers,  or  order-types,  will  be  deferred  until  Chapter  IV  ;  it  is 
necessary  however  to  introduce  here  the  conceptions  connected  with  the 
formation  of  these  numbers,  with  a  view  to  utilizing  them  in  the  theory  of 
sets  of  points. 

Let  Plt  P2>  ...  Pn,  ...  denote  a  sequence  of  points  in  a  given  interval, 
representing  a  sequence  a1}  a.,,  a3,  ...  of  increasing  numbers,  so  that 


*  An  account  of  Cantor's  earliest  presentation  of  this  subject  will  be  found  in  Math.  Annalen, 
vol.  xxi,  p.  545. 
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This  sequence  of  points  has  a  limiting  point  which  is  not  one  of  the  points  of 
the  sequence,  and  is  on  the  right  of  all  those  points;  this  limiting  point  we 
may  denote  by  Pu.  The  symbol  o>  may  be  regarded  as  denoting  <r  new  ordinal 
number  which  comes  after  all  the  ordinal  numbers  1,  2,  3,  ...  n,  ... ;  it  is 
called  the  first  transfinite  ordinal  number.  The  number  w  is  not  contained  in 
the  sequence  of  finite  ordinal  numbers,  but  comes  after  all  of  them;  and  we 
shall  see  that  it  may  be  taken  as  the  first  of  a  new  sequence  of  ordinal 
numbers,  all  of  which  must  be  regarded  as  ordinally  greater  than  the  finite 
ordinal  numbers. 

u[Ro |Wi        Pan- a *  Pu»+n          

*    ~K~      *•«  p3p»"  p«.a  B 

FIG.  1. 

Suppose  that  beyond  the  point  Pa  there  are  other  points  which  we  wish 
to  regard  as  belonging  to  the  same  set  as  the  points  Pl}  P2, ...  PM,  ...  Pw; 
then  these  points  will  be  denoted  by  P^+i,  P^+z,  •••  P^+n,  -•',  and  if  these 
points  are  finite  in  number,  there  will  be  one  of  them  Pu+m  which  is  the  last 
on  the  right.  The  indices  of  all  the  points  of  the  set  will  then  be 

1,  2,  3,  . . .  n,  . . .  a),  M  +  1,  co  +  2, . . .  <w  +  m; 

and  the  numbers  o>,  &>  -f  1,  ...  o)  +  m  are  regarded  as  a  set  of  transfinite  ordinal 
numbers,  which  commences  with  the  first  transfinite  ordinal  number  to,  and 
contains  the  m  succeeding  transfinite  ordinal  numbers.  It  may  however  happen 
that  the  set  of  points  P^,  P^+i,  Pw+2,...  has  no  last  point.  In  that  case, 
assuming  that  the  points  are  all  contained  in  a  finite  interval,  the  set  has 
a  limiting  point  which  is  not  contained  in  the  set  itself;  and  this  limiting 
point  we  denote  by  Pta+u  or  Pw .  2 ,  where  w  .  2  is  an  ordinal  number  which  is 
not  contained  in  the  set  &>,  &>  +  1,  &>  +  2,  ...,  but  comes  after  the  numbers  of 
that  set. 

If  we  wish  to  include  further  points  which  are  on  the  right  of  Pu.2)  we 
must  introduce  numbers  denoted  by  &>.  2  +  1,  CD. 2  + 2,  ...;  and,  in  case  these 
points  form  an  infinite  set  in  a  finite  interval,  they  will  have  a  limiting  point 
which  will  be  denoted  by  PM.2+<a  or  PM.S.  We  have  now  the  ordinal  numbers 

1,  2,  3,  ...0,0  +  1,  0  +  2,  ...0.2,0.2  +  1,  0.2  +  2,  ...0.3. 
If  we  proceed  further  in  this  manner,  it  is  clear  that  we  may  require  numbers 
<o  .  n,  &>.??  +  !,  o> .  n  +  2,  ...  w  .  n  +  1,  ...  ,  where  n  denotes  any  finite  number. 

Further,  it  may  happen  that  the  set  of  points  Pw,  P<o.2,  P<u.s,  ...  Pm.n,  ... 
is  itself  infinite,  and  has  a  limiting  point  on  the  right  of  all  these  points.  This 
point  we  denote  by  Pwz;  and  the  number  <a'2  we  consider  to  be  a  new  ordinal 
number  which  succeeds  all  the  numbers  <y .  n  +  m,  where  n  and  m  have  all 
possible  finite  values. 

Points  on  the  right  of  P^  may  be  denoted  by  means  of  the  indices 
a)2  +  1,  &>2-f  2,  <o2  4-  3,  . . . ;  and  if  these  points  are  infinite  in  number,  they  may 
have  a  limiting  point  PW2+W. 
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Points  on  the  right  of  P^+la  may  be  denoted  by  the  indices  &>2  +  CD  + 1, 
<a2  +  o>  +  2,  . . . ;  if  these  have  a  limiting  point,  it  will  be  denoted  by  the  index 
w2  +  a> .  2.  Proceeding  in  this  manner,  we  may  have  points  of  which  the  indices 
are  <w2  4-  o> .  3,  a>2  +  a> .  4,  . . . .  If  there  is  an  infinite  set  of  such  points,  and  the 
set  has  a  limiting  point,  on  the  right  of  the  set,  this  limiting  point  will  have 
to2  +  o>2,  or  <a2 .  2,  for  its  index. 

If  we  proceed  still  further,  we  see  as  before  that  we  may  have  to 
contemplate  numbers  of  the  form  o)2.p  +  co.q  +  r,  where  p,  q,  r  are  finite; 
afterwards  &>3,  (W3  +  1,  ...,  to3. p  +  o>2.  q  +  ca .  r  +s,  &c.  The  general  type  of 
ordinal  numbers  which  can  be  obtained  in  this  manner  is  represented  by 
«<>n  •  Pn  +  vn~l  -Pn-i  +  ...  +a)  -Pi+Po',  and  it  is  clear  that,  for  the  representation 
of  points  of  a  given  set,  such  numbers  may  be  required  as  indices. 

It  may  happen  that  the  set  of  points  whose  indices  are  &>,  o>2,  &>3,  ...  is  not 
finite;  then  the  limiting  point  of  such  set  will  be  denoted  by  the  index  co". 
Starting  afresh  with  this  number,  we  may  form  numbers  such  as 

a)wn-Pn  +  <»n~l-Pn-i  +  —  +Po. 

If  the  points  whose  indices  are  to10,  to102,  w"8, ...  do  not  form  a  finite  set,  their 
limiting  point  will  be  denoted  by  w^10. 

In  a  similar  manner  we  may  denote  by  ej  the  number  which  comes  after 
the  sequence  ca,  ca03,  cow",  &)«w<u...;  and  starting  from  e1(  we  may  similarly 
proceed  to  form  further  numbers  in  endless  succession. 

65.  All  the  ordinal  numbers  which  can  be  formed  in  the  manner  above 
described  are  formed  by  means  of  the  application  of  Cantor's  two  principles  of 
generation  (Erzeugungsprinzipien). 

(1)  After  any  number  another  immediately  succeeding  it  is  formed  by  the 
addition  of  unity. 

(2)  After  any  endless  sequence  of  numbers,  a  new  number  is  formed  which 
succeeds  all  the  numbers  in  the  sequence,  and   has  no  number  immediately 
preceding  it. 

All  transfinite  ordinal  numbers  which  can  be  formed  by  means  of  these 
two  principles  of  generation  are  said  to  be  ordinal  numbers  of  the  second  class. 
The  finite  ordinal  numbers  are  said  to  be  of  the  first  class;  they  are  formed 
successively,  starting  with  the  number  1,  by  means  of  the  first  principle  of 
generation  alone. 

The  numbers  of  the  second  class  are  of  two  essentially  distinct  species: 
(1)  non-limiting  numbers,  those  numbers  which  have  each  a  number  im- 
mediately preceding  them,  and  from  which  they  are  formed  by  the  addition 
of  unity;  for  example  w  +  n,  w" .  p  +  w  .  q  -f  1,  cow  +  w  +  1 :  and  (2)  limiting 
numbers,  those  which  have  no  number  immediately  preceding  them,  from 
which  they  are  formed  by  the  addition  of  unity ;  for  example  o>,  or  +  «,  &>"•  +  w~ 
are  limiting  numbers. 
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Any  particular  number  of  the  second  class  can  be  denoted  by  a  finite 
number  of  symbols,  but  there  is  no  upper  limit  to  the  number  of  symbols 
required  to  denote  such  numbers. 

Cantor  has  further  postulated  the  existence  of  a  number  Q  which  comes 
after  all  the  numbers  of  the  second  class,  and  is  the  first  number  of  a  new 
set  which  is  called  the  third  class.  The  number  H  cannot  be  obtained  as  the 
number  which  succeeds  a  simple  sequence,  by  means  of  the  second  principle 
of  generation;  for  every  number  which  can  be  so  obtained  is  itself  a  number 
of  the  second  class.  This  number  H  can  be  obtained  only  by  means  of  a  third 
principle  of  generation,  which  postulates  the  existence  of  a  new  number 
coming  after  all  the  numbers  of  the  complex  formed  by  the  application  of  the 
first  and  second  principles  of  generation.  The  validity  of  the  postulation  of 
the  existence  of  the  number  fi,  and  of  the  higher  numbers  of  the  third  class, 
will  be  discussed  in  Chapter  iv. 

66.  A  fundamental  property  of  the  numbers  of  the  second  class  may  be 
expressed  as  follows: 

Let  PI,  P2,  Ps,  ...  Pn, ...  Pn,  Pu+},  ...  be  an  infinite  set  of  points  such  that 
either  (1)  there  is  a  last  point  Pp,  where  /3  is  some  number  of  the  second  class, 
or  (2)  there  is  no  last  point,  but  every  index  occurs  which  is  less  than  some 
limiting  number  7  of  the  second  class,  whereas  the  index  7  itself  does  not  occur; 
the  set  of  points  is  then  enumerable. 

The  sets 

P  P  P 

•*  U  •*  -2,  . ..  ±n,  . .. 

P  P  P 

-*•&)»  -*-fa>-J-1s  •••-*•     *il -t- 71  1*»» 


where  every  index  less  than  w2  occurs,  form  an  enumerable  aggregate  of 
enumerable  sets  of  points;  and  this  has  been  shewn,  in  §  58,  to  be  itself  an 
enumerable  set.  Now  consider  the  sets 

-*  li  *  2>  PS>  •  ••  P<o>  P<o  +  l>  •  •  •  P<o.2>   •••  Pu>.3>  •  •  • 

•*«->  P|0»  +  l»  •••P<02  +  (U)  Pu)'2  +  &)  +  l  )    •  •  • 

Pu>'.2,     P(o2.2  +  l»  •••  Pw!.2  +  <o,  .•• 


such  that  in  the  first  set  there  is  every  index  less  than  a>2,  in  the  second, 
every  index  less  than  &>2.  2,  and  so  on.  Each  of  these  sets  is  enumerable,  and 
there  is  an  enumerable  set  of  such  sets ;  hence  the  whole  set,  which  contains 
every  index  less  than  &>3,  is  enumerable.  In  this  manner  it  can  be  shewn  that, 
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if  every  index  less  than  wn  occurs,  the  set  is  enumerable.  If  the  theorem 
holds  for  sets  which  contain  every  index  less  than  /91}  #2,  /83...  ,  then  it  holds 
for  a  set  which  contains  every  index  less  than  ft,  the  limiting  number  of  the 
sequence  j3},  /32,  @a ... .  For  the  points  with  indices  less  than  &,  with  indices 
^  /3j  and  <  /32,  with  indices  =  /3.2  and  <  /33,  &c.  form  an  enumerable  sequence 
of  enumerable  sets;  therefore,  by  the  theorem  of  §  58,  the  whole  set  with 
indices  <  /3  is  enumerable.  Since  the  theorem  holds  for  /3i  =  &>,  /32  =  <w2, 
/33  =  a)3,  ...  it  holds  for  /3  =  tw".  By  continual  application  of  this  method,  since 
any  number  can  be  reached  by  means  of  the>two  principles  of  generation,  and 
since  every  number  is  either  a  limiting  number,  or  is  obtained  from  one  by 
adding  a  finite  number,  we  see  that  the  general  theorem  holds. 

It  will  now  be  shewn,  conversely,  that  if  a  set  of  points  P1;  P2, ...  Pn. 
. . .  Pw,  ...  P0,  ...  is  enumerable,  there  must  be  some  definite  number  7  of  the  first 
or  of  the  second  class,  such  that  7  does  not  occur  among  the  indices  of  the  points, 
and  such  that  every  number  less  than  7  does  so  occur. 

In  case  7  is  a  limiting  number,  there  is  no  last  point  of  the  set;  but  if  7 
is  not  a  limiting  number,  there  is  a  last  point,  viz.  the  one  of  which  the  index 
is  the  number  immediately  preceding  7. 

To  prove  the  theorem,  we  observe  that,  since  the  given  set  of  points  is 
enumerable,,  it  may  be  placed  in  correspondence  with  a  set  of  points  Qlt 
Qt>  •••  Qn>  •••  in  which  all  the  indices  are  numbers  of  the  first  class.  Let  us 
suppose  that,  if  possible,  no  number  7  exists ;  and  let  Pai  be  the  point  of  {P} 
which  corresponds  to  the  point  Q1  of  {Q}.  Let  Q^  be  the  point  of  {Q}  of 
smallest  index,  such  that  the  corresponding  point  of  {P}  has  an  index  which 
is  >al;  denote  this  index  by  «2.  Then  let  Qp2  be  that  point  of  {Q},  of 
smallest  index,  such  that  the  corresponding  point  of  {P}  has  an  index  >«2; 
denote  this  index  by  «3.  Proceeding  in  this  manner,  we  have  a  set  of  points 
Qi,  QPl>  Qp.2>---Qpn>  •••  corresponding  in  order  to  a  set  of  points  Paj  Pa2, 
Paj,  ...  Pan, ...  where  a^  <  a2  <  a3  ...  <  an  <  ...  .  There  exists  a  number  a  of 
the  second  class,  which  is  the  limit  of  the  sequence  a1}  «2,  ....  an,  ... ;  and  by 
hypothesis  there  exists  a  point  Pa,  which  has  a  for  index.  Now  the  set  {Q} 
can  contain  no  point  which  corresponds  to  Pa,  because  each  point  Qn  corre- 
sponds to.  a  point  of  {P}  with  an  index  less  than  a,  and  thus  there  is  a 
contradiction  in  the  hypothesis  that  a  occurs  amongst  the  indices  of  the  points 
of  {P}.  Hence  there  exist  numbers  of  the  second  class  which  do  not  occur 
as  indices  in  the  set  {P},  and  these  numbers  form  a  set  which  is  a  part  of  the 
aggregate  of  numbers  of  the  second  class.  In  this  set  there  must  be  a  lowest 
number  7,  and  this  number  7  is  the  first  which  does  not  occur  amongst  the 
indices  of  the  set  {P}.  That  every  part  of  the  aggregate  of  numbers  of  the 
first  and  second  classes  has  a  lowest  number  will  be  shewn,  in  Chapter  IV, 
to  be  a  consequence  of  the  structure  of  the  ordered  aggregate. 
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EXAMPLES. 

1.  On  a  straight  line  AB,  let  us  denote  by  Plt  P.z,  P3,  ...,  those  points  at  which  the 
ratio  AB/PB  has  the  values  1,  2,  3,  ....    The  point  PI  coincides  with  A,  and  the  point 
B  can  only  be  represented  by  Pa.    Now  take  any  one  of  the  segments  PrPr^.l-  this  may 
for  convenience  be  represented  on  an  enlarged  scale.    Denote  by  Qn,  (J^,  Qr3,  ...,  the 
points  on  PrPr  +  l,  at  which  PrPr  +  \/QPr+\  take/}  the  values  1,  2,  3,  ...;  thus  Pr  +  l  can 
only  be  represented  by   Qru.    Supposing  this  to.  have  been  done  with  every  segment 
PrPr  +  i  of  AB,  let  us  imagine  all  the  points  Q  to  be  marked  on  AB,  and  to  be  numbered 
from  left  to  right. 

In  P\P-i,  we  shall  have  1,  2,  3,  ...  u>, 

in  P2 P3  there  will  be  &>  +  !,  «  +  2,  ...&>.  2, 

andinPa/^  w  .  2  +  1,  o> .  2  +  2,  ...  «.  3; 

the  point  B  can  be  represented  only  by  &>2.  If  now  we  proceed  to  take  each  segment 
Qr*Qr,»  +  \,  and  to  divide  it  in  a  similar  manner,  at  points  R  for  which  QrgQr,g  +  il^Qr,t  +  i 
has  the  values  1,  2,  3,  ...,  and  then  imagine  all  the  points  R  obtained  in  every  such 
segment  QT»Qr, »  +  i  to  be  marked  on  AB,  and  numbered  as  before,  from  left  to  right,  it  will 
be  seen  that  all  the  numbers  aPp  +  aq  +  r  will  be  required,  and  that  the  point  B  can  be 
represented  by  o>3.  The  points  PI,  P2,  ...  P*  will  have  for  their  ordinal  numbers 
1,  <o2,  co2.2,  w2.  3,  ...  o>3;  the  point  Qrg  will  be  numbered  o>2.  ?•  +  «.  s;  the  finite  numbers 
are  all  used  up  in  the  first  sub-segment  of  A  B.  By  proceeding  to  further  subdivision,  we 
may  exhibit  on  AB,  the  ordinal  numbers  <anpn  +  a>n~lpn-i  +  ...+p(>,  and  the  point  B  will 
then  be  represented  by  wn  +  1. 

2.  The  properties  of  the  integral  numbers  in  relation  to  their  prime  factors  may  be 
employed  to  rearrange  the   series   1 ,  2,  3,   . . . ,  so  that  the  numbers  may   be  made  to 
correspond  with  a  series  of  ordinal  numbers  of  the  first  and  second  classes. 

First  take  the  primes  1,  2,  3,  5,  7,  11,  ... ;  these  correspond  with  the  numbers  of  the 
first  class  1,  2,  3,  ...  n,  ....  Then  take  the  squares  of  the  primes,  omitting  unity ;  we  thus 
have  22,  32,  52,  72,  II2,  ...,  corresponding  to  <a,  &>  +  !,  o>  +  2,  ...  a>  +  n, 

We  then  take  the  cubes  of  the  primes, 

23,  33,  53,  73,  II3,  ...,  corresponding  to  o> .  2,  w.2  +  1,  ...a  .2  +  n,  ..., 

and  in  general,  2r  +  1,  37"1"1,  5r  +  1,  ...,  corresponding  to  co  .  r,  w.r  +  1,  ...<o.r  +  n,  ....  We 
may  then  take  the  numbers  ab  which  consist  of  the  product  of  two  prime  factors ;  these, 

arranged  in  ascending  order,  correspond  to  w2,  oj2  +  l,  ...  <a2+n, Next  take  the  numbers 

a262,  which  consist  of  the  squares  of  the  last  set;  these  correspond  to  cos+o>,  &>2  +  a>+], 

We  then  take  the  successive  sets  of  numbers  of  the  forms  a363,  a464,  ... ;  we  thus  obtain 
the  numbers  which  may  be  taken  to  correspond  with 

a)2  +  w  .  2,  w2  +  a).  2  4- 1,  ...  co2  +  to  .  3,  ...  co2  +  a) .  p  +  q,  ..., 
all  of  which  are  less  than  o>2 .  2.    The  sots  of  numbers  of  the  forms 

«2/>,  (a26)2,  ...  (a26)»,  ...  a36,  (a35)2,  ...  (o36)»  ...  a4Z>,  (a46)2,  ..., 

may  then  be  taken.  Afterwards,  we  may  proceed  with  the  numbers  which  contain  three 
different  prime  factors,  and  so  on.  It  is  clear  that  this  mode  of  rearranging  the  integral 
numbers  in  their  natural  order,  so  that  they  correspond  in  the  new  order  with  ordinal 
numbers  of  the  first  and  second  classes,  admits  of  great  variety.  In  every  case,  there  will 
be  some  lowest  number  of  the  second  class,  which  is  not  employed  in  the  correspondence 
established. 
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PROPERTIES   OF   AGGREGATES   OF   CLOSED   SETS. 

67.  The  following  theorem,  due  to  Cantor,  relating  to  a  sequence  of  closed 
sets,  each  of  which  contains  the  next,  will  be  established : 

Having  given  a  sequence  of  closed  sets  GlyG2, ...  Gm, . . .  each  of  which  contains 
the  next,  there  exists  a  closed  set  GM  each  point  of  ivhich  belongs  to  Gm,for  every 
value  of  m. 

The  sets  may  be  in  any  number  of  dimensions. 

Apply  to  the  fundamental  cell  (or  interval)  in  which  G^  is  contained  a 
system  of  nets  'D1}  D2, ...  Dn, ....  There  must  be  at  least  one  mesh  dlf  of  the 
net  DI,  that  contains  points  which  belong  to  all  of  the  sets  Grn ;  for  otherwise, 
for  some  value  of  m,  no  mesh  would  contain  a  point  of  Gm.  In  case  more  than  one 
mesh  of  Dl  has  this  property,  we  may  suppose  dl  to  be  that  one  of  such  meshes 
which  has  lowest  rank  in  the  ordered  set  of  all  the  meshes  of  D1.  Consider  next 
those  meshes  of  D%  that  are  in  c^;  as  before  there  must  be  at  least  one  of 
such  meshes  which  contains  points  that  are  in  all  the  sets  Gm\  let  d2  be  that 
one  of  such  meshes  which  has  lowest  rank  of  all  the  meshes  of  D2  that  are  in 
di.  Proceeding  in  this  manner,  there  is  determined  a  sequence  dl}d2,  •  •  •  dm, ..., 
of  meshes  belonging  to  Dl}  D2, ...  Dm, ...  respectively,  each  of  which  contains 
the  next,  and  each  of  which  contains  points  that  belong  to  Gm  for  every  value 
of  TO.  The  sequence  {dm}  defines  uniquely  a  point  P  that  is  in  all  the  meshes 
of  the  sequence.  This  point  P  is  a  limiting  point  of  Gm,  for  each  value  of  TO; 
and  since  Gm  is  closed,  it  belongs  to  Gm.  Thus  the  existence  of  at  least  one 
point  of  Gm  has  been  established.  The  set  Gu  is  possibly  finite,  and  is  in  any 
case  closed.  For,  in  an  arbitrarily  small  neighbourhood  of  a  limiting  point  P 
of  the  set  Gu,  there  exist  points  of  Ga,  and  therefore  points  of  Gm\  hence  P  is 
a  limiting  point  of  Gm.  Since  Gm  is  closed,  P  must  be  a  point  of  Gm  for  each 
value  of  TO,  and  therefore  P  belongs  to  Gu. 

The  corresponding  statement  does  not  necessarily  hold  good  for  a  sequence 
{Gm}  of  sets,  each  of  which  contains  the  next,  when  the  sets  Gm  are  not  closed; 
for  the  point  P,  in  the  above  proof,  is  then  not  necessarily  a  point  of  the 
unclosed  set  Gm.  The  set  Gw  therefore  does  not  necessarily  exist. 

This  is  illustrated  by  the  case  in  which  Gm  is  the  linear  set  of  points 

The  point  zero  is  such  that,  in  any  neighbourhood 

TO'      771+1         TO  +  2' 

of  it,  points  of  Gm  exist  for  each  value  of  TO,  but  that  point  does  not  belong 
to  Gm ;  consequently  the  set  GM  does  not  exist. 

The  following  generalization  of  Cantor's  theorem  has  been  given*  by 
Sier-pinski : 

Having  given  a  family  F  of  closed  sets,  the  necessary  and  sufficient  condition 
that  there  exists  one  point  at  least  which  belongs  to  all  the  closed  sets  P,  of  F,  is 

*  Bulletin  of  Acad.  of  Sciences  of  Cracow,  April — May,  1918. 
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that  every  finite  number  of  the  sets  P  belonging  to  the  family  F  has  at  least  one 
point  in  common. 

This  may  be  proved  by  the  use  of  a  system  of  nets. 

The  following  is  a  corollary  of  this  theorem  : 

If  F  is  a  family  of  closed  sets  P,  such  that,  if  any  pair  of  the  sets  P  be  taken, 
one  of  them  contains  the  other,  there  exists  at  least  one  point  common  to  all  the 
sets  P  which  belong  to  the  family  F. 

THE   TRANSFINITE    DERIVATIVES   OF   A   SET   OF   POINTS. 

68.  If  G  denotes  a  set  of  points  in  a  cell  (or  interval)  (a,  b),  it  has  been 
shewn  in  §  57  that  the  derivatives  Gw,  G®, ...  G(n}, ...  are  all  closed  sets,  and 
that  all  the  points  of  any  one  of  these  sets,  after  the  first,  are  contained  in  the 
preceding  set.  If  G  is  of  the  second  species,  G(n}  exists  for  all  values  of  n, 
and  in  this  case,  in  accordance  with  the  theorem  of  §  67,  the  set  D  (6r(1),  G&, . . .) 
which  contains  points  belonging  to  every  G(n)  exists,  and  is  closed.  This  closed 
set  may  be  denoted  by  6r(w),  where  <u  is  the  first  transfinite  number.  It  is 
defined  to  be  the  derivative*  of  G,  of  order  o>.  In  case  G(ta)  contains  more 
than  a  finite  set  of  points,  we  can  proceed  to  form  successive  derivatives  of 
GM  in  a  manner  similar  to  that  in  which  the  derivatives  Gn),  G&),  ...  of  G 
were  formed.  These  successive  derivatives  may  be  denoted  by  G(<a+l},  G(<a+2},  ... 
G(<a+n}, ...  ,  and  are  regarded  as  the  derivatives  of  G  of  orders  «  +  1,  <y  -f  2, 

...  a)  +n, They  have  the  same  properties  as  the  derivatives  of  finite  order, 

viz.  that  all  the  points  of  each  are  points  of  G(l),  and  that  all  the  points  of  any 
one  of  them  are  points  of  the  preceding  ones. 

It  may  happen  that  one  of  the  derivatives  6r((0+n)  contains  no  points;  then 
the  process  of  forming  derivatives  has  come  to  an  end,  the  last  one  being 
(?<»+»-!).  If  this  is  not  the  case,  a  repetition  of  the  above  reasoning  shews  that 
the  set  D(G(<a+l\  Cr(<0+2V, ...  G(<a+n\  ...)  contains  at  least  one  point,  and  is 
a  closed  set;  this  set  is  denoted  by  Cr(a"2),  and  is  defined  to  be  the  derivative 
of  G  of  order  w .  2.  In  the  same  manner  we  can  proceed  to  form  further 
derivatives,  whose  orders  are  numbers  of  the  second  class. 

In  general,  if  alt  oz,  «3, ...  an, ...  denote  a  sequence  of  numbers  of  the 
second  class,  whose  limiting  number  is  @,  the  same  reasoning  as  before 
shews  that,  if  all  the  derivatives  GM,  G(a*\  ...  G(a^ ,  ...  exist,  then  the  set 
D  (G(<Il),  G (a^ , . . .  G lan) , . . . )  contains  at  least  one  point,  and  is  a  closed  set.  This 
is  denoted  by  Gift],  and  is  defined  to  be  the  derivative  of  G  of  order  ft. 

If  we  form  the  successive  derivatives  of  the  set  G,  whose  orders  are  the 
numbers  of  the  first  and  second  classes,  it  may  happen  that  there  is  a  £rst 
number  7,  of  the  first  or  second  class,  for  which  G^y]  =  0 ;  but  this  number  7 
cannot  be  a  limiting  number  of  the  second  class. 

*  See  Cantor,  Math.  Annalen,  vol.  xvn,  p.  357. 
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It  may,  however,  happen  that  no  number  7,  of  the  first  or  second  class, 
exists  for  which  G("*}  =  0,  so  that  derivatives  of  G  exist  of  orders  corresponding 
to  all  the  numbers  of  the  first  and  second  classes.  It  will  be  shewn  in  §  82 
that,  if  G  v  does  not  vanish,  for  some  number  7,  of  the  first  or  of  the  second 
class,  then  there  necessarily  exists  a  number  j8,  of  the  first  or  second  class, 
such  that  QW=  G(?+l)  =  G(?+-}  ...  .  This  set  QW  is  a  perfect  set,  and  it  is 
frequently  denoted  by  (?(n),  where  H  is  the  first  transfinite  number  of  the 
third  class.  The  notation  G(n)  may  however  be  employed,  independently  of 
the  acceptance  of  the  theory  of  numbers  of  the  third  cla'ss. 

Conversely,  if  G(n}  does  not  exist,  G(v}  must  first  vanish  for  some  number 
7  of  the  first  or  second  class,  which  number  cannot  be  a  limiting  number. 

EXAMPLES. 

1*.     Let  0  denote  the  enumerable  set  of  points,  each  one  of  which  is  given  by 

1    ,       1  1    '  ' 

+     + 


2n  +  mi  +  ?n2  2n  +  "*i  +  mz  +  •••  +  mn  ' 

where  n  has  all  positive  integral  values,  excluding  zero,  and  mlt  mz,  ...  mn  have  all  positive 
integral  values  including  zero,  independently  of  one  another. 

It  is  easily  seen  that  in  (?("•),  the  points  —  ,  ^pfj,  ...  all  occur,  and  hence  that  (?(<*) 

—  .  — 

exists,  and  consists  of  the  single  point  zero. 

2*.     Let  G  denote  the  enumerable  set  of  points,  each  one  of  which  is  given  by 
11  1  1  1 

__  l_  4-       -I  __    -I  ---  1  ----  •  _  -  ____ 

2mi        2mi  +  TO2  2mi  +  m2~l"-  +  mn        2mJ  +  Jn2+--  +mn  +  p        2mi  +  ?"2+"+mTi  +  P  +  «i 


' 


where  m15  7W2,  ...  inn,  p,  q±,  q2,  ...  qp  have  all  positive  integral  values,  including  zero.    In 
this  case  G^*"")  consists  of  the  single  point  zero. 

3*.     Let  G  denote  the  enumerable  set  of  points,  each  one  of  which  is  given  by 
!_          1  1_  1  _  1_ 

'  '  "•"•••  "1"        +       +      +         '          +       -  + 


,  ___   _  , 

'  +      +        +     +       ~         ~ 


+  mt+  ...  +mn+p  +  gt  +  qa  +  ...  +qp  ' 

where  n,  mlt  m%,  ...  mn,  p,  qlt  ...  qp,  have  all  positive  integral  values.    In  this  case  (JO0-2) 
exists,  and  consists  of  the  single  point  zero. 


SETS  OF  INTERVALS  OR  CELLS. 

69.  The  properties  of  a  set  of  intervals,  or  of  cells,  which  intervals,  or  cells, 
are  assigned  in  any  manner,  are  closely  connected  with  the  properties  of  sets 
of  points,  and  will  therefore  be  considered  here  in  some  detail. 

*  These  examples  were  given  by  Mittag-Leffler,  Acta  Math.,  voL  iv,  p   58. 
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Let  us  suppose  that  a  finite  set  of  non-overlapping  intervals  has 
been  defined,  all  in  the  finite  interval  (a,  b).  Denoting  these  intervals  by 
(a,,  6,)  (03,  6a)  ...  (an,  6n),  let  us  suppose  that  all  the  intervals  (a,.,  br)  of  the 
set  are  such  that  none  of  their  lengths  br  —  ar  are  less  than  a  fixed  positive 
number  e.  Then  the  number  n  of  the  intervals  cannot  exceed  (6  —  a)/e.  For 
the  2n  points  a,,  &,,  a2,  6J(  ...  an,  bn  divide  (a,  6)  into  at  most  2n  + 1  parts,  the 
sum  of  the  lengths  of  which  is  b  -  a.  These  parts  consist  of  the  «  given 
intervals,  and  in  general  of  complementary  intervals;  it  follows  that  the  sum 
of  the  lengths  of  the  n  given  intervals  cannot  exceed  b  —  a;  and  thus  that  the 
number  of  the  -mbegrals  cannot  exceed  (6  —  a)/e. 

Next,  suppose  that  any  non-finite  set  of  non-overlapping  intervals  is  defined, 
all  of  the  intervals  lying  in  the  fundamental  interval  (a,  6).  Choose  a  sequence 
fi>  *a>  •••  *n,  •••  of  positive  decreasing  numbers  which  converges  to  zero.  The 
number  of  intervals  of  the  given  set  which  are  of  length  ^  en  is  finite,  since  it 
does  not  exceed  (b  —  a)'/en.  The  intervals  of  the  given  set  can  now  be  arranged 
in  order  of  their  lengths,  taking  first  those  that  are  =  e1}  then  those  that  are 
<  6j  and  =  €2;  and  so  on;  there  being  only  a  finite  number  of  intervals  in  each 
set.  In  case  a  number  of  intervals  are  of  equal  length  their  order  may  be  that 
in  which  they  occur  in  (a,  6),  from  left  to  right.  Therefore,  since  all  the 
intervals  of  the  given  set  can  be  arranged  as  a  simply  infinite  aggregate,  they 
form  an  enumerable  aggregate. 

Next,  suppose  that  the  intervals  of  a  given  non-overlapping  set  are  in  the 
unbounded  interval  (  —  oo  ,  oo  ),  in  which  the  position  of  any  point  is  denoted 
by  x.  If  we  consider  the  correspondence  £  =  x/\/x2  +  1,  where  the  radical  always 
has  the  positive  sign,  the  unlimited  interval  (  —  oo  ,  oo  )  corresponds  to  the  open 
segment  (—1,  1)  in  which  the  point  £  lies.  The  intervals  of  the  given  set 
correspond  uniquely  to  intervals  in  (  —  1,  1),  and  the  set  of  these  latter 
intervals  is  enumerable;  hence  the  given  set  is  so  also.  It  has  thus  been 
shewn  that: 

Every  set  of  intervals  in  a  bounded,  or  unbounded,  interval  which  is  such 
that  no  two  of  the  intervals  overlap  is  either  finite,  or  forms  an  enumerable 
aggregate  *. 

The  corresponding  theorem  can  be  shewn  to  hold  for  a  set  of  non-over- 
lapping cells  in  two  or  more  dimensions.  Two  such  cells  may  have  a  portion 
of  their  boundaries  in  common. 

It  will  be  sufficient  to  consider  the  case  of  plane  cells;  the  method  of  proof 
can  be  extended  to  the  case  of  cells  of  any  number  of  dimensions. 

First,  let  a  finite  set  of  non-overlapping  cells  (ar(l\  a,.'2';  br(l),  br(-}} 
(r=l,  2,  3,  ...n)  be  contained  in  the  cell  («(1),  a(2);  b(l\  6(2)),  and  let  us 
suppose  that  brw-ar(l)  £  e(1),  6r(2>  -  or(2)  £  e(2),  for  /•=],  2,  3,...n;  where  e"1, 

*  Cantor,  Math.  Annalen,  vol.  xx,  p.  118  et  seq. 
H. 
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e(2)  denote  fixed  positive  numbers.    The  number  n  of  the  cells  cannot  exceed 


For,  let  the  fundamental  cell  (a(1),  a(2);  bw,  6(2))  be  divided  into  parts  by 
constructing  straight  lines  parallel  to  the  #<2)  axis  through  the  points  on  the 
id1)  axis  which  are  represented  by  the  2n  numbers  a^,  bjl\  cr2(1),  62(1),  ... 
anw>  bn(l\  and  by  constructing  straight  lines  parallel  to  the  x(l)  axis  through 
those  points  of  the  #(2)  axis  that  are  represented  by  the  2n  numbers 

n  (2)     J,  (2)     n  (2)     J,  (2)  n    (2)     /,    (2) 

ttj      ,   I*!      ,    «2     )    W2     >  •  •  •  an     ,   Vn     . 

The  sum  of  the  products  of  the  sides  of  these  parts  into  which  the 
fundamental  rectangle  is  divided  is  clearly  equal  to  (i(I)  —  a(1))  (£>(2)  —  a(2)).  Every 
one  of  the  rectangles  of  the  given  non-overlapping  set  is  either  identical  with 
one  of  these  parts,  or  is  the  sum  of  two  or  more  of  the  parts.  It  then  easily 
follows  that 

'  if  (br(1}  -  «r(I))  (6,-(2)  -  a™)  £  (bw  -  a(1>)  (6<2)  -  a<2>); 

r=l 

and  from  this  it  follows  that  the  number  of  the  rectangles  of  the  given  set 
cannot  exceed  (Z>(1>-  a(1>)  (b(2)  -  a(2))/e(1)e<2). 

Now  let  us  consider  a  given  non-finite  set  of  cells,  no  two  of  which  overlap, 
and  all  in  the  rectangle  (a(1),  a(2);  6(1),  b(2)).  Give  to  e(1)  the  values  in  a  sequence 
of  diminishing  numbers  e^1',  e2(1),  ...  en(1),  ...  which  converges  to  zero;  e(2)  having 
the  values  in  another  such  sequence  e^2',  e2(2)> 

The  number  of  cells  of  the  given  set  which  are  such  that  one  side  is  =eM(1),  and 
the  other  >  en(2),  is  finite  ;  for  it  is  not  greater  than  (b(l)  -  a(I))  (^(2)  -  a(2))/en(1)  en(2). 

We  can  now  arrange  the  cells  of  the  given  set  in  order,  taking  first  those 
for  which  the  sides  are  =  e/11  and  =  e/2';  then  those  for  which  the  sides  are 
<  e/1',  e/2'  and  =  e2(1),  e,,(2);  and  so  on;  there  being  only  a  finite  number  in  each 
set.  In  each  of  such  finite  sets  the  cells  are  arranged  in  descending  order  of 
magnitude  of  the  products  of  the  sides,  or,  in  case  of  equality,  in  order  of  rank. 
Thus,  since  the  whole  set  can  be  arranged  as  a  simply  infinite  aggregate,  the 
set  is  enumerable. 

The  case  in  which  the  cells  of  the  set  are  in  unbounded  plane  space  can  be 
considered  as  before  by  employing  the  correspondence 


We  have  now  the  general  result: 

Any  set  of  non-overlapping  cells  in  p-dimensional  space  forms  an  enumerable 
aggregate. 

It  will  be  observed  that,  although  geometrical  language  is  employed  above, 
the  proof  of  the  theorem  is  essentially  arithmetical. 
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70.  Every  isolated  set  is  enumerable. 

This  theorem  may  be  proved  by  applying  the  theorem  in  §  69. 

Let  P(x)  be  a  point  of  an  isolated  linear  set.  The  positive  numbers  h  can 
be  divided  into  two  classes,  the  first  consisting  of  the  numbers  h  such  that  the 
closed  interval  (x,  x  +  A)  contains  no  points  of  the  given  set,  except  the  point  x, 
and  the  second  class  consisting  of  those  numbers  h  for  which  this  is  not  the 
case.  The  two  classes  will  be  separated  by  a  number  hlt  where  x  +  hi  is  either 
a  point  of  the  given  set,  or  else  is  a  limiting  point  of  the  given  set  but  does 
not  belong  to  it.  A  similar  interval  (#  —  /kj,  #)  may  be  defined  on  the  lefc  of  #; 
the  interval  (x  —  h2,  x  +  hj)  contains  no  point  of  the  given  set  in  its  interior, 
except  x.  Let  p  be  the  smaller  of  the  two  numbers  ^A2,  £A1}  and  let  the 
interval  (x—  p,  x  +  p)  correspond  to  the  point  x.  Taking  the  set  of  all 
intervals  (x  —  p,  x  +  p)  that  correspond  to  the  points  of  the  given  set,  we  see 
that  this  set  of  intervals  is  non-overlapping,  and  is  consequently  enumerable. 
Since  each  interval  corresponds  to  the  point  x  at  its  centre,  it  follows  that  the 
given  isolated  set  of  points  is  enumerable. 

In  the  case  of  an  isolated  set  in  space  of  two,  three,  or  more,  dimensions,  it 
can  be  shewn  that,  with  each  point  x  as  centre,  a  circle,  sphere,  or  hyper-sphere 
of  radius  2p  can  be  so  determined  that  no  point  other  than  x  is  interior  to  it, 
and  such  that  this  is  not  the  case  for  any  circle,  or  sphere,  of  greater  radius. 
We  then  take  the  circle,  or  sphere,  of  radius  p,  corresponding  to  each  point  x 
of  the  set.  The  set  of  all  such  circles,  or  spheres,  will  be  non-overlapping.  In 
every  sphere  a  cell  with  equal  edges  can  be  inscribed;  and  the  set  of  all  such 
cells  will  be  non-overlapping;  to  each  cell  there  corresponds  the  point  of  the 
isolated  set  at  its  centre.  It  now  follows,  as  before,  that  the  given  isolated  set 
is  enumerable. 

Any  set  of  points  G  is  the  sum  of  an  isolated  set  and  of  one  which  is  a 
component  of  G'.  It  follows  that,  if  the  derivative  G'  is  enumerable,  so  also 
is  G]  but  the  converse  does  not  hold. 

Every  set  of  points  luhich  is  of  the  first  species  is  enumerable. 

For,  if  s  be  the  order  of  G,  the  set  G(s)  contains  only  a  finite  set  of  points: 
and  therefore  G(s~l}  is  enumerable.  Consequently  G(s~'2),  G(s~3},  ...  G  are  all 
enumerable  sets. 

A  set  of  points  of  the  second  species  is  enumerable  if  one  of  its  transfinite 
derivatives  is  enumerable.  An  unenumerable  set  G  cannot  have  any  enumerable 
derivative. 

71.  Let  us  consider  a  given  set  of  overlapping  intervals,  not  necessarily 
enumerable,  contained  in  a  finite  segment  (a,  b).    Let  $  be  the  set  of  those 
points  of  (a,  b)  each  of  which  is  an  interior  point  of  one  interval  at  least  of  the 
given  set. 
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If  P  be  a  point  of  S,  it  is  interior  to  an  interval  a  of  the  given  set,  and 
therefore  a  neighbourhood  of  P  exists,  such  that  every  point  of  it  is  an  interior 
point  of  «;  and  therefore  all  the  points  of  that  neighbourhood  of  P  belong 
to  S.  Hence  P  is  an  interior  point  of  S;  and  therefore  S  is  an  open  set. 
Now  it  may  be  shewn  that: 

Every  open  linear  set  of  points  consists  of  the  interior  points  of  a  finite,  or 
enumerable,  non-overlapping  set  of  intervals. 

For,  if  x  be  a  point  of  S,  it  can  be  shewn,  in  exactly  the  same  manner  as 
in  |  70,  that  an  interval  (x  —h^,x-\-  Aj)  can  be  defined  which  has  the  property 
that  every  interior  point  of  it  belongs  to  S,  but  that  x  —  h2,  x  +  ht  do  not  belong 
to  S.  In  this  manner,  to  each  point  a,  of  S,  there  is  correlated  a  definite 
interval  S  (#);  to  all  the  points  interior  to  8(x)  the  same  interval  &(&} 
corresponds.  Now  let  us  consider  the  set  of  all  such  intervals  8;  these  intervals 
are  non-overlapping,  and  thus  form  a  finite,  or  enumerable,  set  of  intervals. 
Their  interior  points  are  identical  with  the  set  S. 

The  following  theorem  has  been  established: 

Every  set  of  intervals  contained  in  a  finite  segment  can  be  replaced  by  a  set 
of  non-overlapping  intervals  of  which  the  interior  points  are  the  same  as  those 
of  the  given  set. 

If  we  consider  the  given  set  as  a  set  of  open  intervals,  the  theorem  is 
equivalent  to  the  statement  that  the  set  of  points,  each  of  which  belongs  to 
one  or  more  open  intervals  of  the  given  set,  is  itself  an  open  set.  This  is  a 
generalization,  for  the  case  of  an  enumerable  set  of  open  sets,  of  a  theorem 
given  in  §  56. 

The  new  set  may  be  spoken  of  as  the  set  of  non-overlapping  open  intervals 
equivalent  to  the  given  set  of  open  intervals. 

In  the  case  of  a  ja-dimensional  set,  it  is  shewn  in  exactly  the  same  manner 
as  in  the  case  p  =  1,  that,  having  given  a  set  of  overlapping  cells,  the  set  S,  of 
points,  each  of  which  is  interior  to  one  at  least  of  the  cells,  is  an  open  set.  The 
nature  of  an  open  set  in  space  of  more  than  one  dimension  will  be  considered 
later. 

72.  The  properties  of  any  set  of  open  intervals  in  a  finite  segment  have 
been  shewn  to  depend  upon  those  of  a  non-overlapping  set  of  such  intervals; 
and  we  therefore  proceed  to  the  consideration  of  the  latter. 

Every  point  of  (a,  b)  which  is  not  interior  to  an  interval  of  the  non- 
overlapping  set  is  either 

(1)  a  common  end-point  of  two  intervals  of  the  given  set;  or 

(2)  a  point  interior  to,  or  at  an  end  of,  an  interval  not  belonging  to  the 
given  set,  this  interval  containing  no  point  which  is  interior  to  any  interval  of 
the  set;  or 
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(3)  a  limiting  point,  on  both  sides,  of  end-points  of  intervals  of  the  set;  or 

(4)  an  end-point  of  an  interval  of  the  given  set,  and  also  a  limiting  point, 
on  one  side,  of  end-points  of  intervals  of  the  given  set. 

If  either  a  or  6  is  an  end-point  of  an  interval,  we  reckon  that  point  as 
belonging  to  the  points  (1). 

The  points  described  in  (2)  and  (3)  may  be  described  as  external  points 
of  the  given  set;  and  if  a  or  b  is  a  limiting  point  of  end-points,  it  will  be 
reckoned  as  an  external  point. 

The  points  described  in  (4)  may  be  spoken  of  as  semi-external*  points. 

Those  points  of  the  segment  (a,  b)  which  are  not  i^oints  of  a  given  set  of  non- 
overlapping  open  intervals  form  a  closed  set. 

This  theorem  is  that  particular  case  of  the  theorem  proved  in  §  56,  that  the 
complementary  set  of  an  open  set  (relatively  to  a  closed  interval  in  which  it  is 
contained)  is  a  closed  set,  which  arises  when  the  open  set  is  the  given  set  of 
non-overlapping  open  intervals. 

It  will  now  be  shewn  thatf,  unless  a  given  set  of  non-overlapping  intervals 
is  a  finite  set,  there  must  be  at  least  one  external  or  semi-external  point;  in 
other  words,  the  whole  interval  (a,  6)  cannot  be  filled  up  by  an  indefinitely 
great  number  of  non-overlapping  intervals,  each  one  of  which  abuts  on  the 
next,  without  leaving  at  least  one  point  over,  which  is  neither  interior  to  an 
interval  nor  is  an  end-point  of  two  intervals ;  the  points  a,  b  being  regarded 
as  end-points  of  two  intervals  if  they  are  end-points  of  one  interval  of  the 
given  set. 

If  there  be  any  complementary  intervals,  i.e.  intervals  in  which  no  point 
belongs  to  the  given  set  of  non- overlapping  intervals,  then  the  points  of  these 
intervals  are  all  external  points,  and  we  therefore  need  only  consider  the 
case  in  which  no  such  complementary  intervals  exist.  We  observe  that,  when 
the  number  of  intervals  is  not  finite,  their  end- points  must  have  at  least  one 
limiting  point  P.  Now  this  point  P  cannot  be  interior  to  one  of  the  given 
intervals;  for,  if  it  were  so,  it  would  have  a  neighbourhood,  viz.  the  interval 
to  which  it  is  interior,  within  which  are  no  end-points.  Neither  can  P  be  a 
common  end-point  of  two  intervals;  for  it  would  then  have  a  neighbourhood 
on  the  right,  and  also  one  on  the  left,  within  which  there  is  no  end-point 
except  P  itself.  The  point  P  must  consequently  either  be  an  external  point, 
i.e.  one  which  is  not  an  end-point  but  is  a  limiting  point,  on  both  sides,  of 
end-points;  or  else  it  must  be  an  end-point  of  one  interval,  and  a  limiting 
point,  on  one  side,  of  end-points.  If  a,  or  b,  is  not  an  end-point,  it  is  regarded 
as  an  external  point.  It  will  subsequently  be  shewn  that  the  external  and 
semi-external  points  form  a  set  which  may  be  either  finite,  or  of  cardinal 
number  a,  or  of  cardinal  number  c. 

*  This  term  is  due  to  \V.  H.  Young,  Proc.  Lund.  Math.  Soc.  (1),  vol.  xxxv,  p.  250. 
f  This  theorem  was  given  by  W.  H.  Young,  ibid.  p.  251. 
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Having  given  an  overlapping  system  of  intervals,  those  points  that  are  end- 
points  of  intervals  of  the  system  and  are  not  interior  points  of  any  interval  of 
the  system  form  a  non-dense  enumerable  set. 

If  P  be  a  left-hand  end-point  of  an  interval  Pp  of  the  given  set,  and  if  P 
is  not  interior  to  any  interval  of  the  set,  Pp  cannot  have  in  its  interior  any 
other  left-hand  end-point  Q  of  an  interval  Qq  of  the  given  system,  such  that  Q 
is  not  interior  to  any  interval  of  the  set.  Hence  all  such  intervals  Pp  are 
non-overlapping,  and  therefore  form  an  enumerable  set. 

Their  left-hand  end-points  form  an  enumerable  set,  which  is  clearly  non- 
dense.  Similar  reasoning  applies  to  those  points  which  are  right-hand  end- 
points  of  intervals  of  the  given  set,  and  are  not  interior  to  any  interval. 

Thus  the  theorem  has  been  proved. 

EXAMPLES. 

2n~  i  —  l      2"  —  1 


( 


, 

2,  2, 

also  the  intervals  obtained  by  reflecting  these  intervals  in  the  point  \.  The  point  \  is 
external  to  all  the  intervals,  and  jet  the  limiting  sum  of  the  intervals  is  equal  to  1,  the 
length  of  the  whole  interval  (0,  1)  in  which  the  enumerable  set  of  intervals  is  contained. 

If  instead  of  reflecting  the  intervals  in  the  point  £,  we  take  the  interval  (£,  1),  the 
point  \  is  now  a  semi-external  point,  and  the  limiting  sum  of  the  intervals  is  the  same  as 
before. 

2*.    Take  the  set  ($,  1),  (0,  |)  ...  f^-g-1-,    '     +T]  ...  of  intervals,  and  divide  each 

V     £'1         &      / 

interval  into  a  set  of  sub-interval's  similar  to  the  whole.  We  now  have  a  new  enumerable 
set  of  intervals  which  has  no  external  points,  but  of  which  the  serni-external  points  form  an 
enumerable  set  |,  ^,  J,  -f$,  .... 

THE   HEINE-BOREL   THEOREM. 

73.  If  a  set  A  of  intervals,  all  in  the  fundamental  interval  (a,  b),  be  such 
that  every  point  of  the  closed  interval  (a,  b)  is  an  interior  point  of  at  least  one 
interval  of  the  set  A  (the  end-points  a,  b  being  regarded  as  interior  to  an 
interval  when  either  of  them  is  an  end-point  of  such  interval),  then  a  finite  set 
of  intervals,  all  belonging  to  A,  exists,  which  has  the  same  property  as  the  set 
A,  viz.  that  every  point  of  the  closed  interval  (a,  b)  is  interior  to  at  least  one 
interval  of  the  finite  set. 

This  theorem,  which  is  known  as  the  Heine-Borel  theorem,  is  of  great 
importance  in  the  Theory  of  Functions,  and  may  be  proved  as  follows: 

Apply  a  system  of  nets  with  closed  meshes  to  the  interval  (a,  b).    There 

must  exist  some  value  of  n  such  that  every  mesh  of  the  net  Dn  is  interior  to 

one  interval  at  least  of  the  set  A.    For,  let  us  suppose,  if  possible,  that  this  is 

not  the  case.    Let  an  denote  those  meshes  of  Dn  each  of  which  is  not  interior 

*  See  W.  H.  Young,  Proc.  Land.  Math.  Snc.  (1),  vol.  xxxv,  pp.  249—251. 
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to  any  interval  of  the  set  A.  By  hypothesis,  «„  exists  for  every  value  of  ?/; 
and  it  is  clear  that  the  closed  sets  Oi,  «2>  ...  an>  •••  are  such  that  each  set 
contains  the  next.  In  accordance  with  the  theorem  established  in  §67,  there  is 
at  least  one  point  x  that  is  contained  in  all  the  closed  sets  {crnj.  The  set  of 
all  such  points  x  is  closed ;  let  x  be  the  lower  extreme  of  the  set.  The  point 
x  is  interior  to  some  interval  (x  —  e,  x  +  e')  of  A ;  or  to  an  interval  (a,  a  +  e) 
in  case  x  =  a;  or  to  an  interval  (b  —  e',  b)  in  case  x  =  b. 

When  n  is  sufficiently  large,  dn  the  maximum  breadth  of  the  meshes  of 
Dn  is  less  than  both  e  and  e';  or,  when  x  =  a,  is  less  than  e';  or  when  x  =  b,  is 
less  than  e.  For  such  a  value  of  n,  x  is  either  interior  to  a  mesh  of  Dn,  which 
mesh  is  interior  to  the  interval  (x  —  e,  x  +  e'),  or  else  a;  is  a  common  end-point 
of  two  meshes  of  Dn  both  of  which  are  interior  to  that  interval ;  or  x  may  be 
at  a  or  at  6.  In  any  case  a  mesh  of  an  is  interior  to  an  interval  of  A ;  contrary 
to  the  supposition  made  above. 

It  has  thus  been  shewn  that,  for  a  sufficiently  large  value  of  n,  each  of 
the  meshes  of  Dn  is  interior  to  one  at  least  of  the  intervals  A ;  if,  with  each 
mesh  of  Dn  there  be  associated  an  interval  of  A  to  which  it  is  interior,  the 
finite  set  of  such  intervals  of  A  is  a  set  such  as  is  required. 

In  the  above  theorem  it  is  not  assumed  that  the  given  set  of  intervals  A 
is  enumerable.  It  has  been  proved  that  the  interval  (a,  6)  can  be  divided 
into  a  finite  number  of  parts,  each  of  which  is  interior  to  a  set  of  intervals 
belonging  to  A.  Each  such  set  of  intervals  may  be  finite,  or  infinite  (enu- 
merable or  not).  The  effective  selection  of  a  finite  set  of  intervals  of  A,  which 
satisfies  the  condition  of  the  theorem,  presupposes  a  knowledge  of  the  mode  in 
which  the  set  of  intervals  A  is  defined.  The  theorem  does  not  by  itself  pro- 
vide any  general  method  of  making  such  effective  selection.  In  the  most 
general  case,  the  interval  (a,  b)  is  divided  into  a  finite  number  of  parts, 
each  of  which  has  associated  with  it  an  (in  general)  infinite  set  of  intervals 
of  A,  to  all  of  which  it  is  interior;  the  effective  selection  of  the  required 
finite  set  of  intervals  involves  the  choice  of  one  of  the  possibly  infinite  set  of 
intervals  all  of  which  contain  one  of  these  finite  parts  of  (a,  b).  The  number 
of  such  choices  to  be  made,  in  each  particular  case,  being  finite,  the  principle 
known  as  the  "multiplicative  axiom,"  which  will  be  discussed  in  Chapter  iv, 
is  not  required  in  its  general  form. 

It  is  clear  that  the  above  proof  is  applicable  to  the  case  in  which  cells  in 
any  number  of  dimensions  take  the  place  of  linear  intervals.  The  method  of 
employment  of  systems  of  nets  is  precisely  the  same  as  in  the  case  of  the 
linear  system.  Accordingly  we  have  the  Heine-Borel  theorem  for  the  case  of 
^-dimensional  sets. 

If  a  yet  of  p-dimensional  cells  A^',  all  in  the  fundamental  cell  (a,  b),  be  such 
that  every  point  of  the  closed  cell  (a,  b)  is  an  interior  point  of  at  least  one  cell 
of  A(p)  (a  point  on  the  boundary  of  (a,  b}  being  regarded  <ts  interior  to  a  cell  to 
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whose  boundary  it  belongs},  then  a  finite  set  of  cells,  all  belonging  to  A(p),  exists, 
which  has  the  same  property  as  the  set  A(p). 

74.  The  Heine-Borel  theorem  can  be  generalized  so  as  to  apply  to  the 
case  in  which  the  points  that  are  to  be  interior  to  a  finite  number  of  intervals, 
or  cells,  form  any  given  closed  set  G,  instead  of  consisting  of  all  the  points  of 
the  closed  cell  (or  interval)  in  which  the  given  set  A'**'  is  contained.    The 
generalized  theorem  may  be  stated  as  follows: 

Having  given  a  (bounded)  closed  set  G,  and  a  set  of  cells  &(p\such  that  each 
point  of  G  is  interior  to  one  cell  at  least  of  the  set  A(^',  there  exists  a  finite  set 
of  cells,  all  belonging  to  A(p),  such  that  each  point  of  G  is  interior  to  one  at 
least  of  the  cells  of  the  finite  set. 

To  establish  this  theorem  only  a  slight  modification  of  the  proof  of  the 
theorem  whe*n"(:r  consists  of  all  the  points  of  (a,  b)  is  required.  We  may 
choose  the  cell  (a,  6)  so  that  it  contains  G,  and  we  can  neglect  any  part  of  the 
set  A(p)  that  is  not  in  (a,  b).  Those  meshes  of  the  nets  D1}  D2,  ...  which  con- 
tain no  points  of  G  are  throughout  disregarded.  The  finite  set  an  consists  of 
those  meshes  of  Dn  that  contain  points  of  G  and  are  not  interior  to  any  cell  of 
A(p).  It  is  then  clear  that  the  point  x  is  necessarily  a  point  of  the  closed  set  G. 

If  the  given  set  G  were  not  closed,  the  point  x  might  be  a  limiting  point 
of  G  which  did  not  belong  to  G,  and  thus  the  argument  would  break  down. 

The  following  theorem  may  be  established  by  employing  the  generalized 
Heine-Borel  theorem: 

Having  given  a  closed  set,  an  enumerable  set  can  be  determined  which  has 
for  derivative  the  given  set. 

With  each  point  of  the  closed  set  G  as  centre  let  there  be  taken  a  cell  of 
span  d.  A  finite  number  of  these  cells  are  such  that  every  point  of  G  is 
interior  to  one  of  them;  let  this  set  be  Cd,  and  let  P^,  P,}d\  ...  Pnd(d)  be  their 
centres,  all  points  of  G.  Let  d  have  a  sequence  of  decreasing  values  dlt  dz,  ... 
converging  to  zero. 

The  totality  of  the  points  P^,  ...  Pndid)  taken  for  every  value  of  d  in  the 
sequence  is  an  enumerable  set  H  of  points  of  G.  Since  H  consists  of  points 
of  the  closed  set  G,  every  limiting  point  of  H  is  a  point  of  G.  Every  point  of 
G  is  interior  to  a  sequence  of  cells  belonging  to  Cdl,  Cd2,  ...  and  is  therefore 
a  limiting  point  of  the  centres  of  the  cells,  i.e.  every  point  of  G  is  a  limitin 
point  of  #.  Thus  #'=(?. 

75.  It  has  been  pointed  out*  by  de  la  Valise  Poussin  that  the  generalized 
form  of  the  Heine-Borel  theorem  still  holds  good  in  case  A(p)  is  replaced  by 
an  aggregate  S  of  open  sets.    This  may  be  stated  as  follows: 

If  a  closed  (bounded)  set  G,  and  an  aggregate  S  of  open  sets,  be  such  that 

*  See  Integrates  de  Lebcsgve,  pp.  15,  112. 
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every  point  of  G  belongs  to  at  least  one  of  the  sets  of  S,  a  finite  number  of  the 
sets  of  S  exists  such  that  each  point  of  0  belongs  to  at  least  one  of  them. 

To  prove  that  theorem,  it  may  be  observed  that  with  each  point  P  of  S,  a 
cell  may  be  associated,  with  P  in  its  interior,  and  such  that  the  cell  is  interior 
to  one  or  more  of  the  open  sets  of  S.  The  aggregate  of  all  such  cells  may  be 
taken  to  be  the  set  A(p)  in  the  generalized  form  of  the  Heine-Borel  theorem. 
A  finite  part  of  the  set  A(p)  exists  such  that  each  point  of  G  is  interior  to  one 
or  more  of  the  cells  of  this  finite  set.  Each  of  these  cells  is  contained  in  an 
open  set  belonging  to  S.  Hence  there  exists  a  finite  aggregate  of  the  open 
sets  of  8  which  has  the  required  property. 

In  case  the  open  sets  S  form  an  aggregate  Hly  H.2,  ...  //„,  ...  7/0,  ...  such 
that  each  one  is  contained  in  the  next,  the  closed  set  G,  each  point  of  which  is 
in  one  or  more  of  the  open  sets,  must  be  contained  in  H$,  where  /3  is  some  fixed 
number  of  the  first  or  of  the  second  class. 

For  a  finite  number  of  the  sets  //  exists  such  that  every  point  of  G  is 
contained  in  one  or  more  of  them.  ,If  /3  be  the  index  of  that  one  of  this  finite 
set  which  has  the  highest  index,  G  must  be  contained  in  Hp. 

From  this  theorem,  the  following  may  be  deduced : 

//'  G},  GZ,  ...  Gw,  ...  G$,  ...  are  closed  sets  such  that  each  contains  the  next, 
and  F  is  a  closed  set  of  points  such  that  no  point  of  F  is  contained  in  all  the 
sets  G,  there  exists  a  definite  number  /3,  of  the  first,  or  of  the  second,  class,  such 
that  no  point  of  F  belongs  to  Gp. 

To  prove  this,  we  consider,  with  respect  to  some  cell  containing  6?!,  open 
sets  C(Gl),  C(Go),  complementary  to  Glt  Gz,  ....  Each  of  these  open  sets  is 
contained  in  the  next.  Any  point  of  F  is  contained  in  some  of  the  sets  C(G}), 
C(G2),  ... ;  and  the  set  F  is  therefore  contained  in  C (G&)  for  some  value  of  ft, 
and  is  therefore  not  contained  in  G0. 

It  has  already  been  pointed  out  that,  when  G  is  a  set  that  is  not  closed, 
and  each  point  of  G  is  interior  to  at  least  one  of  the  cells  of  a  given  set,  there 
does  not  necessarily  exist  any  finite  part  of  the  given  set  of  cells  such  that 
every  point  of  G  is  interior  to  one  of  them.  However,  in  any  case,  an  enu- 
merable part  of  the  given  set  of  cells  exists  which  has  this  property.  Thus: 

If  G  be  any  set  of  points,  not  necessarily  closed,  and  each  point  of  G  be 
interior  to  one  at  least  of  the  cells  of  a  given  unenumerable  set  A(p),  there  exists 
an  enumerable  set  of  cells  all  belonging  to  A(^',  such  that  every  point  of  G  is 
interior  to  one  at  least  of  the  cells  of  this  enumerable  set. 

This  theorem  can,  as  in  the  case  of  the  Heine-Borel  theorem,  be  proved  by 
means  of  a  system  of  nets  applied  to  the  finite,  or  infinite,  interval,  or  cell,  in 
which  G  is  contained. 

Of  a  net  Dn,  for  any  large  enough  value  of  n,  say  nlt  some  of  the  meshes 
containing  parts  of  G  will  be  interior  to  one  or  more  of  the  cells  (or  intervals) 
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of  the  given  unenumerable  set.  Let  these  meshes  be  denoted  by  &.  For 
values  of  n  greater  than  ^ ,  we  consider  only  those  meshes  of  Dn  that  are  not 
contained  in  &.  For  some  such  value  of  n,  say  n2,  there  will  be  meshes  y32, 
containing  points  of  G,  not  contained  in  /?!,  each  of  which  is  contained  in  one 
or  more  of  the  intervals,  or  cells,  of  the  given  set.  We  obtain  in  this  way  a 
sequence  /3j,  /&,,  @3,  ...  of  finite  sets  of  meshes,  each  of  which  contains  points 
of  G,  and  is  contained  in  one  or  more  of  the  intervals,  or  cells,  of  the  given  set. 
Each  point  of  G  must  be  defined  by  a  sequence  of  meshes  belonging  to  this 
set  {j3},  for  such  point  is  interior  to  an  interval,  or  cell,  of  the  given  set.  The 
meshes  of  /9j ,  /32,  •  •  •  form  an  enumerable  set,  and  corresponding  to  each  of 
such  meshes,  there  exists  a  cell  (or  interval)  of  the  given  set.  Hence  there 
exists  an  enumerable  part  of  the  given  set  such  that  every  point  of  G  is 
interior  to  one  or  more  of  the  intervals,  or  cells,  of  this  enumerable  part. 

In  the  choice  of  one  out  of  a  possibly  infinite  number  of  cells  which  have 
a  given  mesh  in  their  interiors,  the  number  of  choices  being  in  this  case 
indefinitely  great,  the  "multiplicative  axiom"  is  in  general  involved. 

However,  the  following  theorem  may  be  established*,  which  is  sufficient 
for  applications,  and  in  the  proof  of  which  no  use  is  made  of  the  multiplica- 
tive axiom: 

If  each  point  of  a  given  set  is  interior  to  a  definite  cell  corresponding  to 

that  point,  an  enumerable  set  of  these  cells  can  be  so  determined  that  all  the 
points  of  the  set  are  interior  to  one  or  more  of  the  cells  of  the  enumerable  set. 

If  the  set  G  be  not  bounded,  we  may  place  it  in  correspondence  with  a 
bounded  set,  so  that,  to  the  given  cells,  there  corresponds  a  set  of  cells  having  the 
corresponding  property  with  reference  to  the  bounded  set  of  points.  There  is 
accordingly  no  loss  of  generality  in  considering  a  bounded  set  only,  which 
we  can  suppose  to  be  contained  in  a  fixed  cell  with  equal  sides.  Instead  of 
the  cell  of  the  given  set  A  (P)  corresponding  to  a  point  P  of  the  given  set  G, 
we  may  substitute  a  cell  A'(P)  with  the  point  P  as  centre  and  with  its  sides 
all  equal,  and  take  A'  (P)  to  be  the  greatest  such  cell  which  has  no  point 
exterior  to  A  (P).  We  have  now,  corresponding  to  each  point  P  of  the  given 
set,  a  cell  A'(P)  with  P  as  centre.  Take  a  set  of  positive  diminishing  numbers 
{fn}  which  converges  to  zero,  and  let  Gn  be  that  part  of  G  which  consists  of 
points  for  which  the  spans  of  the  corresponding  cells  A'  are  >  eM.  Since  every 
point  of  G,,.  is  in  a  cell  (with  equal  sides)  of  spans  all  >  en,  every  point  of  Gn'  is 
in  a  cell  of  this  system.  Hence  a  finite  set  of  these  cells  can  be  determined  which 
contain  in  their  interiors  all  the  points  of  Gn',  and  therefore  all  the  points  of 
Gn  with  the  possible  exception  of  a  finite  number.  By  adding  the  cells  cor- 
responding to  this  finite  number  of  points  of  Gn,  we  obtain  a  finite  set  of  the 
cells  A'  which  contain  all  the  points  of  Gn.  We  have  now  a  finite  set  of  cells 

*  See  Borel,  Le.^nn*  sur  les  /auctions  monogcnes,  p.  12. 
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which  contain  all  the  points  of  Glf  and  a  finite  set  which  contain  all  the 
points  of  (ra  that  do  not  belong  to  (?,;  and  generally,  a  finite  set  of  cells 
which  contain  all  the  points  of  Gn  that  do  not  belong  to  (?,,  G2,  ...  or  #„_,. 
Taking  every  value  of  n,  we  have  now  an  enumerable  set  of  the  cells  A'  that 
contain  all  the  points  of  G,  and  we  take  the  set  of  cells  A  which  correspond 
to  the  enumerable  set  of  cells  A'  as  the  required  set. 

76.  In  a  proof  given  by  Heine*  that  every  continuous  function  is  uniformly 
continuous  there  is  contained  the  germ  of  the  theorem  now  called  the  Heine- 
Borel  theorem,  as  it  was  first  explicitly  stated  and  proved  by  Borelf,  for 
the  case  of  a  linear  interval  (a,  6).  Various  proofs  of  the  theorem  have  since 
been  published  J.  It  was  pointed  out  by  H.  F.  Baker§  that  the  proof  by  Goursat|| 
of  Cauchy's  fundamental  theorem  in  the  theory  of  functions  of  a  complex 
variable  contains  a  general  method  of  procedure,  equivalent  to  the  use  of  nets 
made  in  §  73.  The  extension  to  the  case  of  any  closed  set  G  was  given  11  by 
W.  H.  Young,  and  by  Borel**. 

By  means  of  an  analysis  of  the  reasoning  of  Heine,  W.  H.  Young  was  ledff 
to  the  discovery  of  the  following  more  general  theorem*,  called  the  Heine- 
Young  theorem: 

If,  with  each  point  x  of  a  closed  interval  (a,  b),  we  have  associated  a  pair 
of  intervals  rx  and  lx>  such  that-,  (1)  #  is  the  left-hand  end-point  of  rx  and  the 
right-hand  end-point  of  lx  and,  (2)  if  x  is  an  internal  point  of  lx,  then  x  is  an 
internal  or  end-point  of  rx> ;  there  then  exists  a  finite  number  of  the  intervals  rx 
abutting  end  to  end  and  covering  the  whole  segment  (a,  b). 

It  is  unnecessary  to  suppose  r^  and  la  to  be  defined. 

Starting  with  a  as  end-point,  we  take  the  interval  ra,  or  (a,  #j);  we  then 
take  the  interval  rXl  or  (#1}  #2);  and  so  on.  We  thus  obtain  a  series  of  intervals 
of  the  given  set,  (a,  x,}  (x1}  x2)  ...  (xn^,  xn)  .... 

If,  for  some  value  of  n,  xn  =  b,  the  condition  of  the  theorem  is  satisfied  by 
the  n  intervals  thus  obtained.  Otherwise,  the  sequence  x1}  x2, ...  has  a  limiting 
point  X  on  the  right  of  all  the  points  [xn\.  There  is  then  an  interval  lx  with 
X  as  right-hand  end-point. 

All  the  points  \xn]  except  a  finite  number  of  them  are  interior  to  the 
interval  lx\  let  these  be  xn,xn+l,  ....  By  the  hypothesis  (2)  in  the  statement 
of  the  theorem  rXn  must  reach  at  least  as  far  as  X,  and  this  is  not  the  case,  as 
it  reaches  only  to  xn+l.  Hence  the  sequence  [xn]  cannot  be  infinite.  Thus  the 
theorem  is  established. 

*  Journal  f.  reine  und  anyewandte  Mat.,  vol.  LXXIV  (1871),  p.  188. 

t  Ann.  de  I'Ecole  Norm.  (3),  vol.  xn  (1895),  p.  50. 

J  See  for  example  Lemons  sur  I' integration,  p.  105. 

§  Proc.  Loud.  Math.  Soc.  (1),  vol.  xxxv,  p.  459,  and  (2),  vol.  i,  p.  24. 

||  Trans.  Amer.  Math.  Soc.  vol.  i,  p.  15. 

H  Proc.  Loud.  Math.  Sue.  (1),  vol.  xxv,  p.  387.  For  a  further  extension  of  the  theorem,  see 
W.  H.  Young,  Mess,  of  Math.,  vol.  xxxni,  p.  129,  and  also  Proc-.  Loud.  Math.  Soc.  (2),  vol.  n, 
p.  (J7.  **  Comptes  Rendus,  Jan.  1905.  ft  Proc.  Lond.  Math.  Soc.  (2),  vol.  xiv,  p.  114. 
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In  case  there  exists  no  interval  lb>  so  that  the  conditions  of  the  theorem 
hold  for  the  half-open  interval  (a,  b),  there  exists  an  enumerable  set  of  the 
intervals  rx  abutting  on  one  another  and  covering  the  half-open  interval  (a,  6). 
For  it  is  clear  that  in  this  case  the  point  X  must  coincide  with  b. 

The  following  theorem,  originally  due  to  Lusin*,  and  employed  by  him  in 
the  theory  of  derivatives  of  a  function,  follows  from  the  Heine-Young  theorem : 

If,  associated  with  every  point  of  a  closed  interval  (a,  b),  we  have  all  the 
intervals  with  x  as  end-point  that  lie  in  a  certain  neighbourhood  of  the  point  x 
on  both  sides  (except  for  the  points  a,  b  where  the  neighbourhood  is  on  one  side), 
then  a  finite  number  of  these  intervals-  exists,  abutting  end  to  end,  and  covering 
the  interval  (a,  b). 

At  each  point  x  we  take  for  rx  the  smallest  interval  with  x  as  left-hand 
end-point  containing  all  those  of  the  given  intervals  which  have  x  for  end-point, 
whether  originally  associated  with  x  or  not;  and  for  lx  the  part  of  the  given 
neighbourhood  on  the  left  of  x. 

There  is  then  a  finite  set  of  these  intervals  rx  covering  (a,  b),  and  abutting 
on  one  another.  Let  these  be  (a,  x^)  (#1,  #2) ...  (xn-i>  b)',  and  consider  (xr_ly  xr). 
We  may  choose  an  interval  of  the  given  set  with  xr^  as  left-hand  end -point; 
if  this  interval  does  not  reach  to  xr  we  add  one  of  the  given  intervals  associated 
with  the  point  xr,  and  thus  two  of  the  given  intervals  make  up  (xr_1}  xr). 
Doing  this  for  each  interval  (a?,— i,  xr)  we,  by  a  finite  number  of  choices,  obtain 
the  required  finite  set  which  covers  (a,  b). 

By  means  of  Lusin's  theorem  the  Heine-Borel  theorem  can  now  be  deduced. 
If  we  replace  those  intervals  of  A  which  contain  x  as  interior  point  by  all 
intervals  with  x  as  end-point  contained  in  each  one  of  the  intervals  of  A,  we 
have  the  condition  in  Lusin's  theorem  satisfied.  The  finite  set  of  abutting 
intervals  which  covers  (a,  b)  having  been  determined,  we  replace  each  one  of 
the  intervals  of  that  finite  set  by  an  interval  of  A  that  contains  it;  this 
requires  only  a  finite  number  of  choices;  thus  the  Heine-Borel  theorem  is 
deduced. 

77.    The  following  theorem  will  now  be  proved : 

If  any  unen  umerable  set  of  overlapping  intervals  in  (a,  b)  be  given,  then  an 
enumerable  set  of  intervals  all  belonging  to  the  given  set  exists,  of  which  the 
interior  points  are  the  same  as  tltose  of  the  given  set. 

It  has  been  shewn  in  §  71  that  the  given  set  can  be  replaced  by  a  non- 
overlapping  set  of  intervals  with  the  same  interior  points.  An  interval  of  this 
second  set  is  however  not  in  general  an  interval  of  the  given  set. 

Let  PQ  be  an  interval  of  the  equivalent  non-overlapping  set;  then  every 
internal  point  of  PQ  is  an  internal  point  of  one  interval  at  least  of  the  given 

*  Recueil  de  la  soc.  mat.  de  Moscou,  vol.  xxvni,  2  (1911). 
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set.  The  point  P  is  either  an  end-point  of  some  interval  Pp  of  the  given  set, 
or  else  it  is  a  limiting  point  of  end-points  of  an  infinite  number  of  intervals  of 
the  given  set.  In  the  latter  case  there  exists  an  enumerable  sequence  Ptpi, 
Pa/>2,  Pap3,  ...  of  intervals  of  the  given  set  such  that  P  is  the  limiting  point 
of  the  sequence  of  points  P,,  P2,  ...  Pn Similarly,  unless  Q  is  an  end- 
point  of  an  interval  qQ  of  the  given  set,  it  is  the  limiting  point  of  a  sequence 
Qn  Qa>  •••  Qn,  ...of  end-points  of  intervals  </,Qi,  q2Q2,  ...  qnQn,  •••  of  the 
given  set.  Consider  the  intervals  PiQi,  PtQa,  ...  PnQn>  •••,  where  Pj,  P2,  ... 
may  be  taken  all  to  coincide  with  P  in  case  the  interval  Pp  belongs  to  the 
given  set,  a  similar  convention  being  made  as  regards  Q.  Since  every  point 
of  PiQi  is  interior  to  some  interval  of  the  given  set,  therefore,  in  accordance 
with  the  Heine-Borel  theorem,  a  finite  number  of  intervals  of  the  given  set 
exists,  such  that  every  point  of  PjQj  is  interior  to  one  at  least  of  them.  Let 
a  similar  determination  of  a  finite  set  of  intervals  be  made  for  each  of  the 
intervals  P.2Q2,  PaQ3,  •••  PnQn,  •••',  we  have  then  altogether  an  enumerable 
set  of  finite  sets  of  intervals.  The  totality  of  these  intervals  forms  a  finite,  or 
an  enumerable,  set  of  intervals  belonging  to  the  given  set,  which  contains 
every  point  in  the  interior  of  PQ  as  an  interior  point.  Applying  the  same 
process  to  each  interval  PQ  of  the  equivalent  non-overlapping  set,  and  re- 
membering both  that  the  intervals  PQ  form  a  finite,  or  an  enumerable,  set,  and 
that  an  enumerable  set  of  finite  or  enumerable  sets  is  itself  enumerable,  we 
derive  the  conclusion  that  an  enumerable  set  of  intervals  exists,  all  belonging 
to  the  given  set,  such  that  their  internal  points  are  identical  with  those  of 
the  given  set. 

The  assumption,  here  made,  that,  if  P  be  a  limiting  point  of  end-points 
of  intervals  of  the  given  set,  a  sequence  of  such  end-points  exists  which 
converges  to  P,  will  be  discussed  in  Chap.  iv. 

THE  LEBESGUE  CHAIN  OF  INTEEVALS. 

78.  For  the  purposes  of  the  Theory  of  Functions  it  is  convenient  to 
consider  a  set  of  intervals  in  a  given  linear  segment,  of  the  kind  known  as  a 
Lebesgue  chain. 

If  (a,  b)  be  a  given  linear  interval,  a  set  Cab  of  non-overlapping  closed  intervals 
such  that  every  point  of  the  closed  interval  (a  ^  x  ^  b)  is  either  a  left-hand  end- 
point  of  one  of  the  intervals  or  an  internal  point  of  one  of  these  intervals,  is  said 
to  form  a  chain  stretching  from  a  to  b.  The  chain  is  supposed  to  have  a  last 
interval  which  contains  b  as  interior  point  or  as  left-hand  end-point. 

To  consider  the  nature  of  such  a  chain,  let  us  suppose  that  we  have  defined 
an  interval  (a,  a?j)  with  a  as  left-hand  end-point,  then  a  set  of  intervals 
(#u  #2),  (#2>  ^3),  •••  from  left  to  right.  In  case  the  right-hand  end-point  of  one 
of  these  intervals  is  6,  we  have,  by  adding  one  interval,  a  chain  consisting  of  a 
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finite  number  of  closed  intervals,  reaching  from  a  to  6.  But,  if  this  is  not  the 
case,  the  points  «,,  x2,  ...  xn,  ...  will  have  a  limiting  point  £  on  their  right, 
which  may  coincide  with  b,  in  which  case  we  have,  by  adding  one  more  interval, 
a  chain  reaching  from  a  to  b  consisting  of  an  enumerable  set  of  intervals.  But 
if  £  <  b;  starting  from  £,,  a  set  of  intervals  (£„  £,),  (£2,  £,),  ...  is  supposed  to 
be  defined.  If  these  reach  from  £,  to  6;  either  by  taking  a  finite  number,  or 
by  taking  an  enumerable  set,  together  with  one  more  interval,  8,  we  have 
again  a  chain  from  a  to  b,  consisting  of  the  intervals 

Oi,  #2),  0»a>  #s)  •••  (£i>  £a)»  (&•  £s)  •••  •  & 

But  if  £1,  £>,  •••  nave  a  limiting  point  ^  (<  6),  we  proceed  as  before  to  set  up 
new  intervals  proceeding  from  rjl.  This  process  may  be  carried  on  indefinitely, 
until  a  chain  is  defined  reaching  from  a  to  b.  That  this  will  always  happen 
when  any  set  of  rules  is  prescribed  for  the  definition  of  the  intervals  is  asserted 
in  the  following  theorem*  : 

If,  for  each  point  of  the  closed  interval  (a,  b),  there  be  assigned,  by  some 
prescribed  set  of  rules,  one  single  interval  with  the  point  as  left-hand  end-point, 
then,  if  X  be  any  point  in  the  closed  interval  (a,  b),  there  is  one  and  only  one 
chain  stretching  from  a  to  X,  composed  of  intervals  of  the  given  set. 

It  will  be  observed  that  the  theorem  asserts  (1)  the  existence  of  a  chain 
from  a  to  X,  and  (2)  its  uniqueness.  Also,  if  a  chain  stretching  from  a  to  X 
exists,  and  X'  be  any  point  on  the  left  of  X,a  chain  from  a  to  X'  exists  which 
is  a  part,  or  the  whole,  of  the  chain  from  a  to  X. 

If  (a,  #j)  be  the  interval  corresponding  to  a,  the  theorem  is  clearly  true  for 
all  points  x  such  that  a  ^  x  ^  x1 . 

Let  us  assume,  if  possible,  that  there  exists  in  (a,  6)  a  set  of  points  G,  for 
which  the  property  in  the  theorem  does  not  hold  good.  The  set  G  must  have 
a  lower  boundary  point  X  (>  n\),  which  may  or  may  not  belong  to  G.  Let  us 
first  suppose  that  X  does  not  belong  to  G',  it  is  consequently  a  limiting  point 
of  G.  There  is,  by  hypothesis,  a  unique  chain  Cax  reaching  from  a  to  X. 
Since  Cax  has  a  last  interval  (X  —  e',  X  +  e),  or  (X,  X  +  e),  the  chain 
Cax  reaches  from  a  to  £  where  £  is  such  that  0  <  £  <  e.  Now  Cax  is  the  only 
chain  that  can  reach  from  a  to  £;  for  any  chain  that  reaches  from  a  to  £  must 
contain  some  chain  that  reaches  from  a  to  X,  and  this  can  only  be  the  chain 
Cax.  It  follows  that,  in  a  certain  neighbourhood  of  X  on  its  right,  there  are 
no  points  of  G,  which  is  contrary  to  the  hypothesis  that  X  is  a  limiting 
point  of  G. 

Next,  let  us  suppose  that  X  belongs  to  G,  whether  it  be  a  limiting  point 
of  G  or  not.  Consider  a  sequence  of  points  a;1,  x2,  ...  xn,  ...  of  which  X  is 
the  limiting  point,  and  such  that  ^  <  xz  <  x.A . . .  <  xn  <  . . .  .  A  unique  chain 
CaXn  stretches  from  a  to  xn ;  also  if  ri  >  n,  Cax«  is  a  part,  or  the  whole,  of  Cax». 

*  See  Pal,  fiend,  di  Palermo,  vol.  xxxm,  p.  352  (1912);  also  G.  ('.  Young,  Quart.  Journ.  of 
Math.,  vol.  XLVII,  p.  142,  and  W.  H.  &  G.  C.  Young,  Proc.  Lond.  Math.  Sue.  (2),  vol.  xiv,  p.  128. 
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Consider  the  complete  set  C  of  intervals  each  of  which  belongs  to  Ca**,  from 
and  after  some  value  of  n.  Then  C,  together  with  the  interval  (X,  X  +  e), 
corresponding  to  the  point  X,  constitutes  a  chain  reaching  from  a  to  X.  For 
each  point  on  the  left  of  X  is  reached  by  Cax»  for  some  value  of  n,  and  is 
therefore  an  interior  point,  or  a  left-hand  end-point  of  an  interval  of  C.  Any 
chain  reaching  from  a  to  X  contains  as  a  part  a  chain  from  a  to  xn,  and  this 
can  only  be  Ca*».  As  this  holds  for  every  value  of  n,  any  chain  from  a  to  X 
must  contain  Cax»  for  every  value  of  n ;  and  therefore  it  contains  C.  Also  C 
must  be  independent  of  the  particular  sequence  [xn}\  for  any  point  x'm  of 
another  sequence  must  lie  in  an  interval  (xn,  xn+l),  and  the  unique  chain  from 
a  to  x'm  must  be  a  part,  or  the  whole,  of  the  chain  from  a  to  xn+l.  Any  chain 
from  a  to  X  can  thus  only  consist  of  C  and  the  interval  (X,  X  +  e);  thus  there 
is  a  unique  chain  reaching  from  a  to  X,  contrary  to  the  hypothesis  that  X 
belongs  to  G.  It  now  follows  that  no  such  set  as  Gr  can  exist,  and  the  theorem 
is  therefore  established. 

79.  The  reasoning  by  which  Lebesgue  establishes  the  existence  of  a  chain 
from  a  to  b  depends  upon  the  employment  of  transfinite  numbers  of  the  second 
class,  and  upon  the  principle  that  every  enumerable  ordered  aggregate  must 
be  exhausted  before  some  particular  number  of  second  class  is  reached.  If  a 
set  of  intervals 

(a,  a^)  (a?!,  ara) ...  (#„_„  a?B)  ...  (a?M,  xu+1)  ...(#/»,  «fc+i)  ••• 

be  numbered  from  left  to  right,  the  numbers  employed  will  all  be  of  the  first, 
or  of  the  second,  class.  If  the  process  of  construction  does  not  cease  before 
some  particular  number  of  the  second  class  is  used,  the  set  of  intervals  would 
be  unenumerable ;  and,  as  they  are  non-overlapping,  this  is  impossible  (§69). 
Thus,  by  a  set  of  such  intervals,  starting  with  a,  the  point  b  must  be  reached, 
and  the  existence  of  a  chain  reaching  from  a  to  6  is  established. 

It  may  happen  however  that,  corresponding  to  each  point  x  of  (a,  b),  not  a 
unique  interval,  but  a  finite  or  infinite  set  of  intervals  is  defined,  with  x  as 
left-hand  end-point.  In  that  case,  the  theorem  as  to  the  existence  of  a  Lebesgue 
chain  takes  a  more  general  form  than  that  of  Pal,  given  above,  and  the 
proof  given  in  §  78  fails*.  For  chains  from  a  to  xl  and  from  a  to  xz,  where 
x.,  > xl}  may  both  exist,  but  it  does  not  follow  that  the  former  is  part  of  the 
latter.  The  general  existence  theorem  may  be  stated  as  follows: 

If,  to  each  point  x  of  the  interval  (a,  b)  (except  b)  there  be  assigned  a  set  of 
intervals  (finite  or  infinite)  with  x  as  left-hand  end-point,  an  enumerable  non- 
overlapping  set  of  the  intervals  can  be  so  determined  as  to  contain  every  point 
of  the  semi-open  interval  (a,  b)  as  interior  or  left-hand  end-point. 

A  proof  of  this  theorem  not  involving  an  infinite  number  of  acts  of  choice 
is  a  desideratum. 

*  W.  H.  &  G.  C.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xiv,  pp.  128—130. 
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For  most  of  the  purposes  of  the  theory  of  functions  the  more  restricted 
form  of  the  theorem,  in  which  a  single  interval  is  denned,  with  each  point  of 
(a,  6)  as  left-hand  point,  suffices. 


CLOSED  AND  PERFECT  LINEAR  SETS. 

80.    The  following  theorem  may  be  stated: 

Those  points  of  the  linear  segment  (a,  b)  which  are  not  points  of  a  given  set 
of  non-overlapping  open  intervals  form  a  closed  set  of  points. 

This  theorem  is  a  particular  case  of  the  theorem  proved  in  §  56,  that  the 
complementary  set  of  an  open  set,  relatively  to  a  closed  interval  in  which  it  is 
contained,  is  a  closed  set.  In  the  present  case  this  open  set  consists  of  the 
points  of  a  set  of  non-overlapping  open  intervals.  In  case  a,  or  6,  is  a  point  of 
the  set  of  open  intervals,  it  is  regarded  as  an  interior  point. 

Conversely,  it  has  been  shewn  in  §  56,  that 

Every  closed  set  of  points  in  the  linear  segment  (a,  b)  is  the  complement  of 
a  non- overlapping  set  of  open  intervals. 

In  accordance  with  the  classification  given  in  §  72  of  the  points  which  do 
not  belong  to  a  set  of  open  intervals,  it  appears  that: 

The  most  general  linear  closed  set  of  points  in  an  interval  (a,  b)  consists  of 
(1)  the  end-points  of  a  set  of  non-overlapping  intervals,  (2)  limiting  points 
of  such  end-points,  and  (3)  the  points  interior  to  intervals  every  point  of  which 
belongs  to  the  closed  set. 

The  open  intervals  belonging  to  the  set  C  (G),  complementary,  relatively  to 
(a,  6),  to  a  given  closed  set  G  in  (a,  b)  are  said  to  be  the  intervals  contiguous  to,  or 
complementary  to,  the  said  G.  The  set  of  all  such  intervals  may  be  spoken  of  as 
the  set  of  contiguous  intervals,  or  complementary  intervals,  for  the  closed  set  G. 

If  the  set  G  is  non-dense,  no  interval  exists  in  (a,  b)  which  consists  entirely 
of  points  of  (G) ;  and  the  set  of  contiguous  intervals  is  then  everywhere  dense 
in  (a,  6),  since  no  interval  can  be  determined  in  (a,  b)  so  as  to  contain  no 
points  of  the  set  of  contiguous  intervals.  Thus : 

Every  linear  non-dense  closed  set*  in  an  interval  (a,  b)  consists  of  the  end- 
points  of  the  intervals  of  an  everywhere  dense  set  of  open  intervals  and  of  the 
limiting  points  of  such  end-points. 

*  This  relation  between  everywhere  dense  sets  of  intervals  and  closed  sets  was  discovered  by 
Du  Bois  Beymond  and  by  Harnack.  See  Du  Bois  Reymond's  Allgemeine  Functionentheorie  (1882), 
p.  188;  also  Math.  Annalen,  vol.  xvi,  p.  128,  where  everywhere  dense  sets  of  intervals  are  intro- 
duced. See  also  Harnack,  Math.  Annalen,  vol.  xix,  p.  239,  and  Bendixson,  Acta  Math.  vol.  n, 
p.  416,  and  Ofv.  af  Kvemk.  Vet.  Fl>rh.,  vol.  xxxix,  2,  p.  31.  Proofs  of  the  fundamental  theorems 
based  on  the  amalgamation  of  abutting  intervals  have  been  given  by  W.  H.  Young,  Proc.  Land. 
Math.  Soc.,  Ser.  2,  vol.  i,  p.  240,  and  by  Schoenflies,  Gottinger  Nachrichten,  1903. 
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The  most  general  type  of  closed  set  is  obtained  by  adding  to  a  non-dense 
closed  set  all  the  points  of  some  of  the  contiguous  intervals. 

The  points  of  a  non-dense  closed  set  G  consist  in  general  of  three  classes: 

(1)  those    which  are   common   end -points   of  two   contiguous   intervals 
abutting  on  one  another ; 

(2)  semi-external  points  of  the  set  of  contiguous  intervals;    i.e.  points 
which  are  end-points  of  one  interval,  and  also  limiting  points,  on  one  side,  of 
end-points;  and 

(3)  external  points,  i.e.  such  as  are  not  end-points  of  any  contiguous 
interval,  but  are  limiting  points,  on  both  sides,  of  such  end-points. 

The  point  a,  or  the  point  6,  if  it  belongs  to  G,  may  be  regarded  as  belonging 
t«  (1)  or  (3),  according  as  it  is,  or  is  not,  an  end-point  of  a  contiguous  interval. 

Those  points  which  belong  to  (1)  are  clearly  isolated  points  of  G.  Hence, 
if  no  such  points  exist,  every  point  of  G  is  a  limiting  point;  and  therefore  G 
is  perfect.  It  follows  that: 

Every  non-dense  perfect  linear  set  G  consists  of  the  end-points  of  an  every- 
where dense  set  of  non-overlapping  intervals  (contiguous  to  G),  no  two  of  which 
abut  on  one  another,  together  with  the  limiting  points  of  these  end-points. 

If  the  closed  set  G  is  such  that  no  semi-external  points  exist,  then  every 
contiguous  interval  abuts  on  another  one  at  both  its  ends.  In  this  case,  all 
the  points  of  G  are  either  end-points  of  adjacent  intervals,  or  limiting  points, 
on  both  sides,  of  a  sequence  of  such  end-points;  unless  a  or  6  be  a  limiting 
point,  in  which  case  it  belongs  to  G.  The  end-points  have  the  same  cardinal 
number  a  as  the  rational  numbers,  since  the"  set  of  intervals  is  enumerable. 
Moreover  the  external  points  form  a  finite  set,  or  an  enumerable  set;  because 
to  each  such  external  point  there  corresponds  an  enumerable  set  of  end-points 
of  which  it  is  the  limiting  point,  and  in  this  correspondence  any  one  end- 
point  can  correspond  to  at  most  two  limiting  points,  one  on  each  side  of  it. 
We  thus  have  the  theorem  that: 

A  non-dense  closed  set  is  enumerable  if  its  complementary  intervals  are  such 
that  every  one  of  them  abuts  on  another  one  at  each  of  its  ends. 

The  condition  here  stated  is  sufficient,  but  not  necessary,  in  order  that  the 
non-dense  closed  set  should  be  enumerable.  For  example,  let  the  contiguous 
intervals  of  a  set  G  in  the  interval  (0,  1)  consist  of  the  open  intervals 

(0,  5 )  and  ( s  +  -  s  +  -    -  V  where  n  =  1,  2,  3,  . 

V      2J  \2     n  +  2    2      n  +  l/ 

The  set  G  is  enumerable,  but  the  interval  ( 0,  -  ]  does  not  abut  on  another 

\      «/ 
interval. 

81.  Every  non-dense  linear  closed  set  is,  in  general,  made  up  of  an  enu- 
merable set  and  of  a  perfect  set. 

Let  the  intervals  complementary  to  the  set  G  be  arranged  in  enumerable 
H.  S 
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order,  that  of  descending  magnitude;  we  may  denote  them  by  S^  82,  ...  8n,  .... 
If  G  is  not  perfect,  it  contains  isolated  points,  each  of  which  is  the  common 
end-point  of  two  adjacent  intervals;  let  8Pl  be  the  first  of  the  intervals  {8}  at 
an  end  of  which  there  is  such  a  point;  let  8P>  be  the  interval  which  abuts  on 
8Pl  at  that  end.  It  may  happen  that  the  other  end-point  of  8P>  is  also  a 
common  end-point  of  two  intervals.  If  so,  let  8p->  be  the  interval  which  abuts 
on  8P',  and  so  on:  after  a  finite,  or  enumerable,  set  of  such  intervals 

Bp',  BP",  Bp'"  ... 

we  must  arrive  at  an  interval  of  which  the  end-point  does  riot  belong  to  Gt, 
the  set  of  isolated  points  of  G,  or  else  at  an  end-point  of  the  domain  of  G; 
unless  G  is  an  enumerable  set.  It  may  happen  that  8Pl  at  its  other  end  abuts 
on  another  interval;  in  that  case  we  proceed,  in  the  same  manner  as  before, 
to  find  the  intervals  Sq>,  Sq»,  ...  each  of  which  abuts  on  another  one.  Now 
conceive  all  the  intervals  8P>,  8P",  ...,  and,  if  they  exist,  8q',  8q",  ...  to  be  amal- 
gamated with  8Pl  into  one  interval  8Pl(l\  by  removing  all  the  common  end- 
points.  If  any  isolated  points  of  G  now  remain,  let  8P2  be  the  first  interval  of 
[8\  after  8Pl,  of  which  an  end-point  is  such  a  point;  proceed  as  before,  we 
then  have  an  interval  8Pz(1)  formed  by  amalgamating  a  finite  or  enumerable 
set  of  intervals.  We  proceed  in  this  way,  and  thus  form  a  set  of  intervals 
8Pl{1),  8P2(1\  ...  no  end-points  of  which  are  points  of  G,. 

Since  G  =  Gt  +  G(l),  where  G(l)  is  the  derivative  of  G,  the  set  of  intervals 
{3(1)[  complementary  to  6r(1)  consists  of  the  intervals  8Pl(l},  8P2(1},  ...  and  of  any 
intervals  {8}  which  remain  after  such  intervals  as  Sp>,  8P",  ...  8q>,  <y,  ...  have 
been  removed,  and  the  8p(l}  substituted  for  the  Bp. 

We  proceed  in  a  similar  manner  with  G(l}  =  Gt(l}  +  6r(2),  again  removing  a 
finite  or  enumerable  number  of  the  set  [8(1)],  and  again  with  G{~\  and  so  on. 
It  may  happen  that  the  process  comes  to  an  end  after  a  number  n  of  such 
stages,  either  if  Gt(n}  does  not  exist,  in  which  case  G(n)=  G(n+1},  and  thus  G{n)  is 
perfect;  or  else,  if  G(n)  does  not  exist,  in  which  case  G,  being  the  sum  of  a 
finite  number  of  enumerable  sets  2Crt(r),  is  itself  enumerable.  If  the  process 
does  not  come  to  an  end  for  any  finite  value  of  n,  we  form  the  derivative 
GM  =  D(GM,  G(>2},  ...  G(n\  ...),  which  contains  all  the  points  common  to  all 
the  derivatives  of  G  of  finite  order.  This  set  has  been  shewn,  in  §  68,  to  exist, 
and  to  be  a  closed  set;  6r(to)  is  then  resolved  as  before  into  GL(<a)  +  G(<a+l),  and 
we  proceed  further  as  before. 

We  obtain,  by  proceeding  in  this  manner, 


where  (3  is  a  number  of  the  first  or  second  class.  It  will  now  be  shewn  that 
there  must  be  some  definite  number  ft  of  the  first  or  second  class,  for  which 
this  process  comes  to  an  end,  either  by  G^  containing  no  points,  in  which 
case  GW=  G(<*+1],  so  that  G(&  is  perfect;  or  else  by  G(l*+l}  containing  no  points, 
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in  which  case  G,  being  the  sum  of  an  enumerable  set  of  finite,  or  enumerable, 
sets,  is  itself  enumerable.  The  {8}  contain  all  the  indices  1,  2,  3,  ...  n,  ...; 
from  these  indices  we  must  remove  a  finite,  or  an  enumerably  infinite  number, 
to  obtain  those  indices  which  occur  in  the  {S(1)j;  and  again  an  enumerable  set 
of  indices  must  be  removed  from  those  which  occur  in  the  {S'%  to  obtain 
those  which  occur  in  the  {8(2)}.  Now  as  the  indices  1,  2,  3,  ...  n,  ...  are  enu- 
merable, the  process  of  removing  successively  a  finite,  or  enumerably  infinite, 
set  of  them  must  cease  for  some  order  ft  of  8{*\  for  otherwise  a  more  than 
enumerable  infinity  of  indices  could  be  removed  from  the  set  1,  2,  3,  ...  n,  ..., 
which  is  impossible;  hence,  for  some  fixed  number  ft  of  the  second  class,  all  the 
indices  must  have  been  removed. 

It  has  thus  been  shewn  that,  unless  the  given  set  G  is  enumerable,  for 
some  number  ft  of  the  first  or  second  class,  G(ft}  =  G(ft+l);  and  therefore  Gw  is 
perfect..  Thus  G  has  been  resolved  into  an  enumerable  set  and  a  perfect  one, 
which  may  be  called  the  nucleus  of  G. 
'   If,  for  any  value  of  ft,  Gw  =  0,  the  set  G  is  enumerable. 

82.  The  following  theorem*,  more  general  than  that  of  §  81,  includes  the 
latter  as  a  particular  case.  The  proof  here  given  may  be  taken  as  alternative 
to  that  of  §81. 

If  PI,  P2,  ...  Pn,  •••  Pp,  ...  Pa,  •  ••  are  all  closed  linear  sets  of  points  such 
that  (1)  if  tti  <  a2,  all  the  points  of  Paz  belong  to  Pai,  and  (2)  if  in  any  interval, 
any  set  Pa  contains  only  a  finite  number  of  points,  the  set  Pa+l  contains  no 
points  in  that  interval;  then  either  Pp  must  vanish  for  some  definite  number  ft 
of  the  first  or  second  class,  or  else  there  is  a  definite  number  ft  such  that  Pp  is 
a  perfect  set. 

If,  for  some  number  ft,  the  set  Pp  vanishes,  then  Py  vanishes  for  all  values 
of  7  which  are  >  ft. 

Let  us  now  suppose  that  there  exists  no  number  ft  such  that  Pp  vanishes. 
In  this  case  there  exists  a  set  of  points,  which  may  be  denoted  by  Pn,  such 
that  each  point  of  the  set  belongs  to  Pp,  whatever  number  ft  may  be.  The 
set  Pa  is  closed,  for  if  p  be  a  limiting  point  of  the  set,  in  an  arbitrarily 
small  neighbourhood  there  are  points  of  Pp,  whatever  number  ft  maybe; 
hence  p  belongs  to  Pp,  whatever  ft  may  be,  and  thus  p  itself  belongs  to  Pn. 

It  will  now  be  shewn  that  Pn  contains  no  isolated  points,  and  is  therefore 
dense  in  itself.  If  Pn  contains  an  isolated  point  p,  a  neighbourhood  of  p  can 
be  found  which  contains  no  point  of  Pn  except  p\  let  Q  be  that  part  of  Pl 
which  is  contained  in  this  neighbourhood.  In  the  neighbourhood  considered, 
let  us  suppose  a  sequence  of  intervals  8l}  8.,,  ...  8n,  ...  constructed,  each  one- 
containing  the  next  one  and  the  point  p,  and  such  that  8n  converges  to  zero 
as  n  is  indefinitely  increased.  Let  Q(n]  denote  that  part  of  Q  which  lies  in  8n 

*  See  Baire,  Annali  di  Mat.  (3),  vol.  in,  p.  46  et  aeq. 
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but  not  in  S»+1>  then  Q  =  QM  +  Q(2)  + ...  +  QM  4- . . .  +  p.  There  must  exist  a 
number  &,  of  the  first,  or  of  the  second,  class,  for  which  Q(l}  contains  no  point  of 
Pp  ;  otherwise  Qi  would  contain  points  which  belong  to  Pn,  and  this  is  not  the 
case.  Similarly,  there  exist  numbers  ftz,  ftz,  ...  j3n,  •••  such  that  Q(2)  contains 
no  points  of  P/32,  and  Q(3)  contains  no  points  of  Ppa,  and  so  on.  Of  the  numbers 
&,  /32,  ...  ftn,  •••,  let  7i  be  the  first  which  is  >  ft*,  then  let  y2  be  the  first 
which  is  greater  than  7,,  and  so  on;  we  have  therefore  a  sequence  <yl}  72,  ... 
7n,  ...  of  increasing  numbers  all  of  which  belong  to  the  set  ftlt  ft.2,  •••  Pn>  •••• 
This  sequence  71,  72,  ...  yn,  ...  is  either  finite,  with  say  7  as  the  last,  or  else 
there  is  a  limiting  number  7  of  the  second  class  which  is  greater  than  all  of 
them,  and  therefore  greater  than  all  the  numbers  fli,  ftz,  •••  ftn,  —  The  set 
Q  can  have  no  point,  except  p,  which  belongs  to  Py,  hence,  since  Pv  contains 
only  one  point  in  a  certain  interval,  Py+i  contains  no  point  in  that  interval, 
and  does  not  contain  p ;  which  is  contrary  to  the  hypothesis. 

It  has  now  been  shewn  that  Pn  is  closed  and  dense  in  itself;  it  is  there- 
fore perfect.  Let  us  next  consider  the  enumerable  set  of  intervals  which  are 
complementary  to  Pn.  For  any  one  of  these  intervals  there  exists  a  number 
7  such  that  Py  contains  no  point  in  the  interior  of  the  interval.  As  before,  it 
is  seen  that  there  exists  a  number,  of  the  first  or  the  second  class,  which  is 
greater  than  all  these  numbers  7;  if  this  number  be  ft,  the  set  Pp  contains  no 
points  which  do  not  belong  to  Pn.  It  is  thus  seen  that  Pp  is  perfect,  and 

P/3  =  P0+1  =  . . .  =  PQ. 

The  theorem  has  now  been  completely  established. 

If,  in  the  above  theorem,  the  condition  (2)  be  omitted,  the  set  Pn  is  closed 
but  not  necessarily  perfect.  The  set  Pp,  as  before,  contains  no  points  that  do 
not  belong  to  Pn;  it  then  follows  that  P^  =  P^+1  =  P0+2,  = 

We  can  thus  state  the  following  theorem: 

If  PJ,  P2,  ...  Pn,  ...  Pp,  ...  Pa,  ...  are  all  closed  sets  such  that  any  one 
contains  all  the  others  with  a  higher  index;  then,  either  Pp  vanishes  for  some 
definite  number  ft  of  the  first,  or  the  second,  class,  or  else  there  is  a  definite 
number  ft,  from  and  after  which  all  the  sets  are  identical. 

83.  Every  perfect  linear  set*  has  the  cardinal  number  c  of  the  continuum; 
and  every  closed  infinite  set  has  the  cardinal  number  c,  or  else  the  cardinal 
number  a  of  the  rational  numbers. 

Let  the  intervals  whose  internal  points  are  the  set  C(G),  the  complement 
of  the  perfect  set  G,  be  denoted  by  {8} ;  and  let  A  denote  the  greatest,  or  one 
of  the  greatest  in  case  of  equality,  of  the  intervals  {SJ.  Let  I,  the  whole 
interval  (a,  b)  in  which  G  lies,  be  divided  into  the  three  parts  10,  A,  lr  so  that 
I  =  IQ  +  A  +  li,  where  1Q  is  on  the  left,  and  ^  on  the  right  of  A,  the  greatest 

*  Cantor,  Math.  Annalcn,  vol.  xxm,  pp.  48G — 488. 
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interval  of  {8}.  Denote  the  greatest  of  the  intervals  {8}  in  10,  by  A0,  and  the 
greatest  in  /,,  by  A,;  then  the  interval  /„  is  divided  by  means  of  AO  into  three 
parts  /oo.  A0,  I0i  in  order  from  left  to  right,  and  the  interval  ^  is  divided  by 
means  of  Aj  similarly  into  lw,  A,,  ln.  Proceeding  in  this  manner  to  a  further 
subdivision,  let  Aw  be  the  greatest  of  the  intervals  \8]  which  lie  in  Ipq,  where 
p,  q  each  has  one  of  the  values  0  or  1;  then  lpq  is  divided  into  three  parts  ^Oi 
&pq>  lpq\\  and  so  on  indefinitely.  The  intervals  {8}  are  thus  arranged  in  the 
order  A,  A0,  A!,  AM,  A0i,  A10,  An,  ...  and  each  interval  of  {8}  occurs  at  a 
definite  place  in  the  sequence.  Consider  a  sequence  of  intervals 

')   '/> >  "pq >  "pqr ,   '•• , 

where  p,  q,  r,  ...  all  have  definite  values,  each  of  which  is  either  0  or  1.  Each 
of  these  intervals  is  contained  in  the  preceding  one,  and  has  one  end-point  in 
common  with  it;  and  the  sequence  determines  a  single  point  P  which  is 
interior  to  all  the  intervals  of  the  sequence,  unless,  from  and  after  some  fixed 
index,  all  the  indices  are  identical,  in  which  case  P  is  a  common  end-point  of 
all  the  intervals  after  a  fixed  one.  Hence,  since  the  point  P  is  not  interior  to 
any  of  the  intervals  {8},  it  is  a  point  of  G.  Conversely,  every  point  of  G  can 
be  so  determined  by  means  of  a  sequence  of  intervals;  for  every  point  of  G 
belongs  either  to  10  or  to  llf  and  also  to  one  of  the  four  intervals  £00.  ^01,  ^o>  ^n» 
and  so  on.  The  point  P  is  the  limiting  point  of  the  end-points  of  the 
intervals  A^,  AOT,  AMr,  ...  with  the  indices  the  same  as  those  of  the  sequence 
lp>  lpq,  Ipyr,  •••  which  determines  the  point. 

Every  number  of  the  continuum  (0,  1)  is  expressible  in  the  dyad  scale  by 
means  of  a  sequence  'p,  'pq,  'pqr,  ...,  where  each  of  the  numbers  p,  q,  r,  ...  is 
either  0  or  1 ;  and  all  numbers  are  expressed  uniquely  in  this  mamier,  except 
those  for  which  all  the  digits  after  some  fixed  one  are  1,  these  numbers  being 
also  expressible  by  a  sequence  in  which  only  0  occurs  after  some  fixed  place. 
The  numbers  last  mentioned  correspond  as  indices  oflp,  lpq,  lpqr,  ...  to  a  point 
of  G  which  is  an  end-point  of  one  of  the  intervals  {8};  but  in  every  other  case 
a  number  in  the  dyad  scale  corresponds  to  a  point  of  G  which  is  not  an  end- 
point  of  the  intervals  {8}.  Since  the  set  of  numbers  of  the  continuum  (0,  1) 
has  the  cardinal  number  c,  it  follows  that  the  points  of  G  form  a  set  of  the 
same  cardinal  number,  because  each  point  of  G  corresponds  uniquely  to  a 
single  number  of  the  continuum,  except  that  two  points  of  G  which  are  end- 
points  of  one  contiguous  interval  correspond  to  a  single  number  of  the  con- 
tinuum. Every  closed  set  which  is  not  enumerable  has  been  shewn  to  contain 
a  perfect  set  as  component;  such  a  set  has  therefore  the  cardinal  number  c. 

It  will  appear  from  the  theory  of  order-types  which  will  be  discussed  in 
Chapter  IV  that  the  set  of  intervals  (Sj  which  define  a  perfect  set  G,  when 
taken  in  their  order  of  position  from  left  to  right,  have  an  order-type  which 
is  the  same  as  ?;  the  order-type  of  the  rational  numbers  which  lie  between 
0  and  1,  excluding  0  and  1  themselves,  taken  in  their  natural  order  in  the 


118  Sets  of  Points ;  descriptive  properties  [CH.  n 

continuum.  It  follows  that  a  correspondence  can  be  established  between  the 
intervals  and  the  rational  numbers,  in  which  any  two  intervals  correspond 
to  two  rational  numbers  that  have  the  same  order.  If  we  take  each  rational 
number  to  correspond  to  the  end-points  of  the  corresponding  interval,  then 
each  irrational  number  corresponds  to  a  point  of  G  which  is  a  limiting  point 
of  end-points  of  intervals. 


EXAMPLES. 

1.     Let  x  be  a  number  given  by  x  =  ^  +  ~>  +  ^,+  •••  +^+  ••-,  where  the  numbers 

o        o          o  o 

ci>  C2)  •••  cni  •••  have  each  one  of  the  values  0,  2.    The  set  {x\  is  a  non-dense  perfect  set. 
No  number  of  the  set  lies  between 

C-l+^+         4.£j._i_       ^  ,r^  +  ^-2+          i-i 

3        32  "r3n-r3n  +  1^3n  +  2'r  "3  ~T  32  +  +  3«> 


i        9 
and  J  +  _|+  ... 

these  two  numbers  determine  a  complementary  interval  of  the  set,  the  interval  being  of 
length  —  .  The  number  of  complementary  intervals  of  length  —  is  2n~1,  hence  the  sum 

o  o 

oo     Qn  —  l 

of  all  the  complementary  intervals  is   2  ,  which  is  unity.     It  is  clear  that  the  set  of 

n=l     3™ 

complementary  intervals  is  everywhere  dense,  and  thus  the  set  of  points  is  non-dense. 
This  example  was  constructed*  by  Cantor,  and  is  the  first  example  of  a  perfect  non-dense 
set  which  has  been  purposely  constructed. 

2.  Let  us  suppose  that  the  numbers  of  the  interval  (0,  1)  are  expressed  in  the  dyad 
scale,  in  the  form  a^a^a^  ...  an  ...  ;  where  each  a  is  either  0  or  1.      Each  number  for 
which  the  a's  all  vanish,  after  some  fixed  one  an,  which  must  be  1,  is  also  representable  as 
an  unending  radix  fraction,  in  which  an  is  0,  and  all  the  subsequent  digits  are  1.     Let  the 
numbers  now  be  interpreted  as  if  they  were  in  the  decimal  scale.      To  each  irrational 
number  in  the  dyad  scale,  there  corresponds  a  single  number  in  the  decimal  scale,  repre- 
sented by  the  same  digits.     Of  each  rational  number^  the'rV  is*  a'double  representation'  in 
the  dyad  scale,  and  there  correspond  two  numbers  in  the  decimal  scale,  which  define  a 
complementary  interval  of  the  set  of  points  which  represents  the  numbers  in  the  decimal 
scale.     A  perfect  non-dense  set  of  points  is  thus  defined. 

3.  Taking  a  positive  integer  m  (>  2),  let  the  interval  (0,  1)  be  divided  into  m  equal 
parts,  and  exempt  the  last  part  from  further  subdivision.     Divide  each  of  the  remaining 
m-l  intervals  into  m  equal  parts,  and  in  each  case  exempt  the  last  part  from  further  sub- 
division.    Let  this  operation  be  continued  indefinitely.     The  points  of  division  form  a 
non-dense  set  ;  for  if  an  interval  d  be  taken  anywhere  in  the  interval  (0,  1),  k  may  be  so 

chosen  that  —  .  <  —  ,  and  a  segment  [  —  i.  —   e-  )  entirely  within  d,  can  be  determined.   This 
m"      2  \mk      mk  ) 

segment  is  either  an  exempted  interval,  or  its  with  part  is  one.     The  end-points  of  the 
*  See  Math.  Annalen,  vol.  xxi,  p.  590. 
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intervals,  together  with  their  limiting  points,  form  a  non-dense  closed  set*,  of  cardinal 
number  c. 

4.  As  in*  Ex.  3,  let  the  interval  (0,   1)  be  divided  into  m  equal  parts,  and  the 
last  be  exempted  from  further  division.     Then  let  the  remaining  m  -  1  parts  each  be 
divided  into  »i2  equal  parts,  the  last  of  each  being  exempted  from  further  division.     Let 
the  remaining  parts  be  then  divided  into  m3  equal  parts,  the  last  of  these  in  each  case 
being  exempted  from  further  division.      If  this  process  be  carried  on  indefinitely,  the 
end-points  of  the  divisions,  together  with  their  limiting  points,  form  a  non-dense  closed 
set,  of  cardinal  number  c. 

5.  Let  /?!,  £2, ...  ^n,  ...  be  a  sequence  of  positive  integers,  each  of  which  is  greater  than 
unity,  and  defined  according  to  any  law. 

It  can  be  shewn  t  that  every  irrational  number  x,  in  (0,  1),  can  be  uniquely  represented 
in  the  form 

"l  c'l  cn 

i;  —  —  4- L.  4.         4. iL L. 

I-  ~TTL~  "'  ~  If  If         I-  T  •  •  •  > 

K j         KI  K%  KI  K%  . . .  A/n 

where  cn  <  kn,  and  not  all  of  the  numbers  c,,,  cn  +  1, ...  are  zero,  for  any  value  of  n. 
It  can  further  be  shewn  that 


where  rjn  =  Kn-  I  -cn.  If,  from  and  after  a  certain  value  of  n,  the  condition  cn  =  kn-  1  is 
always  satisfied,  then  all  the  rjn  vanish,  and  x  is  rational.  It  thus  appears  that  these  rational 
numbers  are  capable  of  a  double  representation  in  the  form 


(1)  by  the  vanishing  of  all  the  c,  after  some  fixed  one,  and  (2)  by  the  condition  cn  =  kn-  1 
being  satisfied,  from  and  after  some  fixed  value  of  n. 

If  we  now  take  those  values  of  x,  for  which  every  c  does  not  exceed  some  fixed  integer  X, 
these  values  of  x  form  a  non-dense  perfect  set  <7A.  It  is  easily  seen  that  the  interval  of 
which  the  end-points  are 

Ci         Co                       cn                      X  X 

-1 +-j-^-+ ...  H - \-  . f-  f—  — 


contains  no  points  of  the  set  in  its  interior,  although  these  points  belong  to  the  set. 
A  particular  case  of  this  set  consists  of  the  numbers  given  by 


where  every  c  is  ^9.     This  set  consists  of  the  transcendental  numbers  first  defined  by 
LiouvilleJ. 

*  See  H.  J.  S.  Smith,  Proc.  Land.  Math.  Soc.  (1),  vol.  vi,  pp.  147,148. 
t  Brod^n,  Math.  Ann.,  vol.  LI,  p.  299. 
J  Liouville's  Journal,  vol.  xvi,  p.  133. 
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PROPERTIES  OF  THE  DERIVATIVES  OF  LINEAR  SETS. 

84.  If  a  set  is  dense  in  any  sub-interval  of  the  interval  in  which  it  is  con- 
tained, its  derivative  GM  contains  every  point  of  the  sub-interval,  and  is  iden- 
tical, so  far  as  such  sub-interval  is  concerned,  with  the  totality  of  the  points  of 
the  sub-interval ;  we  confine  ourselves  therefore  to  the  case  in  which  G  is  a 
non-dense  set,  and  consequently  its  derivatives  are  also  non-dense. 

The  derivatives  of  transfinite  orders  have  been  defined  in  §  68 ;  and  it  was 
there  shewn  that  there  is  either  a  last  derivative,  whose  order  is  some  number 
of  the  first  class,  or  non-limiting  number  of  the  second  class  ;  or  else  that  de- 
rivatives of  all  such  orders  exist,  and  have  a  set  of  points  G(fl}  in  common. 

It  was  shewn  in  §  68,  that  G(l]  being  a  non-dense  closed  set,  two  cases 
arise  : 

(1)  If  G(l)  is  enumerable,  in  which  case  G  is  also  enumerable,  then  G(?] 
vanishes  for  some  number  (3  of  the  first  or  the  second  class.    A  set  with  an 
enumerable  derivative  is  called  a  reducible  set. 

(2)  If  Gw  is  not  enumerable,  then  there  exists  some  number  /3,  of  the  first 
or  second  class,  for  which  G(&}  is  a  perfect  set,  and  is  consequently  identical 
with  G(fi+v,  and  with  £<n>  as  defined  in  §  68.    The  set  Gw  is  the  sum  of  an 
enumerable  set  and  the  perfect  set  G®}.    A  set  G  which  has  this  property,  is 
said  to  be  irreducible. 

It  should  be  observed  that,  when  G(l]  is  unenumerable,  and  consequently  of 
cardinal  number  c,  the  same  as  the  cardinal  number  of  its  perfect  component, 
we  are  unable  to  make  any  inference  as  to  the  cardinal  number  of  G  itself. 
This  may  be  a  or  c,  or  other  cardinal  number  between  the  two,  in  case  such  a 
number  exists. 

CLOSED  SETS  IN  TWO  OR  MORE  DIMENSIONS. 

85.  In  considering  the  properties  of  closed  sets  of  points  in  space  of  two 
or  more  dimensions  it  is  sufficient  to  treat  in  detail  the  case  of  plane  sets  only, 
because  these  sets  exhibit  sufficiently  clearly  the  respects  in  which  linear  closed 
sets  differ  from  other  closed  sets.  The  results  obtained  for  plane  closed  sets 
can  immediately  be  extended  to  the  case  of  such  sets  in  any  number  of 
dimensions.  In  the  case  of  a  linear  set,  each  point  P  which  does  not  belong 
to  a  given  closed  set  is  enclosed  in  an  open  interval  which  contains  no  points 
of  the  set,  and  this  interval  has  a  maximum  length  in  both  directions,  the 
end-points  of  such  maximum  interval  8  being  points  of  the  closed  set  (or  an 
end-point  of  the  fundamental  interval),  and  this  maximum  interval  is  identical 
Avith  B,  for  all  points  interior  to  B.  But,  in  the  case  of  a  plane  set,  if  we  con- 
fine ourselves  to  areas  of  given  form  and  orientation,  such  as  rectangular  cells, 
and  these  take  the  place  of  the  linear  intervals  B,  it  is  not  the  case  that  a 
closed  set  is  defined  as  the  set  of  boundary  points,  together  with  the  external 
points,  of  a  unique  system  of  such  cells. 
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If  P  be  a  point  which  does  not  belong  to  a  given  plane  closed  set  G,  in  a 
fundamental  cell,  and  if  we  draw  through  P  a.  straight  line  parallel  to  the  line 
whose  equation  is  y  =  mx,  then  those  points  of  the  given  set  which  lie  on  this 
straight  line  form  a  closed  set  (see  §  56),  and  the  point  P  must  be  interior  to 
an  interval  A,n  (P)  contiguous  to  this  closed  set.  If,  on  either  side  of  P,  there 
are  in  this  straight  line  no  points  of  G,  then  on  this  side  the  extremity  of  the 
interval  Am  (P)  may  be  regarded  as  the  point  in  which  the  straight  line 
intersects  a  side  of  the  fundamental  cell.  The  interval  ATO  (P)  exists  for  every 
value  of  m,  and  the  extremities  of  the  interval,  for  a  fixed  P,  are  points  of  the 
plane  set,  or  points  on  the  boundary  of  the  fundamental  cell.  The  region  of 
plane  space  Ap,  defined  by  all  these  intervals,  for  every  value  of  m,  when  that 
straight  line  is  included  for  which  x  has  a  constant  value,  is  free  in  its  interior 
from  points  of  G.  Such  a  region  AOT  (P)  may  be  regarded  as  the  analogue,  for 
plane  sets,  of  the  interval  8  (P)  for  the  case  of  linear  sets.  But  the  analogy  is 
not  complete,  for  if  Q  be  an  interior  point  of  Am  (P),  it  is  not  necessarily  the 
case  that  Am(Q)  is  identical  with  Am(P). 

If  however  we  work  only  with  rectangular  cells,  there  exists  in  general  no 
unique  rectangular  cell,  corresponding  to  a  point  P  of  C  (G),  which  is  such 
that  no  interior  point  of  it  is  a  point  of  G,  and  that  on  its  boundary  there  is  a 
point  of  G.  If  we  describe  a  square  cell  of  sides  2p  with  its  centre  at  P,  and 
containing  in  its  interior  and  on  its  boundary  no  point  of  G,  we  may  keep 
three  of  the  sides  fixed,  and  move  the  fourth  parallel  to  itself  until  it  either 
contains  a  point  of  P,  or  becomes  coincident  with  one  of  the  sides  of  the 
fundamental  rectangle.  We  may  then  proceed  to  move  another  side  parallel 
to  itself  until  the  same  thing  happens,  and  so  on  with  the  other  two  sides. 
The  resulting  rectangle  will  in  general  depend  upon  the  order  in  which  the 
moving  of  the  four  sides  takes  place.  Thus  we  obtain,  in  general,  no  unique 
rectangular  cell  corresponding  to  the  point  P. 


86.  It  is  however  possible,  for  a  given  closed  non-dense  plane  set  G,  to 
construct  an  enumerable  set,  not  unique,  of  cells  which  is  everywhere  dense, 
and  such  that  every  point  of  G  lies  on  the  boundary  of  a  rectangle,  or  is  a 
limiting  point  of  points  which  lie  on  the  boundaries  of  such  cells.  Let  us 
denote  by  S  the  fundamental  cell  in  which  the  whole  set  G  lies,  and  let  8  be 
a  cell  constructed,  as  above,  for  a  point  P  of  the  set  C(G).  Produce  the  sides 
of  8,  when  necessary,  until  they  cut  the  sides  of  S,  thus  dividing  $  into  at  most 
nine  different  rectangles,  of  which  one  is  8,  and  the  others  may  be  denoted  by 
Sr,  where  r  =  1,  2, ...  8.  In  each  rectangle  S,.  take  any  point  Pr  which  does  not 
belong  to  G,  and  construct  for  Pr  a  free  rectangle  8r,  as  before;  let  the  sides  of  8,. 
be  produced,  when  necessary,  until  they  meet  the  sides  of  Sr,  then  *Sy  is  divided 
into  at  most  nine  rectangles,  which  consist  of  8r  and  at  most  eight  rectangles 
Srs,  where  s=l,  2, ...  8. 
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Proceeding  in  this  manner,  we  obtain  a  set  of  rectangles 

8,  Sr,  Srs,  Srgt, ..., 

and  in  them  a  set  of  rectangles  8,  Br,  8rs,  Brst,  ...,  each  of  which  contains  no 
points  of  G  in  its  interior ;  each  of  the  numbers  r,  s,  t,  ...  being  one  of  the  digits 
1,  2,  3,  ...  8.  If  p  be  a  point  of  0  which  is  not  on  a  boundary  of  any  rect- 
angle Sn,  it  must  be  in  the  interior  of  each  of  an  unending  sequence  of  rectangles 
Sr,  Srg,  Srgt,  ...,  where  r,  s,  t,  ...  have  definite  values;  and  this  set  of  rect- 
angles must  converge  either  (1)  to  a  point  in  the  interior  of  all  of  them,  or 
(2)  to  a  linear  interval,  or  (3)  to  a  definite  rectangle  &„  in  the  interior  of  all 
of  them.  In  case  (1),  the  point  to  which  the  rectangles  converge  is  a  limiting 
point  of  those  points  of  G  which  lie  on  the  boundaries  of  the  definite 
sequence  of  rectangles  Sr,  &rs,  8rst,  —  In  case  (2),  there  must  be,  in  the 
limiting  linear  interval,  at  least  one  point  which  is  a  limiting  point  of  G :  for, 
if  not,  the  whole  interval  could  be  enclosed  in  a  rectangle  which  contains  no 
points  of  G ;  and  this  is  impossible.  In  case  (3),  we  start  with  the  rectangle 
S<a,  and  take  a  point  Pw  not  belonging  to  G  inside  it,  which  point  exists,  since 
G  is  non-dense,  and  construct  the  maximum  free  rectangle  8M.  Produce  its 
sides  as  before  to  meet  those  of  SM,  and  proceed  as  before  to  construct 
Siarst  . . .  and  8Mrst ....  This  process  can  be  continued  until  an  index  is  reached 
which  may  be  any  number  of  the  second  class,  but  the  point  p  must  be  reached 
before  some  definite  number  of  the  second  class  appears  as  index ;  this 
following  from  the  fact  that  the  number  of  non-overlapping  regions  which  are 
contained  in  a  given  space  must  be  enumerable.  Thus  the  point  p  is  reached 
after  an  enumerable  set  of  steps  of  the  process. 

It  has  therefore  been  shewn  that : 

If  G  is  a  non-dense  closed  plane  set  of  points,  an  everywhere  dense 
enumerable  set  of  rectangles  exists,  such  that  every  point  of  G  is  on  a  boundary 
of  one  or  more  of  the  rectangles,  or  is  a  limiting  point  of  such  points,  or  lies  in 
a  linear  interval  which  is  the  limit  of  a  sequence  of  the  rectangles. 

In  case  the  set  G  is  perfect,  the  rectangles  of  the  set  must  either  not  abut  on 
one  another,  or  every  common  side  must  contain  either  no  points  of  G,  or  else  a 
perfect  set  of  points  of  G. 

87.  That  a  perfect  plane  set  G  has  the  power  of  the  continuum*  may  be 
proved  by  projecting  the  points  of  (r  on  a  side  of  the  fundamental  rectangular 
cell.  The  linear  set  of  points  which  are  the  projections  of  G  is  a  closed  set. 
For  if  P  be  a  limiting  point  of  the  set  of  projects,  let  pp'  be  an  arbitrarily 
small  neighbourhood  of  the  set  of  projects,  of  which  P  is  the  centre ;  con- 
struct straight  lines  PQ,  pq,  p'q',  perpendicular  to  pp',  meeting  the  opposite 
side  of  the  fundamental  rectangle  in  Q,  q,  q'.  In  the  rectangle  pqq'p'  there 

*  See  Bendixson,  Bib.  Svensk.  Vet.  IlandL,  vol.  ix  (1884),  where  the  first  proof  of  this  theorem 
was  given. 
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are  an  infinite  number  of  points  of  0.  Let  a  symmetrical  system  of  nets,  with 
closed  meshes,  be  fitted  on  to  this  rectangle  pqq'p ',  the  number  m  (see  §  51), 
whose  square  is  the  number  of  meshes  in  /),,  being  taken  to  be  odd.  There  is 
in  at  least  one  mesh  of  /),  an  infinite  number  of  points  of  0  in  the  interior  or 
on  one  of  its  boundaries  parallel  to  pp;  and  one  such  mesh  at  least  must  exist 
with  its  centre  on  PP';  for  otherwise  P  could  not  be  a  limiting  point  of  the 
projection  of  G.  We  take  that  one  d,  of  such  meshes  which  is  nearest  to  pp'. 
Similarly  there  exists  a  mesh  dz,  of  D2,  contained  in  rf,,  with  its  centre  on  PP', 
and  containing  an  infinite  set  of  points  of  G.  Proceeding  in  this  manner,  we 
obtain  a  sequence  dly  d2,  ...  of  meshes  each  containing  the  next,  and  all  having 
the  same  property  as  dl.  These  define  a  point  on  PQ  which  must  be  a  limiting 
point  of  G,  and  therefore  belongs  to  G.  Therefore  P  is  a  point  of  the  projection 
of  G  ;  and  thus  the  projection  of  G  is  a  closed  set.  If  a  point  P  be  an  isolated 
point  of  the  projection  of  G,  we  may  as  before  construct  a  rectangle  pqq'p'  which 
contains  no  points  of  G  that  do  not  lie  on  PP.  The  component  of  G  in  this 
rectangle  must  be  perfect,  and  therefore  the  straight  line  PP'  contains  a 
perfect  component  of  G.  If  the  projected  set  is  perfect,  then  it  has  the  power 
c  of  the  continuum ;  and  if  it  contains  isolated  points,  these  must  be  the 
projections  of  perfect  linear  components  of  G ;  therefore,  in  either  case,  G  has 
the  power  of  the  continuum. 

It  is  clear  that  this  method  of  proof  can  be  extended  to  the  case  of  a  set 
in  any  number  of  dimensions. 

THE  ANALYSIS  OF  SETS  IN  GENERAL. 

88.  With  a  view  to  the  general  analysis  of  sets  of  points  in  any  number 
of  dimensions,  it  is  necessary  to  classify  the  points  of  a  given  set  according  to 
the  cardinal  number  of  those  points  of  the  set  that  are  in  the  arbitrarily  small 
neighbourhoods  of  the  various  points*. 

An  isolated  point  of  any  set  G  is  such  that  in  a  sufficiently  small  neigh- 
bourhood of  the  point  there  are  no  other  points  of  G.  For  this  reason  an 
isolated  point  may  be  said  to  be  of  degree  zero  in  the  set. 

A  limiting  point  of  G,  such  that  in  every  sufficiently  small  neighbourhood 
of  the  point  there  is  an  enurnerably  infinite  set  of  points  of  G,  is  said  to  be  a 
point  of  enumerable  degree  in  the  set,  or  of  degree  a  in  the  set.  Such  a  point 
may  or  may  not  belong  to  G. 

In  case  the  point  is  such  that,  in  every  neighbourhood  of  it,  there  exists  a 
set  of  points  of  G  that  has  the  cardinal  number  of  the  continuum,  the  point 
is  said  to  have  degree  c  in  the  set. 

*  An  analysis  of  this  kind,  for  sets  in  general,  was  given  by  Cantor,  Acta  Math.,  vol.  vn.  A 
more  elementary  presentation  of  the  matter,  without  the  use  of  transfinite  numbers,  has  been 
given  by  w.  H.  Young,  (Juart.  Journ.  of  Math.,  vol.  xxxv,  p.  102. 
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As  we  are  not  entitled  to  assert  that  every  infinite  set  of  points  has  either 
a  or  c  for  its  cardinal  number,  we  contemplate  the  existence  of  sets  having  a 
hypothetical  cardinal  number  x  different  from  a  or  c.  A  point  such  that,  in 
every  sufficiently  small  neighbourhood  of  the  point,  there  existed  a  set  of 
cardinal  number  x  of  points  belonging  to  G,  would  be  termed  a  point  of 
degree  x  in  the  set  G. 

A  point  P,  whether  it  belongs  to  G  or  not,  is  said  to  be  a  point  of  un- 
enumerable  degree  in  G  if,  in  every  neighbourhood  of  the  point,  there  exists  an 
unenumerable  set  of  points  which  belong  to  G.  Such  a  point  has  been  termed* 
by  Lindelof  a  point  of  condensation.  As  however  this  expression  is  employed 
by  some  writers  to  denote  a  limiting  point,  it  is  better  avoided.  The  set  of 
all  such  points  P  (belonging  to  G  or  not)  is  closed,  for  any  limiting  point 
clearly  belongs  to  the  set.  It  is  obvious  that,  if  G  be  enumerable,  it  contains 
no  points  of  unenumerable  degree  in  G,  for  every  part  of  an  enumerable  set 
is  enumerable  (finite  or  infinite). 

Conversely  it  may  be  shewn  that : 

If  no  point  of  G  is  a  point  of  unenumer ability  in  G,  the  set  G  is  enumerable. 

Let  a  system  of  nets  be  fitted  on  to  the  finite,  or  indefinitely  great,  cell  in 
which  G  is  contained.  Any  point  P,  of  G,  is  defined  by  a  unique  sequence  of 
meshes  of  the  successive  nets  of  the  system.  There  must  be  a  value  n,P  of  n, 
from  and  after  which  all  the  meshes  {dn}  of  the  sequence  which  defines  P 
contain  only  an  enumerable  set  of  points  of  G.  Thus,  to  each  point  P  of  G, 
we  can  correlate  one  mesh  dn  ;  but  the  same  mesh  may  be  correlated  with 
more  than  one  point  P.  Taking  all  the  meshes  dn  3,  for  all  the  points  P  of  G, 
these  form  an  enumerable  set,  being  part  of  the  enumerable  set  of  all  the 
meshes  of  the  system  of  nets.  Since  each  mesh  dnp  contains  only  an  enumerable 
set  of  points  of  G,  it  follows  (§  58)  that  G  is  an  enumerable  set. 

89.  Every  set  G,  not  enumerable,  is  the  sum  of  an  enumerable  set  {possibly 
absent}  and  a  set  in  which  each  point  is  of  unenumerable  degree  in  the  set,  and 
which  is  therefore  dense  in  itself. 

Thus  G  =  H  +  K,  where  H  is  enumerable,  and  K  is  such  that  each  point 
is  a  point  of  unenumerable  degree  in  G.  Since  every  point  of  K  has  in  its 
neighbourhood  an  unenumerable  set  of  points  of  G,  and  therefore  of  K,  the 
set  K  is  dense  in  itself.  To  prove  the  theorem,  let  H  be  that  part  of  G,  each 
point  of  which  is  a  point  of  zero,  or  of  enumerable,  degree  in  G ;  it  is  then 
clear  that  H  must  be  a  set  in  which  every  point  is  of  zero,  or  of  enumerable, 
degree  in  H',  therefore  H  is  enumerable.  The  set  K=G  —  H  is  then  un- 
enumerable because  each  point  of  it  contains  in  its  neighbourhood  an  un- 
enumerable set  of  points  of  G,  and  therefore  of  K. 

*  See  Comptes  Rendus,  Paris,  November  2,  1903. 
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It  is  clear  that  every  limiting  point  of  a  set  of  points  of  unenumerability 
is  itself  such  a  point.  Since  K  is  dense  in  itself,  it  is  contained  in  its  derivative 
K'\  and  as  K'  has  no  isolated  points,  it  is  perfect. 

If  G  is  a  closed  set,  it  must  contain  all  the  points  which  are  of  unenumerable 
degree  in  it  ;  thus  K'  and  K  are  identical,  and  therefore  K  is  perfect.  There- 
fore we  have  the  theorem  : 

A  closed  set  is  the  sum  of  an  enumerable  set  and  a  perfect  set,  either  of 
which  may  be  absent. 

The  proof  of  the  theorem  here  given  is  due  to  Lindelof  (loc.  cit.). 

For  the  special  case  of  linear  closed  sets  this  theorem  has  already  been 
proved  in  §  81.  But  the  following  theorem  may  here  be  added*  : 

A  linear  closed  set  is  the  sum  of  an  enumerable  set  and  a  set  which  is  dense 
in  itself,  such  that  each  point  of  the  latter  set  is  a  limiting  point  on  both  sides. 

For,  a  point  of  K  that  is  a  limiting  point  only  on  one  side  is  also  a  point 
of  K'  which  can  only  be  a  limiting  point  of  K'  on  one  side  ;  moreover  the  set 
of  all  such  points  of  K',  viz.  the  extremities  of  the  intervals  contiguous  to  it, 
is  enumerable.  Therefore  those  points  of  K  that  are  limiting  points  of  K  on 
one  side  only  form  an  enumerable  set  ;  and  if  these  be  removed  from  K  we 
have  a  set  Kl  the  points  of  which  are  limiting  points  of  K,  and  therefore  also 
of  Kl  ,  on  both  sides. 

It  is  asserted  in  the  first  theorem  given  above  that  every  unenumerable  set 
contains  a  component  that  is  dense  in  itself.  An  enumerable  set  may  contain 
such  a  component,  and  if  it  does,  its  derivative  contains  a  perfect  set,  and  thus 
the  enumerable  set  is  irreducible. 

90.  If  G  is  an  unenumerable  set,  those  points  of  G  which  are  of  the  same 
degree  x(>  a)  in  G  form  a  set  which  is  dense  in  itself,  and  of  which  the  cardinal 
number  does  not  exceed  x. 

Let  us  fit  on  a  system  of  nets  to  the  unbounded  space  in  which  G  is  con- 
tained. Consider  those  points  P  that  are  of  degree  x  in  G  ;  P  is  in  a  rnesh 
dn  (a)  in  which  the  set  of  points  of  G  is  of  degree  a;  ;  n  may  have  the  smallest 
value  for  which  this  is  the  case.  When  all  such  points  P  are  taken,  the 
meshes  dn  (x)  corresponding  to  them  form  an  enumerable  (or  finite)  set,  a  part 
of  the  enumerable  set  of  all  the  meshes  of  the  system  of  nets.  In  each  of 
these  meshes  dn  (x)  there  is  a  set  of  points  of  G  of  cardinal  number  x.  [it  will 
be  shewn  in  Chapter  IV,  in  connection  with  the  general  theory  of  cardinal 
numbers,  that  an  enumerable  set  of  aggregates  of  points,  each  of  cardinal 
number  x,  has  the  cardinal  numbers.  Since  all  those  points  of  G  that  are  of 
degree  x  in  the  set  are  contained  in  the  enumerable  set  of  meshes  so  obtained, 
it  follows  that  the  set  of  such  points  of  G  cannot  have  a  cardinal  number 
greater  than  x. 

*  W.  H.  Young,  Quart.  Journ.  of  Math.,  vol.  xxxix,  p.  76. 
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To  shew  that  the  set  is  dense  in  itself,  we  observe  that,  in  a  sufficiently 
small  neighbourhood  (a,  /3)  of  Px,  there  is  a  set  of  points  of  0,  of  cardinal 
number  x  ;  and  that  none  of  these  points  can  be  of  degree  in  G  higher  than  x, 
For  if  there  were  such  a  point  Q  of  degree  higher  than  x,  in  some  interval 
(a',  /3')  contained  in  (a,  /3)  and  containing  Q  in  its  interior,  there  would  be  a 
set  of  points  of  G  of  cardinal  number  higher  than  x  ;  but  this  is  impossible,  as 
all  these  points  would  be  in  (a,  /3).  Again,  the  points  of  G  in  (a,  /3),  other  than 
Px,  cannot  be  all  of  degree  lower  than  x  ;  for  if  they  were  so,  their  cardinal 
number  would  be  lower  than  x,  since  they  could  be  enclosed  in  an  enumerable 
set  of  non-overlapping  intervals  with  Px  as  sole  external  point.  Moreover, 
since  in  any  arbitrarily  small  neighbourhood  of  Px  there  are  points  of  the  same 
degree  x  in  G,  Px  is  a  limiting  point  for  such  points.  Therefore  the  set  of 
points  such  as  Px  is  dense  in  itself. 

The  set  of  points  of  degree  c  is  dense  in  itself,  and  of  cardinal  number  c. 

91.  Any  set  G  consists  of  isolated  points  which  form  an  enumerable  set 
called  the  adherence  of  G,  and  of  limiting  points  which  form  a  set  called  the 
coherence  of  G.  Denoting  the  adherence  and  the  coherence  of  G  by  Ga,  Gc 
respectively,  we  have  G  =  Ga  +  Gc. 

The  set  Gc  can  in  a  similar  manner  be  split  up  into  its  adherence  and  its 
coherence,  which  we  denote  by  Gca  and  Gc2  respectively  ;  thus 

Gc  =  Gca  +  Gc2. 

The  set  Gca  is  an  isolated  set,  and  therefore  enumerable  ;  and  if  we  proceed 
to  resolve  Gcz  in  a  similar  manner  into  its  adherence  Gc2a,  and  its  coherence 
Gc3,  and  then  to  resolve  Gcs,  it  is  clear  that  the  process  may  be  continued 
any  number  n  of  times.  We  thus  obtain 

G  =  Ga  +  Gca  +  Gc2a  +  ...+  Gcn~la  +  Gcn. 

The  set  Gcn~*  a  may  be  named  the  adherence  of  G  of  order  n,  and  Gcn  may  be 
denominated  the  coherence  of  G  of  order  n. 

It  may  happen  that,  for  some  value  of  n,  Gcn  vanishes  ;  in  that  case  G  has 
been  split  up  into  a  finite  number  of  enumerable  sets,  and  is  consequently 
itself  enumerable.  If  this  be  not  the  case,  the  process  may  be  continued 
indefinitely,  and  Gcn  then  exists  for  every  value  of  n.  We  then  define 

D(G,  Gc,  Gc\...  Gcn,...), 

the  set  of  points  common  to  all  the  coherences  of  G,  to  be  the  coherence  of 
order  to,  and  denote  it  by  GC".  It  is  clear  that  every  point  of  G  which  does 
not  belong  to  one  of  the  sets  Gcll~la,  belongs-  to  Gc™,  hence  we  have 


the  summation  being  taken  for  all  values  of  n  belonging  to  the  first  class. 

We  now  split  up  GC™  into  its  adherence  GcMa,  and  its  coherence  Gcu+l,  and 
proceed  further  to  obtain  the  adherences  and  coherences  of  G  of  the  orders 
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of  the  various  numbers  of  the  second  class.  If  alt  a.,,  ...  On,  ...  is  a  sequence 
of  numbers  of  the  second  class,  which  has  $  for  its  limit,  the  coherence  of 
order  /9  is  defined  by 

GcP  =  D  (Gc--,  Gc*>,  ...  G<?»,  ...). 

We  now  obtain  a  resolution  of  G  of  the  form 

G  =  2Gcpa  +  Gey, 

where  7  is  any  number  of  the  first  or  second  class,  and  the  summation  refers 
to  all  values  of  p  which  are  less  than  7.  Each  adherence  Gcpa  is  an  isolated 
set,  and  therefore  enumerable  ;  and  if  G  contains  a  component  which  is  dense 
in  itself,  this  component  is  contained  in  Gcv. 

First  suppose  G  to  be  an  enumerable  set  ;  the  process  of  analysis  must 
then  cease  for  some  number  7  of  the  first  or  second  class.  For  if  Gcpa  existed 
for  every  number  7  of  the  second  class,  we  should  have  obtained  an  unenumerable 
set  of  adherences  containing  no  points  in  common,  and  all  belonging  to  G  : 
thus  G  could  not  be  enumerable. 

The  cessation  of  the  process  may  take  place  in  two  different  manners  : 

(1)  if  for  some  number  7  of  the  first  or  second  class,  Gc?  =  0,  G  has  been 
resolved  into  an  enumerable  set  of  adherences,  and  it  contains  no  component 
which  is  dense  in  itself: 

(2)  if  for  some  number  7, 


in  which  case  Go*  =  Gcv+l,  the  set  G&  then  contains  no  adherence,  and  every 
point  of  it  is  a  limiting  point,  and  G&  is  therefore  dense  in  itself.  The  set  G 
has  consequently  been  resolved  into  an  enumerable  component  which  contains 
no  part  that  is  dense  in  itself,  and  into  a  set  which  is  enumerable  and  dense 
in  itself. 

Next,  let  us  suppose  that  G  is  an  unenumerable  set.  Then  it  has  been  shewn 
that  those  points  of  G  which  are  of  unenumerable  degree  in  G  form  a  set  that 
is  dense  in  itself;  and  those  points  which  belong  to  the  adherences  of  all  orders 
are  points  of  zero,  or  of  enumerable,  degree,  and  thus  form  an  enumerable  set. 
It  follows,  since  all  points  that  do  not  belong  to  that  part  of  G  which  is 
dense  in  itself  belong  to  the  adherences,  that  the  number  of  adherences  must 
be  enumerable  ;  and  thus  that,  for  some  number  7  of  the  first  or  second  class, 
Gtf  is  dense  in  itself.  The  set  Get  may  consist  of  an  enumerable  set  dense  in 
itself,  and  of  sets  of  higher  cardinal  numbers  dense  in  themselves. 

It  has  thus  been  shewn  that  any  set  G  mat/  be  represented  by 

G=U+Va  +  ZVx+Vc, 

where  U  is  an  enumerable  set  which  contains  no  component  that  is  dense  in  itself, 
Va  is  an  enumerable  set  of  points  of  degree  a  dense  in  itself,  V,.  is  a  set,  of  cardinal 
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number  c,  consisting  of  points  of  degree  c,  dense  in  itself;  Vx  is  a  set  dense  in  itself 
consisting  of  points  of  degree  x,  where  a<x<c. 

If,  as  is  probable,  no  cardinal  numbers  exist  between  a  and  c,  the  sets  Vx 
can  be  omitted.  A  set  such  as  Va,  Vx,  Vc  is  denominated  a  homogeneous  set 
of  degree  a,  x,  c,  in  the  set  G. 

If  G  is  a  closed  set,  then,  as  has  been  shewn  in  §89,  Fc  is  perfect,  and  ^Vx 
cannot  exist. 

INNER   AND   OUTER   LIMITING  SETS. 

92.  Let  Si,  S2,  ...  Sn,  ...  be  a  sequence  of  sets  of  points,  in  one  or  more 
dimensional  space,  such  that  each  set  Sn+i  of  the  sequence  is  contained  in  the 
preceding  one  Sn,  then  the  set  Su,  or  D(Sl}  S2,  ...  Sn, ...),  consisting  of  points 
each  of  which  belongs  to  all  the  sets  of  the  sequence,  is  said,  when  such  set 
exists,  to  be  the  inner  limiting  set  of  the  sequence  of  sets.  A  point  of  the 
inner  limiting  set  which  does  not  belong  to  Sn,  Sn  +  l,  ...  but  belongs  to  Sn^1 
is  said  to  be  shed  at  the  index  n. 

When  Si,  S2,  ...  Sn,  ...  is  a  sequence  of  sets,  each  one  of  which  Sn  is 
contained  in  the  next  $w+1,  the  set  $„,,  or  M(Sl}  S2,  ...  Sn,  ...  ),  which  consists 
of  the  set  of  those  points  each  of  which  belongs  to  all  the  sets,  from  and  after 
some  value  of  n  dependent  on  the  particular  point,  is  said  to  be  the  outer 
limiting  set  of  the  sequence. 

If  any  sequence  2j,  S2,  ...  2n,  ...  of  sets  be  given,  and  we  take  $1  =  2i, 
8,  =  D($i,  22),  S3  =  D  (Sa,  22,  23),  and  in  general  8m  =  D  (%,  28,  ...  2m),  the 
inner  limiting  set  Sm,  when  it  exists,  also  defines  D  (£1,  22,  ...  2m>  ...  ),  and 
may  be  regarded  as  defined  by  the  given  sets  2i,  22,  .... 

For  a  point  that  belongs  to  all  the  sets  2X,  S2,  ...  2n,  •••  belongs  to  all  the 
sets  Si,  S2,  83,  ...  Sn,  ...  ;  and  a  point  that  belongs  to  all  the  latter  sets 
belongs  to  all  the  former  sets. 

Thus  it  has  been  shewn  that  the  set  of  points  common  to  all  the  sets  of  any 
given  sequence  of  sets  is  an  inner  limiting  set. 

Again,  if  we  take  S1  =  2lf  &  =  Jtf"  (2,,  £>),  83  =  M(21}  22,  28),  ...  the  outer 
limiting  set  Su  also  defines  M  (2j,  22,  •••  2n,  ...  ),  and  thus  the  set  of  all  points 
tJtat  belong  to  one  at  least  of  the  sets  of  any  given  sequence  of  sets  is  an  outer 
limiting  set. 

If  Su  be  the  inner  limiting  set  of  the  sequence  Si,  S2,  S3,  ...  of  sets, 
each  of  which  contains  the  next,  the  sequence  0(8^,  C (&>),  0(8.^),  ...  of  the 
sets  complementary  to  the  first  sequence  with  respect  to  a  cell  (or  interval), 
whether  finite  or  not,  which  contains  them  all,  is  such  that  each  set  is 
contained  in  the  next,  and  their  outer  limiting  set  is  (?($„,). 
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It  has  been  shown  in  §  67  that,  when  the  sets  <Slt  /S'a,  ...  are  all  closed  sets, 
the  inner  limiting  set  always  exists,  and  is  a  closed  set. 

Every  set  of  points  can  be  exhibited  as  the  inner  limiting  set  of  a  sequence 
of  sets  each  of  which  contains  the  next  ;  and  also  as  the  outer  limiting  set  of  a 
sequence  of  sets  each  of  which  is  contained  in  the  next. 

Let  a  set  G  be  contained  in  the  interior  of  a  cell  (a(I),  a'2),  ...  a(p)  ;  b(1},  b(t>, 
...  bw)  ',  and  let  06  be  the  component  of  G  that  is  contained  in  the  cell 
(a^,  aw,  ...  a*>  ;  06(1>,  06(2),  ...  0&(x»)>  where  0  <  8  <  1. 

If  #1,  0a,  ...  6n,  ...  be  an  increasing  sequence  of  values  of  0  that  converges 
to  1,  the  sets  G6l,  Ggt,  ...  will  be  such  that  each  one  is  contained  in  the  next, 
and  their  outer  limiting  set  is  the  given  set  G. 

Also  G  is  the  inner  limiting  set  of  the  sequence  of  those  sets  which  are 
complementary  to  the  sets  of  a  sequence  for  which  C(G)  is  the  outer 
limiting  set. 

In  case  G  is  unbounded,  it  may  be  placed  in  correspondence  with  a  set 
interior  to  the  cell  (—1,  1),  and  the  theorem  therefore  holds  for  G,  since  it 
holds  for  the  set  that  corresponds  to  G. 

SETS   OF   THE   FIRST,   AND   OF   THE   SECOND,   CATEGORY. 

93.  The  outer  limiting  set  of  a  sequence  of  non-dense  sets  G},  G2,  ...  Gn,  ... 
each  of  which  contains  the  preceding  one  is  said  to  be  a  set  of  the  first  category. 

This  is  equivalent  to  the  definition  given*  by  Baire,  that,  if  P1}  P2,  .  .  .  Pn,  .  .  . 
be  any  sequence  of  sets,  non-dense  in  the  fundamental  interval,  or  cell,  in 
which  they  are  all  contained,  the  set  M(Pl,  P2,  ...  Pn,  ...  ),  consisting  of 
all  the  points  that  belong  to  any  of  the  given  sets,  is  of  the  first  category. 
For  we  have  only  to  take  Gn=  M(Pl,  P2)  ...  Pn)',  and  Gn  is  non-dense.  The 
set  PJ,  P2>  ...  Pn>  ...  can  always  be  replaced  by  non-dense  sets,  Qlt  Q2,  Q3,  ... 
no  two  of  which  have  a  point  in  common  ;  for  we  may  take 


Any  set  which  is  not  a  set  of  the  first  category  is  said  to  be  of  the  second 
category. 

It  is  clear  that  a  set  of  the  first  category  is  enumerable  if  the  sets 
Glt  G2,  ...  Gn,  ...  are  all  enumerable,  and  that  every  enumerable  set  is  of  the 
first  category. 

An  enumerable  set  of  the  first  category  may  be  everywhere  dense.  For 
example,  the  set  of  rational  points  in  a  given  cell,  or  interval,  (a,  6)  is  of 
the  first  category,  and  is  everywhere  dense. 

A  set  of  the  first  category  may  have  the  cardinal  number  of  the  continuum, 
and  may  then  be  everywhere  dense,  or  non-dense. 

*  Annali  di  Mat.  (3),  vol.  in,  p.  65,  where  the  distinction  between  sets  of  the  first  and  second 
categories  was  first  introduced. 

H.  9 
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A  set  of  the  first  category  is  not  necessarily  closed.  As  an  illustration  of 
this  fact,  let  us  consider  a  set  of  cells  PlQi,  P2Q2,  •••  PnQn,---  each  of  which 
contains  the  next,  and  such  that  the  sequence  converges  to  a  single  point  p 
interior  to  all  the  cells  of  the  sequence.  Let  Gl  consist  of  a  set  of  points  on 
the  boundary  of  P±Qi ;  let  G2  consist  of  G^  together  with  a  set  of  points  on  the 
boundary  of  P2Q2;  and  generally',  let  Gn  consist  of  Gn^  together  with  a  set 
of  points  on  the  boundary  of  PHQn.  The  point  p  does  not  belong  to  Gn  for 
any  value  of  n,  and  therefore  does  not  belong  to  G^,  but  it  is  clearly  a  limiting 
point  of  G» ;  and  therefore  GM  is  not  a  closed  set. 

To  illustrate  the  fact  that  an  unenumerable  set  of  the  first  category  may 
be  everywhere  dense,  fit  on  to  the  cell,  or  interval,  (a,  b)  a  system  of  closed 
nets.  Let  (rj  be  a  non-dense  perfect  set  in  (a,  b) ;  in  each  mesh  of  Dl  place  a 
non-dense  perfect  set,  and  let  G2  be  the  perfect  set  made  up  of  all  such  non- 
dense  perfect  sets,  together  with  G^.  Let  G3  consist  of  G2  together  with 
non-dense  perfect  sets  placed  in  all  the  meshes  of  D2 ;  and  so  on. 

The  set  of  the  first  category  defined  by  G1}  G2,  ...  is  clearly  everywhere 
dense;  for  any  sub-cell  contains  a  mesh  of  Dn,  for  some  sufficiently  large 
value  of  n,  and  thus  contains  points  of  Gn,  and  therefore  of  G^. 

In  case  Gu  is  closed,  it  must  be  non-dense,  for  it  will  be  shewn  that  C  (Gu) 
is  everywhere  dense,  and  therefore  G^  cannot  contain  a  cell,  or  interval. 

94.  A  set  of  points  which  is  complementary  to  a  set  of  the  first  category  is 
a  set  of  the  second  category. 

In  particular,  a  cell,  or  an  interval,  is  not  the  sum  of  two  sets  of  the  first 
category. 

To  prove  this  theorem,  we  shew,  in  the  first  place,  that  the  set  comple- 
mentary to  a  set  of  the  first  category  is  everywhere  dense.  For  let  (a,  /?)  be 
any  cell,  or  interval,  in  the  fundamental  cell,  or  interval,  in  which  the  set  of 
the  first  category  is  contained.  There  exists  a  cell,  or  interval,  (a1}  /?0  interior 
to  (a,  /3),  in  which  there  is  no  point  of  the  non-dense  set  Gl.  Again  (alt  /30 
contains  in  its  interior  a  cell,  or  interval,  (a2,  /32)  containing  no  points  of  G.2 ; 
and  so  on. 

There  exists  then  a  point  interior  to  all  the  cells,  or  intervals, 

(a,  /S)  (alf  /SO  (aa,  /32) («„,  &0 

which  is  not  a  point  of  GM.  Hence  the  set  complementary  to  G»  is  everywhere 
dense.  It  follows  from  this  that  a  set  of  the  first  category  cannot  contain  all 
the  points  of  any  interval,  or  cell,  and  is  thus  a  diffuse  set.  Next,  let  us 
assume,  if  possible,  that  C(GU1}  is  itself  the  limit  of  a  sequence  Qlt  Q2,  ...  Qn,  ... 
of  non-dense  sets,  each  of  which  is  contained  in  the  next.  The  sets 
GI  +  Qi,  G2  +  Q2,  ...  are  all  non-dense,  and  each  one  of  them  is  contained  in 
the  next;  their  limit,  which  is  of  the  first  category,  is  identical  with  the 
fundamental  cell,  and  this  has  been  shewn  to  be  impossible.  Hence  the 
complement  of  a  set  of  the  first  category  is  not  of  the  first  category. 


93-95]  Sets  of  the  First  and  Second  Categories  131 

The  converse  of  the  above  theorem  does  not  hold  good.  It  is  not  true  that 
every  set  of  the  second  category  is  the  complement  of  a  set  of  the  first 
category.  Thus  there  are  two  kinds  of  sets  of  the  second  category ;  those 
which  are  complementary  to  a  set  of  the  first  category,  and  those  for  which 
this  is  not  the  case.  If,  for  example,  in  a  linear  interval  (a,  6)  the  points  of  a 
sub-interval  (ou  &)  be  taken,  these  make  up  a  set  of  the  second  category,  and 
the  complementary  set  is  also  of  the  second  category.  It  has  been  shewn*  by 
Mahlo  that  the  points  of  a  linear  interval  (a,  b)  can  be  divided  into  two  sets, 
each  of  which  is  everywhere  dense,  and  of  the  cardinal  number  of  the  continuum, 
and  neither  of  which  is  of  the  first  category. 

95.  A  set  of  the  second  category  which  is  complementary  to  a  set  of  the 
first  category  is  saidf  to  be  a  residual  set. 

A  residual  set  can  be  obtained  by  the  successive  removal  from  a  fundamental 
cell,  or  interval,  of  non-dense  sets  of  points  belonging  to  a  sequence  of  such 
sets.  It  has  been  shewn  in  §  94  that  a  residual  set  is  everywhere  dense.  It  will 
now  be  shewn  that : 

A  residual  set  has  the  cardinal  number  of  the  continuum. 

Let  GM  be  a  set  of  the  first  category,  and  C  (GM)  the  corresponding  residual 
set.  Let  a  system  of  nets  be  fitted  on  to  the  fundamental  cell,  or  interval,  in 
which  GU  is  contained.  If  n  be  sufficiently  large,  there  is  a  mesh  of  the  net  Dn 
that  contains  no  point  of  G^ ;  the  smallest  value  of  n,  say  n^,  for  which  this  is 
the  case  may  be  chosen.  Let  this  mesh  be  denoted  by  d.  There  is  some 
smallest  value  of  n,  say  n2,  such  that  d  contains  in  its  interior  two  (or  more) 
meshes  d0,  dlf  of  the  net  DB2,  neither  of  which  contains  a  point  of  G2.  In  case 
there  are  more  than  two  such  meshes,  d0,  dl  are  those  of  lowest  rank.  Again, 
there  is  some  smallest  value  of  n,  say  w3,  such  that  both  d0  and  dl  contain 
within  them  two  or  more  meshes  of  Dna.  Let  the  two  meshes  in  d0  be 
denoted  by  dw,  dol ;  and  the  two  meshes  in  dl  by  dw,  du.  Proceeding  in  this 
way,  we  can  define  a  sequence  of  meshes  dp,  dpq,  dpqr,  ...  each  of  which 
contains  the  next  in  its  interior;  where  each  of  the  indices  p,  q,  r,  ...  is  either 
0  or  1  ;  and  where  the  sequence  p,  q,  r,  ...  is  defined  in  accordance  with  some 
set  of  rules.  The  point  interior  to  all  the  meshes  of  this  sequence  does  not 
belong  to  (•?„,,  and  is  therefore  a  point  of  the  residual.  But  such  a  sequence 
can  be  defined  corresponding  to  each  number  of  the  continuum;  leaving  out 
those  sequences  in  which  all  the  indices,  from  and  after  a  fixed  one,  are  all  1, 
the  correspondence  is  unique.  Hence  the  residual  must  have  the  cardinal 
number  c,  of  the  continuum,  Further,  it  can  be  proved  that: 

Any  finite,  or  enumerobly  infinite,  set  of  residuals  have  in  common  a  set 
which  is  also  a  residual  set. 

*  See  Leipz.  Ber.,  vol.  LXV,  p.  283. 

t  See  Deujoy,  Journal  de  Math.  (7),  vol.  i,  p.  123,  who  has  introduced  this  terminology. 

9 o 
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Let  H(l},  H®,  ...  be  residuals  obtained  by  removing  successively  the  sets  of 
sequences  {Gnw},  {Grn®},  ...  of  non-dense  sets  from  an  interval,  or  a  cell;  the 
non-dense  sets  can  be  arranged  in  enumerable  order 


If  these  sets  be  removed  successively  from  the  interval,  or  cell,  we  obtain  the 
residual  D  (#<",  #<2>,  #<3>,  ...). 


96.  It  is  often  of  importance  to  consider  the  properties  of  sets  that  are 
contained  in  a  given  perfect  set  H,  which  may  be  non-dense  in  the 
continuum. 

If  PI,  P2,  •••  Pn,  •••  be  a  sequence  of  sets  all  of  which  are  non-dense  in  the 
perfect  set  H,  the  set  M  (P]}  P2,  ...)is  said  to  be  of  the  first  category  relatively 
to  H.  A  set  which  is  not  of  the  first  category  relatively  to  H  is  said  to  be  of  the 
second  category  relatively  to  H.  A  set  whose  complement  relatively  to  H  is  of 
the  first  category  relatively  to  H  is  said  to  be  a  residual  set  relatively  to  H. 

It  can  be  shewn  that  : 

A  set  of  points  which  is  the  complement,  relatively  to  the  perfect  set  H,  of  a 
set  of  the  first  category,  relatively  to  H,  is  of  the  second  category  relatively  to  H. 

In  particular  H  cannot  be  the  sum  of  two  sets  both  of  the  first  category 
relatively  to  H. 

This  is  proved  in  a  manner  precisely  similar  to  that  in  which  the  theorem 
for  the  case  in  which  H  is  a  continuous  cell,  or  interval,  was  proved  in  §  94.  Thus 
it  is  first  shewn  that  a  residual  relatively  to  H  is  dense  in  H,  and  then  the 
proof  of  the  theorem  is  completed  as  in  §  94. 

It  follows  that  a  set  of  the  first  category  relatively  to  H  is  diffused  in  H. 

It  can  be  proved,  as  in  §  95,  that  a  residual  with  respect  to  H  has  the  cardinal 
number  of  bhe  continuum. 

It  has  been  shewn  by  Denjoy*  that  a  residual  relatively  to  a  perfect  set 
H  itself  contains  a  perfect  set. 

It  will  appear,  from  the  theory  of  order-types  developed  in  Chapter  IV, 
that  the  points  of  a  perfect  linear  set  can  be  made  to  correspond  uniquely 
with  points  of  a  continuous  interval  (a,  b),  in  such  a  manner  that  the  relative 
order  of  two  points  of  the  perfect  set  is  the  same  as  that  of  the  corresponding 
points  in  (a,  6)  ;  the  end-points  of  an  interval  contiguous  to  the  perfect  set 
corresponding  to  one  point  of  (a,  b).  To  a  closed  set  H,  non-dense  in  the 
perfect  set,  there  corresponds  a  closed  set,  non-dense  in  the  continuum;  and 
sets  of  the  first,  or  the  second,  category  relatively  to  the  perfect  set  correspond 
respectively  to  sets  of  the  first,  or  of  the  second,  category  in  (a,  b).  It  thus 
appears  that  the  properties  of  sets  of  the  first,  and  second,  categories  in  the 
continuum  can  be  immediately  extended  to  sets  of  the  first,  and  second, 
categories  relatively  to  a  perfect  set. 

*  Journal  de  Math.  (7),  vol.  i,  p.  232. 
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This  is  a  particular  case  of  the  general  property  of  any  perfect  set ;  viz. 
that  all  descriptive  properties  of  sets  of  points  in  the  continuum  correspond 
to  identical  properties  of  sets  of  points  in  a  perfect  set,  even  if  the  perfect  set 
be  itself  non-dense  in  the  continuum. 


EXAMPLES. 

1.  Let  P,,  /*2,  P3,  ...  /'», ...  be  an  enumerable  set  of  points  in  an  interval  (a,  b);  the 
set  may  be  everywhere  dense  in  (a,  6).    The  finite  sets 

(A),    (Pi,  A),    (Pi,  P-2,  PS),  ...  (Pi,  P*,  ".  Pn),  ". 

are  each  closed,  and  the  given  set  is  the  limiting  set,  which  is  therefore  of  the  first  category. 
The  remaining  joints  of  (a,  6)  form  a  set  of  the  second  category,  which  is  a  residual. 

2.  Denoting  the  points  of  the  interval  (0,  1),  as  in  Ex.  5,  §  83,  by 

x= T  "*"  TT"^  '"  ~*T~t       /.^'"' 

K j         KI  K%  K i  K%  .  .  .  Kn 

where  cn<kn;  let  the  fixed  integers  k±,  kz,...kn,...  form  a  sequence  which  increases 
without  limit.  If*  aj,  a2,  ...  an,  ...  is  any  sequence  of  positive  integers  which  increase 
without  limit,  let  Gn  denote  the  set  of  those  numbers  x,  which  are  such  that  the  integers 
Ci,  c2,  ...  cn,  ...  are  all  <  an.  The  sets  6^,  Gz,  O3,  ...  Gn,  ...  are  a  sequence  of  perfect  sets, 
each  one  of  which  contains  the  preceding  ones  ;  the  set  Ou>  is  then  a  set  of  the  first  category. 

3.  The  numbers  of  the  continuum  (0, 1)  may  be  divided  into  sets,  of  the  first,  and  of  the 
second  category,  in  the  following  manner:  All  the  numbers  in  (0,  1)  may  be  expressed  as 
endless  decimals;  the  finite  decimals  being  therefore  not  used.    Lett  the  set  H  consist  of 
all  those  numbers  in  which  the  digit  9  occurs  only  a  finite  number  of  times,  and  of  those 
numbers  also  in  which,  from  and  after  some  place,  all  the  figures  are  9.    The  complementary 
set  K  consists  of  all  those  numbers  in  which  9  occurs  an  infinite  number  of  times,  except 
those  in  which  every  figure  is  9,  from  and  after  some  place.    The  set  H  is  the  limit  of  a 
sequence  of  non-dense  closed  sets  Hly  H2, ...  Hn,  ...  each  of  which  is  of  cardinal  number  c. 
For,  let  HI  consist  of  the  numbers  of  the  form  -abc  ...  £999  ...,  in  which  every  figure  is  9, 
after  some  fixed  place,  and  in  which  none  of  the  figures  a,  b,  c,  ...  k  is  9 ;  together  with  those 
decimals  in  which  no  figure  is  9.    No  number  of  the  set  Sl  can  lie  within  the  interval 
(•abc...  £899 ...,  •abc, ...  £999  ...)  which  is  therefore  a  complementary  interval  of  the  set. 
The  set  Hn  may  be  taken  to  consist  of  the  numbers  of  the  form  -abc  ...  M999  ...,  in  which 
k  is  not  9,  and  not  more  than  n—  1  of  the  figures  a,  6,  c,  ...  /;,  are  9;  together  with  those 
decimals  in  which  9  does  not  occur.    That  each  of  the  sets  Hn  is  of  cardinal  number  c, 
follows  from  the  fact  that  it  contains  all  the  decimals  in  which  9  does  not  occur ;  and  these, 
if  interpreted  in  the  scale  of  8,  represent  all  the  numbers  of  the  continuum  (0,'l).    The  set 
H  is  everywhere  dense,  since  it  contains  that  everywhere  dense  set  of  numbers  in  which 
every  figure  is  9,  after  some  place.    The  set  A',  being  of  the  second  category,  is  also  every- 
where dense,  and  of  cardinal  number  c. 

4.  The  following  method  of  dividing  the  continuum  (0,  1)  into  two  portions,  each  of 
which  is  everywhere  dense,  and  of  cardinal  number  f,  has  been  given  by  Broden  J :  Let 

1-2  +  •••  +  ln+ ...  denote  a  divergent  series  of  positive  numbers,  such  that  the  limit 

*  Broden,  Math.  Annalen,  vol.  LI,  p.  299. 
t  See  Schoenflies,  BericJit,  vol.  i,  p.  106. 
J  Crelle's  Journal,  vol.  cxviu,  p.  29. 
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of  ln,  as  n  is  indefinitely  increased,  is  zero.  Let  a  be  a  positive  number  <  1,  and  let 
HI,  tt?,  ...  n  , ...  be  a  sequence  of  increasing  positive  integers.  It  is  possible  to  choose  the 
divergent  series  so  that  each  of  the  ratios  Injln^,  Injln^  ...  is  <a:  if  this  be  done,  the 

series  2  £»t  is  convergent,  its  sum  being  <  ~-  .   Each  of  the  series  obtained  from  2  £„ t,  by 

1=1  ~~ a  1=1 

leaving  out  a  finite  number  of  terms,  is  also  convergent.    The  convergent  series,  so  obtained, 

form  an  unenumerable  set :  for  they  are  obtained  by  multiplying  the  terms  of  the  series  2  ZWl, 

1=1 
each  either  by  0,  or  by  1 ;  and  thus  there  is  a  series  corresponding  to  each  fractional  number 

expressed  in  the  dyad  scale.  Corresponding  to  each  convergent  series,  there  is  a  divergent 
series  which  consists  of  I0  +  li  + ...+ln  +  ...,  with  the  convergent  series  removed  from  it. 
We  obtain  in  this  manner  an  unenumerable  set  of  divergent  series.  The  convergent  and 
divergent  series,  each  of  which  consists  of  terms  of  I0  +  li  +  ...  +  £»  +  ...,  may  now  be  corre- 
lated with  the  numbers  of  the  continuum  (0,  1).  Let  these  numbers  be  expressed  in  the 
dyad  scale,  in  the  form  •a1a2«3  •••>  where  every  a  is  0,  or  1,  and  the  case  in  which  every 
figure  is  zero,  after  some  place,  is  excluded.  To  one  of  the  series  lp  +  lq  +  lr  + ...,  we  may 
take  that  number  to  correspond  in  which  ap,  aq,  ar, ...  are  all  1,  and  the  remaining  digits 0. 
The  points  of  (0,  1)  are  thus  divided  into  two  classes;  one  of  these  consisting  of  all  the 
numbers  which  correspond  to  convergent  series,  and  the  other  of  those  corresponding  to 
divergent  series. 


ORDINARY  INNER  LIMITING  SETS. 

97.  The  inner  limiting  set*  of  a  sequence  of  open  sets,  each  one  of  which 
contains  the  next,  is  said  to  be  an  ordinary  inner  limiting  set. 

The  outer  limiting  set  of  a  sequence  of  closed  sets,  each  of  which  is  con- 
tained in  the  next,  is  said  to  be  an  ordinary  outer  limiting  set. 

Thus  an  ordinary  inner  limiting  set  is  the  complement  of  an  ordinary  outer 
limiting  set. 

In  the  case  of  a  linear  set  of  points  contained  in  a  given  interval  (a,  b), 
to  which  the  case  of  a  linear  set  in  an  unbounded  interval  may  be  reduced  by 
the  method  of  correspondence,  an  ordinary  inner  limiting  set  is  the  set  of 
points  common  to  a  sequence  {An|,  where  AM  is,  for  each  value  of  n,  a  set  of 
open  intervals  contained  in  (a,  b). 

In  the  particular  case  in  which  all  the  sets  An  are  everywhere  dense  in 
(a,  b),  the  complementary  closed  sets  (7(An)  are  all  non-dense.  In  this  case 
the  ordinary  inner  limiting  set  is  of  the  second  category,  and  is  of  that  species 
which  is  termed  (§  95)  a  residual  set. 

The  case  of  linear  sets  will  here  be  considered  in  detail. 

The  following  theorem  will  be  established  : 

Every  ordinary  (linear}  limiting  set  is  either  enumerable,  or  it  has  the 
power  of  the  continuum. 

*  The  term  inner  limiting  set  is  due  to  W.  H.  Young,  who  has  investigated  the  properties  of 
such  sets;  see  Leipz.  Ber.,  1903;  for  further  properties  see  also  his  papers  in  the  Proc.  Loud. 
Math.  Soc.  (2),  vol.  i,  p.  212,  and  (2)  vol.  in,  p.  372,  where  the  theory  is  extended  to  ^-dimen- 
sional Bets. 
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If  the  sets  An  are  all  everywhere  dense  in  (a,  6),  the  inner  limiting  set, 
being  a  residual  set  (see  §  100),  has  the  cardinal  number  of  the  continuum, 
in  accordance  with  the  theorem  of  §  95. 

Next,  let  us  suppose  that  the  sets  {An{  are  not  all  of  them  everywhere 
dense  in  (a,  6) ;  and  let  us  suppose  further  that  the  inner  limiting  set  contains 
a  part  H  which  is  dense  in  itself,  so  that  H'  is  perfect.  The  set  H'  may  be 
placed  in  correspondence  with  points  of  the  interval  (0,  1),  so  that  the  order 
of  corresponding  points  in  H'  and  in  the  interval  (0,  1)  is  always  the  same. 
To  each  point  in  (0,  1)  there  corresponds  a  single  point  of  H',  except  that  the 
end-points  of  an  interval  contiguous  to  H'  correspond  to  a  single  point  in 
(0,  1).  The  points  of  H  correspond  to  the  points  of  a  set  Hl  everywhere 
dense  in  (0,  1);  and  those  intervals  of  An  that  contain  points  of  H  corre- 
spond to  a  set  of  intervals  An'  everywhere  dense  in  the  interval  (0,  1).  The 
set  H  therefore  corresponds  to  a  residual  set  in  (0,  1);  and  therefore  H 
has  the  cardinal  number  of  the  continuum. 

The  only  sets  of  points  which  contain  no  components  that  are  dense  in 
themselves  are  enumerable  sets,  and  therefore  the  cardinal  number  of  an 
ordinary  inner  limiting  set  is  c  or  a. 

It  has  thus  been  established  that : 

An  ordinary  inner  limiting  set  has  the  power  of  the  continuum,  if  it 
contains  a  set  that  is  dense  in  itself ;  and  if  it  contains  no  such  component  it  is 
enumerable. 

An  enumerable  set  which  contains  a  component  that  is  dense  in  itself  cannot 
be  an  ordinary  inner  limiting  set. 

98.  If  any  linear  set  G  is  given,  we  may  suppose  a  point  x,  of  G,  to  be  in  the 
interior  of  each  interval  of  a  sequence  ^  (x),  8.z(x),  ...  $„(#), ...  of  intervals,  each 
one  of  which  contains  the  next ;  and  this  will  be  the  case  for  each  point  x  in 
G.  Let  dn  be  the  upper  boundary  of  the  lengths  of  the  intervals  Sn  (x),  taken 
for  every  point  x  in  G ;  and  suppose  the  intervals  so  determined  that  the 
numbers  dly  d2,  ...  dn,  ...  form  a  sequence  which  converges  to  zero.  Let  A,, 
be  the  set  of  open  non-overlapping  intervals  which  have  the  same  interior 
points  as  the  set  of  intervals  {§„(#)}.  We  have  then  a  sequence  of  sets 
At,  A2,  ...  An, ...  of  open  intervals,  which  will  define  an  ordinary  inner  limiting 
set,  to  which  all  the  points  of  G  must  belong;  but  it  may  also  include 
limiting  points  of  G  that  do  not  belong  to  G.  For  it  is  clear  that  any  point 
of  the  inner  limiting  set  is  at  a  distance  from  some  point  of  Gn  that  does  not 
exceed  dn ;  hence  any  point  of  the  inner  limiting  set  which  is  not  a  point  of 
Gn  is  the  limiting  point  of  a  sequence  of  points  of  6rw,  since  dn  ~»  0,  as  n  ~  oo  . 

The  set  of  points  of  G'  which  do  not  belong  to  G,  but  which  belong  to  the 
inner  limiting  set  defined  by  the  sequence  {Anj,  will  depend  in  general  on 
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the  mode  in  which  the  intervals  of  the  sets  An  are  defined.  In  order  that  the 
given  set  G  may  be  an  ordinary  inner  limiting  set  it  must  be  possible  so  to 
define  the  sets  An  that  no  point  G'  that  does  not  belong  to  G  is  in  An  for 
every  value  of  n. 

Some  criteria  will  be  here  given  for  deciding  in  respect  of  a  given  set 
whether  this  is  possible  or  not,  that  is,  whether  the  set  is  an  ordinary  inner 
limiting  set  or  not. 

A  point  which  is  in  Am,  but  not  in  the  sets  ATO+1,  Am+2,  ...,  is  said  to  be 
shed  from  the  sequence  {An}  at  the  index  ra. 

If  all  the  intervals  {8n(#)}  be  taken  to  be  of  equal  length  2cn,  with  the 
point  #  in  the  centre  of  Bn(ac},  where  cn~0,  as  n**'X> ,  then  every  limiting 
point  of  G  belongs  to  the  ordinary  inner  limiting  set  defined  by  the  sequence 
of  intervals. 

For,  however  small  cn  may  be,  there  are  points  of  G  whose  distance  from 
a  limiting  point  p  is  less  than  c,, . 

EXAMPLE. 

The  following  example,  given  by  Borel*,  drew  attention  to  the  fact  that  a  sequence  of 
sets  of  intervals  constructed  as  above  may  define  an  inner  limiting  set  that  contains  points 
other  than  those  of  the  given  set. 

V) 

Let  us  suppose  that  each  rational  point  -  in  the  interval  (0, 1)  is  enclosed  in  the  interval 

(*- .,    —+  -5) ,  where  X  has  the  same  value  for  all  the  points.    In  this  manner  the 

\q      f      q      q*J 

rational  points  are  enclosed  in  a  set  of  overlapping  intervals,  whose  sum  is  less  than 

2  1 

X2  (q  -  1)  -3 ,  or  than  2X2  — 2 ,  which  can  be  made  as  small  as  we  please  by  choosing  X  small 

enough.  The  equivalent  set  of  non-overlapping  intervals  defines,  by  means  of  the  end-points 
and  their  limits,  a  closed  set  {(/J,  such  that  for  any  point  of  the  set 


-5,  for  all 


points  —  . 
2 

Now  consider  the  set  of  points  defined  by 


where  each  a  is  ^  9,  and  the  a's  are  such  that  an  infinite  number  of  them  are  different  from 
xero.    It  has  been  shewn  by  Liouville  that  these  numbers  x  are  transcendental.    Let 


q      1"       -  q»       q 

L,ccsons  Kur  In  theorie  des  fonctioti-s,  p.  44. 
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If  follows  that,  if  x  is  one  of  the  above  transcendental  numbers,  whatever  value  X 

may  have,  it  is  interior  to  an  interval  (---s»   ^  +  ^j  •    For  «uppose  £  =  10"';    then 

£  -  x    <  \  <  ^ ,  provided  X  >  ^rn^s^]  •  an(*>  h°wever  small  X  may  be,  values  of  n  can  be 

f) 

found  for  which  this  inequality  is  satisfied.  Therefore  rational  points  *-  can  l>e  found,  how- 
ever small  X  may  be,  such  that  x  lies  within  the  intervals  (*-  --,,-  +  -,}  •  It  thus 

W      ?       1     Ti 

appears  that,  besides  the  original  points  -  which  the  intervals  are  drawn  to  enclose,  there 
are  other  points  which  lie  inside  the  intervals,  for  all  values  of  X,  when  X  is  diminished  in- 
definitely. 

99.  Those  limiting  points  of  an  enumerable  set  of  points  P  that  do  not 
belong  to  P  form  an  ordinary  inner  limiting  set. 

Let  pi,  p.,,  ...  pn,  ...  denote  the  points  of  P,  and  let  H denote  the  set  of 
those  limiting  points  of  P  which  do  not  belong  to  P. 

The  points  of  H  can  be  enclosed  in  a  set  Ax  of  open  intervals  that  do 
not  include  the  point  p^  Interior  to  A1}  a  set  of  open  intervals  A2  can  be 
defined  which  include  all  the  points  of  H,  but  not  the  point  p2 ;  and  so  on. 
The  sequence  Aj,  A2,  ...  has  for  its  inner  limiting  set  all  the  points  of  H, 
but  none  of  the  points  plt  p2,  ...  ;  and  H  can  have  no  limiting  points 
that  do  not  belong  either  to  itself  or  to  P.  Hence  H  is  an  ordinary  inner 
limiting  set. 

Every  isolated  set  is  an  ordinary  inner  limiting  set. 

For  each  point  x,  of  an  isolated  set,  may  be  enclosed  in  an  interval  of  a 
sequence  81(x},  S!t(x),  ...  Sn(ac),  ...,  each  interval  containing  the  next,  and  so 
that  8W(#)  converges  to  zero  ;  and  such  that  8^  (x)  contains  no  point  of  the  given 
set  other  than  x.  The  set  A,,  may  then  consist  of  the  intervals  &n  (x)  for  every 
point  x  of  the  set.  It  is  then  clear  that  the  only  points  interior  to  AB,  for 
every  value  of  n,  are  the  points  of  the  given  isolated  set,  which  is  therefore 
an  ordinary  inner  limiting  set. 

Every  closed  set  is  an  ordinary  inner  limiting  set. 

Enclose  each  point  P,  of  a  closed  set  G,  in  an  interval  of  length  2/?,  with  P 
as  its  middle  point ;  any  part  of  such  an  interval  that  is  not  in  (a,  b)  may  be 
disregarded.  We  have  thus  a  set  of  overlapping  intervals  which  contain 
all  the  points  of  G  within  them.  ^Consider  the  equivalent  set  of  non- 
overlapping  intervals  (§  71).  We  have  thus  a  set  A,  of  non- overlapping 
intervals,  each  of  which  is  of  length  =  p  ;  and  therefore  A  must  be  a  finite  set, 
which  contains  all  the  points  of  G.  Let  p  have  all  the  values  in  a  sequence  of 
diminishing  numbers  that  converges  to  zero ;  the  corresponding  sequence  of 
finite  sets  of  intervals  A1;  A2, ...  A,, ,  ...  is  such  that  each  contains  the  next,  and 
every  interval  includes  points  of  G.  The  inner  limiting  set  which  they  define 
must  be  the  set  G,  since  G  contains  all  its  limiting  points. 
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100.  The  necessary  and  sufficient  condition  that  an  enumerable  set  P  should 
be  an  ordinary  inner  limiting  set  is  that  P  contains  no  component  that  is  dense 
in  itself, 

That  the  condition  is  necessary  has  been  shewn  above,  since  every  inner 
limiting  set  that  has  a  component  that  is  dense  in  itself  has  the  power  of  the 
continuum. 

To  prove  the  sufficiency  of  the  theorem,  we  employ  the  mode  of  analysis 
of  a  set,  given  in  §  91,  into  adherences  and  coherences. 

Any  enumerable  set  P  can  be  resolved  into  a  sum  of  sets 

PI+PI+...+PP+QP, 

where  P1;  P2,  ...  Pp  are  all  isolated  sets,  and  Qp  is  a  component  of  a  perfect 
set  Gp ;  fi  denotes  a  number  of  the  first,  or  of  the  second,  class.  When  P 
contains  no  component  that  is  dense  in  itself,  the  decomposition  of  P  ceases 
at,  or  before,  some  definite  number  a  of  the  first,  or  of  the  second,  class, 
when  there  is  no  set  Ga. 

It  has  thus  been  shewn  that,  when  P  contains  no  component  that  is  dense 
in  itself,  it  can  be  resolved  into  a  finite,  or  enumerably  infinite,  set  of  inner 
limiting  sets,  of  which  there  may,  or  may  not,  be  a  last  set.  Let  Py  be  one 
of  the  components  into  which  P  has  been  resolved,  7  denoting  a  number  of 
the  first  or  second  class.  We  now  fix  on  a  sequence  of  sets  of  intervals  enclosing 
the  points  of  Py,  such  that  all  the  intervals  are  interior  to  the  intervals 
complementary  to  Gy;  then  the  set  Py+1  +  Py+2+  ...,  which  is  contained  in  Gy, 
has  no  limiting  points  in  any  of  the  intervals  which  enclose  the  points  of  Py, 
for  all  its  limiting  points  must  be  in  Gy.  The  sequence  of  sets  of  intervals 
having  thus  been  fixed  for  every  Py,  we  can  now  shew  that  each  limiting  point 
p,  of  P,  which  does  not  belong  to  P,  is  shed  from  the  whole  sequence  of  sets  of 
intervals,  at  a  definite  index.  The  point  p  is  either  a  limiting  point  of  P1} 
belonging  to  Lly  the  set  of  the  isolated  points  of  P',  or  is  contained  in  GV  In 
the  former  case  it  is  shed  from  the  intervals  enclosing  Pl  at  a  definite  index; 
and,  not  being  a  limiting  point  of  P.2  +  P3  +  ...,  it  is  shed  from  the  intervals 
enclosing  the  points  of  that  set,  at  a  definite  index ;  consequently  it  is  shed 
from  the  intervals  enclosing  P,  at  a  definite  index,  the  greater  of  the  two  former 
ones.  In  the  latter  case,  unless  p  is  in  G2  or  in  P2',  it  is  not  a  limiting  point  of 
P2  +  Ps  +  . . . ,  and  does  not  come  into  any  of  the  intervals  enclosing  the  points  of 
PI;  it  is  therefore  shed  at  a  definite  index.  If  p  belongs  to  Gl}  6r2,  ...  and  to 
every  G  before  Ga ,  but  is  not  in  Ga ,  it  may  be  a  point  of  P0'.  In  that  case  it  is 
not  a  limiting  point  of  the  set  Pa+1  +  Pa+->+  ••-,  and  does  not  come  into  the 
interior  of  any  of  the  intervals  which  enclose  the  points  of  P1;  P2,  ...,  or  any 
P  with  index  less  than  a.  It  is  therefore  shed,  at  a  definite  index,  from  the 
sequence  of  sets  of  intervals  enclosing  the  points  of  P.  The  theorem  has 
thus  been  established. 
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A  corollary  to  the  above  proof  is  that  every  enumerable  set  is  the  sum  of 
an  ordinary  inner  limiting  set,  and  of  a  set  which  is  dense  in  itself. 

In  particular*  it  follows  that : 

Every  reducible  set  is  an  ordinary  inner  limiting  set. 

For  the  derivative  of  a  reducible  set  being  enumerable,  the  given  set  can 
contain  no  component  that  is  dense  in  itself,  since  the  derivative  of  such 
component  would  be  perfect,  which  is  impossible  when  the  derivative  is 
enumerable. 

Since  an  ordinary  inner  limiting  set  that  is  everywhere  dense  consists  of  the 
points  that  are  common  to  a  sequence  {AMj  of  sets  An,  of  non-overlapping  open 
intervals  which  are  everywhere  dense,  and  since  the  points  that  do  not  belong 
to  An  form  a  non-dense  closed  set,  it  follows  that  an  everywhere  dense  ordinary 
inner  limiting  set  is  a  residual  set  (§  95);  the  complementary  set,  of  the  first 
category,  being  the  outer  limiting  set  of  a  sequence  of  non-dense  closed  sets. 

101.  Any  unenumerable  set  can,  in  accordance  with  the  result  of  §91,  be 
expressed  in  the  form  P  =  U  +  Vn  +  2  Vx  +  Ve ;  and  we  observe  that,  if  Vc  is 
absent,  the  necessary  and  sufficient  conditions  that  P  may  be  an  inner  limiting 
set  are  that  Va  and  2  Vx  should  both  be  absent ;  this  follows  from  the  preceding 
results. 

If  Vc  exists,  we  observe  that  no  point  of  U  +  Va  +  ^Vx  can  be  a  limiting 
point  of  Vc ;  for  any  limiting  point  of  Vc  must  be  a  point  of  degree  c  in  the 
set  P.  If  Vc  is  everywhere  dense  in  (a,  6),  it  follows  that  U  +  Va  +  ^Vx  is 
absent.  The  set  Vc  may  be  non-dense  in  (a,  6),  or  it  may  be  dense  in  some 
parts  of  (a,  6),  and  non-dense  in  other  parts. 

It  will  be  shewn  that  Vc  is  in  general  made  up  of  a  part  which  is  non-dense 
in  (a,  b),  and  of  a  finite,  or  indefinitely  great,  number  of  parts  each  of  which 
is  everywhere  dense  in  a  particular  interval  in  which  ib  lies.  Suppose  that  an 
interval  (a,  /3)  can  be  found  in  which  Vc  is  everywhere  dense ;  and  let  a;  be  a 
point  in  (a,  b)  such  that  x  ^  @.  Then  those  values  of  x  for  which  Vc  is  every- 
where dense  in  (a,  x),  together  with  those  values  for  which  this  is  not  the  case, 
define  a  section  of  all  the  numbers  of  the  continuum  (/3,  b) ;  and  this  section 
defines  a  number  /S,  ^  /3.  Similarly,  we  may  assign  a  number  «j  ^  a,  so  that 
(ai>  @i)  is  the  greatest  interval  containing  (a,  /3)  which  is  such  that  Vc  is 
everywhere  dense  in  it.  If,  in  the  parts  of  (a,  6)  external  to  (a1,  #,),  the  set 
Vc  is  dense  in  any  interval,  then  we  proceed  to  fix  the  greatest  interval  for 
which  it  is  everywhere  dense.  In  this  manner  we  obtain  a  finite,  or  enumer- 
ably  infinite,  set  of  detached  intervals  contained  in  (a,  b),  in  each  of  which  Vc 
is  everywhere  dense ;  and  the  remainder  of  (a,  6)  may  consist  of  a  set  of 
detached  intervals  and  a  set  of  points.  In  this  remainder  the  points  of  Vc 
form  a  non-dense  set. 

*  See  Hobson,  Proc.  Land.  Math.  Soc.  (2),  vol.  n,  p.  316.  Another  proof  of  the  general  theorem, 
applicable  to  any  number  of  dimensions,  has  been  given  by  Brouwer,  Proc.  Roy.  Soc.  Amsterdam, 
vol.  xvm,  p.  48.  See  also  vol.  xx  for  further  information  on  the  subject. 
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No  point  of  U  +  Va  +  SF^  can  be  in  an  interval  (a1}  &)  in  which  Ve  is 
everywhere  dense.  If  Vc  is  the  part  of  Vc  which  is  non-dense  in  (a,  6),  every 
point  of  U  +  Fa  +  XF,,;  must  lie  in  one  of  the  intervals  complementary  to  the 
perfect  set  Fc'.  It  is  to  be  observed  that  in  Fc  are  included  the  end-points  of 
the  intervals  (al,  /3a),  in  case  those  end-points  belong  to  Fc. 

In  order  that  P  may  be  an  inner  limiting  set,  it  is  necessary  that  the  part 
of  U  4-  Va  +  2  Vx  Avhich  is  in  each  interval  complementary  to  F/  should  be  an 
inner  limiting  set  ;  and  this  cannot  be  the  case  unless  Fa  and  £  Vx  are  absent. 

It  has  thus  been  shewn  that  : 

In  order  that  an  unenumerable  set  of  points  may  be  an  inner  limiting  set,  it 
is  necessary  that  the  set  should  contain  no  points  whose  degrees  in  the  set  are 
other  than  0,  a,  or  c,  and  that  it  should  contain  no  component  which  is  dense 
in  itself,  and  whose  points  are  of  degree  a  in  the  set. 

The  determination  of  the  necessary  and  sufficient  conditions  that  any  given 
unenumerable  set  of  points,  however  defined,  may  be  an  ordinary  inner  limiting 
set,  has  now  been  reduced  to  the  problem  of  determining  the  criteria  for  the 
case  of  a  set  which  is  dense  in  itself,  and  all  the  points  of  which  are  of  degree  c 
in  the  set.  The  case  in  which  the  latter  set  is  non-dense  in  its  domain  may 
be  reduced,  by  the  method  of  correspondence,  to  that  in  which  it  is  everywhere 
dense;  and  the  problem  is  therefore  reducible  to  that  of  determining  the 
conditions  under  which  a  given  everywhere  dense  set  of  points,  all  of  degree  c 
in  the  set,  is  a  residual  set,  of  the  kind  described  in  §  100.  No  investigation 
of  all  the  possible  types  of  such  sets  has  yet  been  carried  out,  and  there- 
fore the  problem  remains  as  yet  unsolved. 

PLANE  SETS  OF  POINTS. 

102.  It  has  already  been  shewn  in  §  85,  in  the  case  of  closed  sets,  that  the 
descriptive  properties  of  sets  of  points  in  two  or  more  dimensions  are  in  some 
respects  less  simple  than  those  of  linear  sets.  The  properties  of  plane  sets  of 
points  are  of  special  importance  in  the  theory  of  functions  of  a  complex  variable, 
but  they  do  not  differ  in  any  essential  respect  from  the  properties  of  sets  in 
three  or  more  dimensions.  The  definitions  of  closed  sets,  open  sets,  frontiers, 
etc.,  given  in  §  55,  for  such  sets,  are  identical  with  those  for  linear  sets,  but  an 
account  will  here  be  given  of  the  most  important  descriptive  properties  of  sets 
in  more  dimensions  than  one.  Although  the  proofs  of  the  theorems  are,  for 
simplicity  of  language,  given  for  plane  sets  only,  they  may  be  extended  to  the 
case  of  sets  in  more  dimensions  than  two,  without  material  alteration. 

Each  point  (x,  y),  of  a  plane  set,  is  defined  by  the  two  real  numbers  which 
are  the  rectangular  Cartesian  coordinates  of  the  point.  As  has  already  been 
pointed  out  in  §  53,  a  correspondence  may  be  established  between  the  points 
in  a  finite  cell,  or  rectangle,  and  the  points  in  unbounded  space,  of  such  a 
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character  that  the  descriptive  properties  of  sets  of  points  are  unaltered  by  the 
transformation ;  a  special  convention  being  introduced  concerning  the  points 
on  the  boundary  of  the  finite  rectangle,  or  cell.  It  makes  no  essential  difference, 
in  the  properties  which  will  here  be  considered,  whether  the  closed  or  open 
sets  are  considered  as  bounded,  that  is  as  contained  in  a  finite  open,  or 
closed,  rectangle,  or  whether  they  are  in  the  unbounded  plane  space;  provided 
that,  when  necessary,  an  adjunct  boundary  at  infinity  is  postulated  which 
corresponds  to  the  boundary  of  a  finite  rectangle. 

103.  The  frontier  of  a  set  of  points  G  has  been  defined,  in  §  55,  as  the  set 
of  points  each  of  which  belongs  to  one  of  the  sets  G,  C(G),  and  is  a  limiting 
point  of  the  other  set.    It  will  be  shewn*  that : 

If  the  complementary  set  of  G  exists,  then  the  frontier  of  G  andC(G}  exists, 
and  is  a  closed  set. 

Let  P  be  any  point  of  G,  and  P'  a  point  of  C(G),  and  consider  those  points 
of'  G  that  are  on  the  linear  segment  PP',  i.e.  those  points  whose  coordinates 

are   -    — r-,  ^ jr-,  where  P  is  (#,  y\  and  P  is  (x,  y'),  and  &  is  a  positive 

1  -f~  K  J-  ~}~  rC 

number,  or  zero.  The  linear  set  of  points  of  G,  on  PP',  has,  in  accordance  with 
the  theorem  of  §  47,  an  upper  boundary  Q.  This  point  Q,  which  may  coincide 
with  P,  is  a  point  of  the  frontier  of  G  and  C(G) ;  for  if  Q  is  a  point  of  G,  it  is 
also  a  limiting  point  of  C  (G),  and  if  it  is  a  point  of  C(G),  it  is  a  limiting  point 
of  G.  Therefore,  if  C(G)  exists,  there  is  always  a  frontier  of  G  and  C  (G). 
Moreover,  let  Qlt  Q2,  ...  Qn,  ...  be  a  convergent  sequence  of  points  of  this 
frontier.  Denoting  by  Q  the  limiting  point  of  the  sequence,  Q  is  itself  a  point 
of  the  frontier;  for  in  the  set  {Qn}  there  is  contained  a  convergent  sequence 
of  points  all  of  which  belong  to  G,  or  else  such  a  sequence  of  which  all  points 
belong  to  C  (G).  If  these  points  all  belong  to  C  (G),  and  consequently  to  G', 
then  Q  belongs  to  the  closed  sets  G'  and  {C  (G)}'',  if  they  all  belong  to  G,  and 
consequently  to  {C(G)}\  then  Q  belongs  to  G'  and  to  {C (G)}'.  In  either  case  Q 
is  a  point  of  the  frontier ;  and  thus,  since  every  limiting  point  of  the  frontier 
belongs  to  it,  the  frontier  is  a  closed  set. 

If  all  points  of  G  belong  to  its  frontier,  G  has  no  interior  points. 

104.  The  distance f  of  two  points  P,  or  (x,  y},  and  Q,  or  (x',  y'),  has  been 
defined  in  §  50  to  be  the  number  {(x  —  x'}2  +  (y  —  y'}z}~\  and  this  may  be  de- 
noted by  PP'. 

If  P  is  a  point  of  a  set  G1}  and  P'  a  point  of  another  set  G2,  the  distances 
PP,  when  every  such  pair  of  points  is  contemplated,  form  an  aggregate  of 

*  Jordan,  Cours  d' Analyse,  vol.  i,  p.  20. 

t  Instead  of  the  distance  so  defined,  Jordan  employs  in  this  connection  the  "  (.'cart,''  defined 
by  !  x  -  x'  \  +    >j  -  y'  \ .     This  makes  no  difference  in  the  developments. 
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numbers  which  have  a  lower  boundary.  In  case  this  lower  boundary  is  a 
positive  number  d(Glt  G2)  (  >  0),  the  sets  G^  and  G2  are  said  to  be  detached 
from  one  another.  The  number  d(G1}  G2)  is  said  to  be  the  distance  between 
the  two  sets  Gl}  G%. 

If  one  set  (r2  consists  of  a  single  point  p,  d  (p,  G^,  or  simply  d,  is  said  to 
be  the  distance  of  the  point  p  from  the  set  G^ . 

If  two  closed  sets  Gl}  G2  are  detached  from  one  another,  there  exists  at  least 
one  pair  of  points  P,  P/,  belonging  respectively  to  the  two  sets,  such  that  the 
distance  PP'  is  equal  to  the  distance  of  the  sets  from  one  another. 

Let  d  denote  the  distance  d(Gl}  G2),  of  the  closed  sets  from  one  another; 
and  let  e1;  e2,  ...  en,  ...  be  a  sequence  of  decreasing  positive  numbers  that 
converges  to  zero.  For  each  value  of  n,  a  pair  of  points  Pn,  or  (xn,  yn),  and  Pn', 
or  (xnr,  y^},  belonging  to  Glt  G2  respectively,  can  be  determined  such  that 
PnPn'2<d2  +  en.  A  unique  point  pn,  or  (xn,  yn,  xn,  yn'),  exists  in  the  four- 
dimensional  continuum,  corresponding  to  each  pair  of  points  Pn,  Pn'.  The  set 
of  points  pi,  pz,  ...  pn,  •"  has  at  least  one  limiting  point  (x,  y,  x',  y') ;  let  P,  P' 
denote  the  two  points  (x, y},  (x,  y'}  in  the  plane  space.  It  will  be  shewn  that 
P  and  P'  belong  to  GI  ,  G2  respectively,  and  that  PP'  is  equal  to  d.  An  integer 
m  can  be  so  determined  that  x  —  xn,  y  —  yn,  x'  —  xn',  y'  —  yn'  are  all  numerically 
less  than  an  arbitrarily  chosen  positive  number  77,  provided  n  =  m.  It  follows 
that  P  is  a  limiting  point  of  the  sequence  {Pn},  and  that  P'  is  a  limiting  point 
of  the  sequence  {P/};  and  thus  that  P  and  P'  belong  to  G1}  G2  respectively, 
since  these  sets  are  closed.  We  have,  further, 

CC  —  OC      =  ]  *K       **"n  i    i      ™n  """"  ™n       '    j  *^n   "~~  ^      ==  ^^]    *      *^n  ~~~  *^ii     > 

and  similarly  y  —  y'  ^^  +  \yn  —  yn'\,  for  n  =  m.  From  these  inequalities  we 
see  that  (x  —  x')2  +  (y  —  y'J1  <  8?72  +  Ay  +  PnPn'2,  where  A  is  some  fixed  number; 
hence  PP'2  <  8772  +  Ay  +  d2  +  en.  Since  77  and  ew  are  both  arbitrarily  small,  it 
follows  that  PP'2  ^  d2,  and  thus  that  PP'2,  which  is  certainly  not  less  than  d2, 
must  be  equal  to  d2.  The  theorem  has  now  been  established.  For  the  choice*  of 
the  infinite  set  of  pairs  of  points  Pn,Pn,  the  multiplicative  axiom  is  required. 

105.  A  closed  set  of  points  is  said  to  be  connex,  or  single  sheeted,  when  it 
is  not  the  sum  of  two  or  more  closed  sets. 

It  should  be  observed  that  the  expression  "  connex  "  has  been  employed 
in  §  41  in  a  somewhat  different  sense. 

A  connex  closed  set  which  does  not  consist  of  a  swgle  point  is  a  perfect  set. 

For  an  isolated  point  of  a  closed  set  G  can  be  considered  as  a  set  detached 
from  the  closed  set  consisting  of  all  the  remaining  points  of  G ;  and  hence,  if 
an  isolated  point  existed,  G  could  not  be  connex. 

If  P,  P'  are  any  two  points  of  a  connex  perfect  set  G,  and  e  be  any  assigned 
positive  number,  a  finite  number  of  points  p^,  p.2,  ...  pn,  of  the  set,  can  be  so  de- 

*  See  Chap,  iv,  §  l'J7  ct  teq. 
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termined  that  Pplt  p,pa,  p.2pa,  ...  pn^pn>  pnF  are  all  £  e.  Conversely,  if  this 
condition  holds  for  each  pair  of  points  P,  P'  of  G,  and  for  every  positive  number 
e  (n  depending  on  e),  then  G  is  connex. 

The  condition  stated  in  the  theorem  is  sufficient  to  ensure  the  connexity 
of  0.  For  if  G  be  the  sum  of  two  closed  sets  Glt  Gz,  the  distance  between 
which  is  d  (>  0),  we  may  choose  €  to  be  <  d.  If  P  is  a  point  of  (•?,,  and  pl  is  a 
point  of  G  such  that  Ppl  £  e,  the  point  pl  belongs  to  Gl ;  again  if  pt  be  a  point 
of  G  such  that  plpz  £  e,  the  point  p2  belongs  to  Gl ;  and  so  on.  Since  pn  belongs 
to  (TI,  whatever  finite  value  n  may  have,  it  is  impossible  that  pnP'  =  e,  because 
pnP'  =  d.  Again,  the  condition  is  a  necessary  one.  For,  let  us  suppose  that, 
for  some  value  of  e,  the  condition  is  not  satisfied  for  every  pair  of  points.  If 
P  belong  to  such  a  pair,  the  set  G  may  be  divided  into  two  parts  Gl  and  Gz ; 
where  Gl  is  such  that,  for  each  point  P'  belonging  to  it,  a  definite  set  of  points 
of  Glt  viz.  plt  p2,  ...pn,  exists,  such  that  Pplt  p^,  ...  pnP'  are  all  £  e;  and  6r2 
is  such  that,  for  each  point  of  it,  this  condition  is  not  satisfied.  It  can  be 
shewn  that  Gl}  G2  are  both  closed  sets,  and  that  the  distance  between  them 
is  >  e.  For,  if  p  is  a  limiting  point  of  6r1}  it  belongs  either  to  Gl  or  to  G2;  and 
since  there  are  points  pn>  of  Gl}  such  that  ppn<  e,  the  point  p  clearly  belongs 
to  Gl ;  therefore  Gl  is  a  closed  set.  Again,  if  q  is  a  limiting  point  of  G2,  it 
cannot  belong  to  G^  for  a  point  P',  of  G.2>  can  be  found  such  that  qP'  <  e; 
hence,  if  q  belonged  to  Gl  so  also  would  P' ;  thus  Gz  is  closed.  It  is  clear  that 
no  pair  of  points  of  Gl}  G2  can  exist,  of  which  the  distance  is  ^  e;  hence,  for 
these  sets,  d  >  e.  It  has  thus  been  shewn  that,  if  for  any  value  of  e  the  con- 
dition in  the  theorem  is  not  satisfied,  G  can  be  divided  into  two  detached 
closed  sets,  and  it  is  therefore  not  connex. 

106.  An  open  set,  defined  in  §  55,  as  one  such  that  every  point  is  an 
interior  point,  is  also  frequently  called  an  open  domain. 

When  the  open  domain  is  such  that  any  two  points  P,  P'  of  it  may  be 
joined  by  means  of  a  finite  set  of  linear  intervals  Ppi,  Pip2,  •••  pnP'>  such  that 
all  the  points  of  all  these  closed  intervals  belong  to  the  domain,  it  is  said  to 
be  a  connex  open  domain. 

A  connex  open  domain  is  also  called  aWeierstrassian  domain,  or  a  continuum. 
It  may  be  either  bounded  or  unbounded.  The  terms  domain  and  region  are 
also  used  to  denote  a  connex  open  domain  together  with  some,  or  all,  of  its 
frontier  points. 

If,  to  a  connex  open  domain,  all  its  frontier  points  be  added,  it  becomes  a 
closed  set,  connex  in  accordance  with  the  definition  of  §  105.  But  a  closed 
connex  set  does  not  necessarily  become  a  single  connex  open  domain  when  all 
its  frontier  points  arc  removed  from  it.  Thus,  for  example,  the  set  of  all  the 
points  interior  to,  or  on  the  circumferences  of,  two  circles  that  touch  each 
other  externally,  is  a  closed  set,  but  the  interior  points  of  the  circle's  do. not 
form  a  single  connex  open  set,  but  two  such  sets. 
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Each  point  of  an  open  domain  is  such  that  a  circle  exists  with  the  point 
as  centre,  and  with  all  its  interior  points  in  the  domain,  but  with  one  or  more 
points  of  the  frontier  on  its  circumference. 

For  the  point  P  has  a  definite  distance  dp  from  the  closed  frontier  of  the 
open  domain,  and  in  virtue  of  the  theorem  of  §  104  there  is  one  point  at  least 
of  the  frontier  on  the  circumference  of  the  circle  with  centre  P  and  radius  dp. 

Every  open  set  of  points  is  the  sum  of  a  finite,  or  an  enumerably  infinite,  set 
of  connex  open  domains. 

If  P  be  any  point  of  the  given  open  set  0,  let  the  component  Olt  of  0,  be 
such  that  every  point  Q,  of  01}  satisfies  the  condition  that  P  and  Q  can  be 
joined  by  a  set  Pplt  p\pi,  --^pnQ  of  straight  lines  of  which  all  the  points  are 
points  of  0.  The  set  Oj  is  clearly  an  open  set,  for  a  point  Q  of  it  has  a  neigh- 
bourhood, of  which  every  point  q  can  be  joined  in  the  above  manner  to 
P;  and  thus  Q  is  an  interior  point  of  Ol.  Therefore  any  point  P,  of  0,  is  a 
point  of  a  connex  open  domain  which  is  a  component  of  0.  It  will  be  shewn 
that  the  number  of  such  connex  domains  contained  in  0  is  enumerably  infinite, 
or  finite.  Apply  to  the  finite,  or  infinite,  rectangle,  in  which  G  is  contained,  a 
system  of  closed  nets  {Dn}.  There  is  a  smallest  integer  wls  such  that  Dth  has 
one  or  more  meshes  interior  to  0;  there  is  a  smallest  integer  w2  (>^i). 
such  that  those  meshes  of  DM]  which  are  not  interior  to  0  contain  one  or 
more  meshes  of  DM2  that  are  interior  to  0;  and  so  on.  We  obtain  in  this 
manner  a  sequence  of  sets  of  meshes  dni,  dn^,  ...  such  that  each  mesh  of  the 
sequence  is  interior  to  0,  and  such  that  no  mesh  of  the  sequence  contains  any 
other  such  mesh.  Every  such  component  of  0,  as  Olf  contains  meshes  of  the 
sequence ;  and  since  the  set  of  meshes  in  the  sequence  is  enumerable,  it 
follows  that  the  aggregate  of  all  components  of  0  that  consist  of  connex  open 
sets  is  enumerable,  whether  finite  or  not.  It  follows  from  this  theorem  that 
every  closed  set  of  points  has  for  its  complement,  relatively  to  a  finite,  or  infinite, 
cell,  a  finite,  or  an  enumerably  infinite,  set  of  connex  open  domains.  If  the  set 
of  connex  open  domains  is  everywhere  dense  in  a  closed  domain,  the  closed  set 
is  non-dense  in  that  domain. 

These  connex  open  domains  are  the  true  analogues,  for  plane  closed  sets,  of 
the  open  intervals  contiguous  to  a  linear  closed  set  of  points,  and  they  may 
therefore  be  said  to  be  the  open  connex  domains  contiguous  to  the  closed  set. 
In  §  86,  attention  has  been  confined  to  domains  of  a  particular  type,  viz.  cells, 
and  it  has  there  been  shewn  that  there  exists,  in  general,  no  unique  set  of 
cells  which  have  properties  in  relation  to  a  given  closed  set  fully  analogous  to 
the  properties  of  the  open  intervals  contiguous  to  a  linear  closed  set.  If,  to  a 
non-dense  closed  set,  there  be  added  all  the  interior  points  of  some  of  the 
contiguous  domains,  we  obtain  a  closed  set  of  the  most  general  type.  A  closed 
set  will  be  perfect,  if  no  two  of  the  contiguous  domains  abut  on  one  another, 
or  also  if  they  abut  on  one  another  in  such  a  way  that  the  points  common 
to  the  two  boundaries  form  a  perfect  set. 
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107.  A  bounded  connex  domain  together  with  its  frontier  points  forms  a 
connex  closed  set 

Since  the  frontier  is  closed,  it  is  easily  seen  that  the  set  formed  by  adding 
all  the  points  on  the  frontier  of  the  bounded  connex  domain  is  closed. 

If  P,  Q  are  both  interior  points  of  the  closed  set,  they  are  both  points  of 
the  bounded  connex  domain,  and  thus  satisfy  the  requisite  conditions  as  to 
the  mode  in  which  they  may  be  joined.  If  P,  Q  are  points  on  the  frontier, 
interior  points  P,  Q'  may  be  so  determined  that  PP,  QQ'  are  both  less  than 
e ;  and  then  P,  Q'  may  be  joined  by  a  series  of  straight  lines  Ppl ,  p^ ,  ...,pnQf 
each  of  length  less  than  e,  and  such  that  every  point  of  all  of  them  is  an  interior 
point  of  the  connex  domain.  It  thus  follows  that  P,  Q  may  be  joined  so  that 
the  requisite  condition  is  satisfied.  The  theorem  will  also  hold  good  for  an 
unbounded  domain,  provided  those  points  at  infinity  which  are  boundary  points 
are  adjoined. 

The  boundary  of  a  connex  domain,  although  closed,  is  not  necessarily  itself 
connex.  For  example,  if  the  connex  domain  consists  of  the  points  between  two 
concentric  circles,  the  boundary  consists  of  the  two  circumferences,  and  is  not 
connex. 

The  Heine-Borel  theorem  in  its  generalized  form,  proved  in  §  74,  has 
already  been  emancipated,  in  §  75,  from  the  condition  that  the  set  to  which  the 
points  of  the  closed  set  are  interior  is  necessarily  a  set  of  cells.  It  has  in  fact 
been  shewn  in  §  75  that  a  set  of  open  domains  may  be  employed  instead  of  a 
set  of  cells.  The  theorem  may  then  be  stated  in  the  following  form  : 

If  every  point  of  a  closed  set  G  belongs  to  one  or  more  of  the  connex  open 
domains  of  a  set  D  (not  necessarily  enumerable}  of  such  domains,  there  exists 
a  finite  set  of  connex  open  domains  belonging  to  D  such  that  every  point  of  G 
belongs  to  one  at  least  of  the  domains  of  the  finite  set. 

In  a  similar  manner  it  is  shewn  that,  in  the  last  theorem  of  §  75,  applicable 
to  any  set  of  points  G,  open  connex  domains  may  be  employed  instead  of  cells. 
We  thus  have  the  following  theorem : 

If  each  point  of  a  given  set  of  points  belongs  to  a  definite  connex  open  domain 
corresponding  to  that  point,  an  enumerable  set  of  these  connex  open  domains  can 
be  so  determined  that  all  the  points  of  the  set  belong  to  one  or  more  of  the 
domains  of  that  enumerable  set. 

If  the  enumerable  set  of  connex  open  domains  be  {Dn},  it  may  be  shewn 
that,  if  H  be  any  closed  set  contained  in  G,  all  the  points  of  //  belong  to  one 
or  more  of  the  domains  of  a  finite  set  which  is  a  component  of  [Dn\. 

For,  since  each  point  of  the  closed  set  is  interior  to  one  or  more  of  the 
domains  {Dn}>  the  result  follows  from  the  generalized  Heine-Borel  theorem. 

H.  10 
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108.  The  case  when  G  is  an  open  set  0,  in  which,  to  each  point  of 
0,  there  corresponds  a  domain  which  consists  of  the  interior  of  a  circle  with 
the  point  as  centre,  is  of  special  importance  in  connection  with  the  theory  of 
a  complex  variable. 

Let  dp  be  the  distance  of  the  point  P,  of  an  open  set  0,  from  the  frontier 
of  the  set.  To  the  point  P,  the  interior  of  the  circle  of  radius  dp,  and  centre  P, 
may  be  taken  to  correspond.  In  accordance  with  the  theorem  of  §  107,  every 
point  of  the  open  set  is  interior  to  one  at  least  of  an  enumerable  set  {Cn}  of 
these  circles.  Each  of  the  circles  has  on  its  circumference  one  or  more  points 
of  the  frontier  of  the  open  set.  If  the  radius  of  the  circle  with  centre  P  be 
BdP,  instead  of  dp,  where  6  is  some  fixed  number  (<  1),  an  enumerable  set  of 
these  circles  [Cn]  exists,  such  that  each  point  of  the  open  set  is  interior  to  one 
or  more  of  the  circles  [Cn],  and  all  the  points  on  the  circumferences  of  these 
circles  are  points  of  the  open  set  0. 

The  results  may  thus  be  stated  as  follows : 

//  0  be  any  open  set,  there  exists  (I)  an  enumerable  set  of  circles  {Cn},  such 
that  each  point  of  0  is  an  interior  point  of  one  at  least  of  the  circles,  and  such 
that  every  interior  point  of  the  circles  is  a  point  of  0,  and  the  circumference 
of  each  of  the  circles  contains  one  or  more  points  on  the  frontier  of  0 ;  and 
(2)  there  exists  an  enumerable  set  of  circles  {Cn},  such  that  each  point  of  0  is 
interior  to  one  or  more  of  the  circles,  and  every  point  on  the  circumference,  or 
interior  to,  any  of  the  circles,  is  a  point  of  0. 

If  0  consist  of  a  single  connex  open  domain,  any  two  points  P,  Q,  of  0,  can 
be  joined  by  means  of  a  finite  number  of  the  circles  of  the  set  {Cn},  or  also  by 
a  finite  number  of  the  circles  of  [Cn] ;  so  that  P,  Q  are  interior  to  circles  of  the 
finite  set,  and  each  circle  overlaps  the  next. 

For  P  and  Q  may  be  joined  by  a  finite  set  of  segments  of  straight  lines 
all  of  which  segments  are  points  of  0.  This  broken  line  joining  P  and  Q 
constitutes  a  closed  set  H,  which,  in  accordance  with  the  last  remark  in  §  107, 
is  such  that  each  point  of  H  is  interior  to  a  finite  set  of  circles  belonging 
either  to  {Gn},  or  to  [Cn\.  Whichever  of  these  sets  of  circles  be  employed,  we 
have  a  chain  of  circles  such  that  P  and  Q  are  each  interior  to  one  of  them,  and 
such  that  each  circle  of  the  chain  overlaps  the  next  one. 

109.  The  following  theorem  is  the  analogue,  for  a  plane  set  of  points,  of  the 
theorem  proved  in  §  71  for  an  open  linear  set. 

Having  given  a  set  of  overlapping  open  domains,  there  exists  a  set  of  non-over- 
lapping open  domains  such  tJiat  an  interior  point  of  a  domain,  of  either  set  is  an 
interior  point  of  a  domain  of  the  other  set. 

The  proof  of  the  theorem  is  the  same  as  that  in  §  71;  it  being  shewn  that 
every  point  interior  to  one  or  more  of  the  given  domains  is  an  interior  point 
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of  the  set  of  all  such  points.  The  set  of  all  points  interior  to  one  or  more  of 
the  given  domains  is  open,  and  consequently  consists  of  a  finite,  or  an  enu- 
merable set  of  connex  open  domains. 

Every  open  domain  0  is  the  sum  of  an  enumerable  set  of  closed  cells  which 
do  not  overlap,  though  any  two  of  them  may  have  a  portion  of  their  boundaries 
in  common. 

We  may  prove  the  theorem  for  the  case  in  which  the  open  domain  is 
bounded,  as  it  can  then  be  extended  to  the  case  of  an  unbounded  open  domain, 
by  the  method  of  correspondence. 

Apply  a  system  of  nets,  with  closed  meshes,  to  a  fundamental  cell  which 
contains  the  open  domain.  There  is  a  smallest  value  nlt  of  n,  such  that  Dni  has 
one  or  more  meshes  which,  being  completely  closed,  are  interior  to  0.  Let  these 
meshes,  arranged  in  order  of  rank,  be  denoted  by  DHl.  There  is  a  smallest 
value  ft2,  of  n,  (>?ii)  so  that  Dnt  contains  meshes  not  contained  in  Dni,  each  of 
which  is  interior  to  0.  Proceeding  in  this  manner,  we  have  a  sequence  of 
finite  sets  of  meshes  Dni,  Dnt, ...,  arranged  in  order,  such  that  each  mesh  is 
interior  to  0,  and  no  two  of  the  meshes  overlap  one  another. 

Any  point  of  0  iscontainedineachof  a  unique  sequence  of  meshes  dltdz,d3,... 
belonging  to  Dl}  D.z,  ...  respectively.  These  are  the  meshes  in  which  the 
point  would  have  been  contained  if  the  nets  had  been  half  closed  instead  of 
completely  closed.  For  all  sufficiently  large  values  of  n,  dn  is  interior  to  0. 
The  smallest  such  value  of  n  must  be  one  of  the  numbers  nlt  n^,  ...;  say  nPl. 
Thus  the  point  P,  of  0,  is  in  a  mesh  dnp  which  belongs  to  Dnp.  It  is  thus 
proved  that  all  the  points  of  0  are  points  of  the  sequence  Dni,  Dn^ 

An  open  set  is  the  outer  limiting  set  of  a  sequence  of  closed  sets. 

First  let  the  open  set  0  be  bounded,  and  let  H  be  its  boundary.  Let  d  be 
a  fixed  number,  and  consider  the  set  of  points  Gd  which  consists  of  all  points 
of  0  at  a  distance  from  H  that  is  =  d.  This  set  Gd,  which  certainly  exists 
provided  d  be  sufficiently  small,  is  a  closed  set.  For  let  {Pn}  be  a  convergent 
sequence  of  sets  of  points  of  Gd  that  has  P  for  its  limiting  point. 

The  distance  of  P  from  H  cannot  be  less  than  d,  for  if  it  were,  for  a 
sufficiently  great  value  of  n,  Pn  would  be  at  a  distance  from  H  less  than  d, 
which  is  not  the  case.  If  d  have  the  values  in  a  sequence  dl,d.2>  ...  dn, . . .  which 
steadily  converges  to  zero,  the  closed  sets  Gdl,  Gdt,  ...  Gdn,  ...  are  such  that 
each  is  contained  in  the  next,  and  any  point  of  0  belongs  to  all  of  the  closed 
sets,  from  and  after  some  value  of  n.  Thus  0  is  the  outer  limiting  set  of  the 
sequence  {Gdn}  of  closed  sots. 

In  case  the  open  set  is  unbounded,  the  theorem  may  be  deduced  from  the 
case  in  which  it  is  bounded.  In  this  case  the  closed  sets  may  have  parts 
of  their  frontiers  at  infinity. 

10—2 
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The  theorem  may  also  be  proved  by  considering  the  set  of  circles  {Gn}  such 
that  every  point  of  0  is  an  interior  point  of  one  or  more  of  the  circles,  and 
that  every  point  in,  or  on  the  circumference  of,  each  circle  belongs  to  0.  For 
we  may  consider  the  closed  set  M (Glt  G2,  ...  Gn)  =  Gn\  then  0  is  the  outer 
limiting  set  of  the  sequence  {On}  of  closed  sets. 

A  closed  set  is  the  inner  limiting  set  of  a  sequence  of  open  sets. 

This  is  at  once  deduced  from  the  last  theorem  by  considering  the  com- 
plementary sets. 

110.  Having  given  a  non- finite  family  of  sets  of  points,  there  exists  at  least 
one  point  such  that,  in  its  arbitrarily  small  neighbourhood,  there  are  points 
belonging  to  an  infinite  number  of  sets  of  the  given  family.    Such  a  point  may 
be  an  adjoined  point  at  infinity. 

In  the  case  in  which  each  of  the  sets  of  points  consists  of  a  single  point 
this  theorem  reduces  to  that  of  the  existence  of  the  limiting  point  of  a  set  of 
points ;  and  thus  the  theorem  may  be  considered  to  be  a  generalization  of  the 
fundamental  theorem  of  §  52.  " 

In  case  all  the  sets  of  the  family  are  contained  in  a  finite  fundamental 
cell,  the  theorem  is  proved  exactly  as  that  in  §  52.  A  system  of  nets  being 
applied  to  the  fundamental  cell,  a  sequence  of  meshes  dlt  d^,  ...  each  containing 
the  next,  and  belonging  to  the  successive  nets,  is  obtained,  and  each  of  which 
contains  points  belonging  to  an  infinite  number  of  the  sets  of  the  given  family. 
The  point  defined  by  the  sequence  [dn]  is  a  point  which  satisfies  the  condition 
of  the  theorem.  In  case  there  is  no  finite  fundamental  cell,  the  theorem  is 
proved  by  the  method  of  correspondence  with  a  family  of  sets  in  a  finite  cell. 
In  this  case  the  point  determined  may  be  an  adjoined  point  at  infinity. 

It  is  easily  seen  that  all  the  points  that  satisfy  the  condition  of  the  theorem 
form  a  closed  set ;  the  points  at  infinity,  if  any,  being  included  in  the  set. 

In  case  all  the  sets  are  open  domains,  we  have  the  following  theorem  : 

Having  given  a  non-finite  (enumerable  or  unenumerable)  family  of  open 
domains,  there  exists  at  least  one  point  such  that,  in  its  arbitrarily  small  neigh- 
bourhood, there  are  contained  domains  which  are  portions  of  an  infinite  number 
of  the  domains  of  the  given  family.  Such  a  point  may  be  an  adjoined  point  at 
infinity.  The  set  of  all  such  points  is  a  closed  set. 

111.  The  theorems  in  §  82  can  be  extended  to  the  case  of  a  set  of  two  (or 
more)  dimensions. 

Instead  of  intervals  <$i,82,  ...8M,  ---.each  containing  the  next,  and  con  verging 
to  the  point  p,  we  take  a  convergent  set  of  cells.  Instead  of  intervals  con- 
tiguous to  the  perfect,  or  the  closed,  set  Pn,  we  take  the  set  of  contiguous 
domains.  The  procedure  in  the  proofs  is  then  otherwise  unaltered. 
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In  case  the  sets  P2,  P3,  ...  PB,  ...  consist  of  the  successive  derivatives  of  a 
closed  set  P,,  if  a  set  P$  does  not  exist,  the  previous  set  P^_i  consists  of 
a  finite  set  of  points,  and  in  case  there  is  a  number  /9  such  that  P/j  =  Pp+i,  the 
set  Pft  is  perfect.  We  thus  obtain  a  proof  of  the  theorem  already  established 
in  §  89,  that  every  ^-dimensional  set  is  the  sum  of  a  perfect  set  and  an 
enumerable  set.  A  proof  has  been*  given  by  de  la  Vall6e  Poussin  of  the 
second  theorem  of  §  82,  applicable  to  closed  sets  in  any  number  of  dimensions, 
in  which  the  theorems  of  §  75  are  employed. 

112.  Every  set  of  points  can  be  expressed  as  the  sum  of  a  set  which  is 
dense  in  itself,  and  of  an  enumerable  set  which  is  non-dense  in  every 
perfect  set. 

In  the  first  place,  if  a  set  is  non-dense  with  respect  to  every  perfect  set, 
it  is  enumerable.  For,  if  a  set  is  not  enumerable,  there  is  a  set  of  points  of 
unenumerability  which  is  perfect,  and  contains  points  of  the  set  and,  in 
general,  others  not  in  the  set.  The  given  set  would  be'  everywhere  dense 
in  this  perfect  set,  contrary  to  the  hypothesis  that  it  is  non-dense  in  every 
perfect  set. 

Let  E  be  a  given  set ;  and  consider  a  point  p,  whether  belonging  to  E  or 
not,  which  is  such  that,  in  every  neighbourhood  of  p,  there  is  a  perfect  set 
with  respect  to  which  E  is  everywhere  dense. 

Let  P  be  the  set  of  all  such  points  p.  Any  perfect  set  in  which  E  is 
everywhere  dense  must  be  contained  in  P ;  hence  P  has  no  isolated  points ; 
moreover  P  is  clearly  a  closed  set,  and  therefore  it  is  perfect.  It  will  be 
shewn  that  E  is  everywhere  dense  in  P.  If  a  perfect  set  be  contained  in  any 
interval,  or  domain,  contiguous  to  P,  and  E  were  dense  with  respect  to  such 
perfect  set,  it  would  be  everywhere  dense  with  respect  to  some  perfect 
portion  of  such  set,  and  this  portion  would  therefore  belong  to  P,  which  is 
impossible ;  hence  E  is  non-dense  in  any  perfect  set  contained  in  a  domain 
contiguous  to  P.  If  E  were  not  everywhere  dense  in  P,  the  part  of  P  in  some 
interval,  or  cell,  would  contain  no  points  of  E.  Any  point  of  P  belonging 
to  this  part  would  have  a  neighbourhood  free  from  points  of  E,  which  is 
contrary  to  the  definition  of  P.  Hence  E  is  everywhere  dense  in  P.  Let 

E=D(E,P)  +  E3', 

so  that  E2  consists  of  those  points  of  E  which  do  not  belong  to  P.  It  will  be 
shewn  that  E2  is  non-dense  with  respect  to  every  perfect  set,  and  con- 
sequently enumerable.  If  there  were  a  perfect  set  in  which  E.2  were  not  non- 
dense,  it  would  be  everywhere  dense  in  a  perfect  set  R,  a  portion  of  the  former 
one.  If  R  have  points  not  belonging  to  P,  E,  and  therefore  E2,  is  not  every- 
where dense  in  R.  If  R  is  contained  in  P,  E2  has  no  points  in  R.  Therefore 
E.t  is  non-dense  with  respect  to  every  perfect  set. 

*  Integrates  de  Lebesgue,  p.  113. 
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In  the  case  of  a  closed  set,  the  part  that  is  dense  in  itself  is  the  perfect 
set  which  is  a  part  of  the  closed  set,  the  other  part,  which  is  irreducible  in 
every  interval,  or  domain,  contiguous  to  the  perfect  set,  is  non-dense  in  every 
perfect  set. 

Denjoy*,  to  whom  the  above  theorem  is  due,  has  indicated  a  method,  in 
the  general  case,  of  obtaining  by  a  successive  process  the  enumerable  set  that 
is  non-dense  with  respect  to  every  perfect  set. 

113.  If  G  be  any  plane  set  of  points,  and,  corresponding  to  each  point  P, 
of  G,  a  cell  with  centre  P,  and  sides  all  equal  to  2/z,  be  contemplated,  the  set  of 
all  points  which  belong  to  one  or  more  such  cells,  when  the  cells  for  all  the 
points  P,  of  G,  are  considered,  is  said  to  be  a  neighbourhood  (h)  of  the  set  G. 
The  neighbourhood  (h)  of  the  set  G  is  said  to  be  complete,  or  incomplete, 
according  as  the  cells  are  taken  to  be  closed,  or  open. 

Every  point  of  the  incomplete  neighbourhood  (h),  denoted  by  H,  of  a  given 
set  G,  is  an  interior  point  of  H.  For  any  point  Q,  of  the  set  H,  is  an  interior 
point  of  a  cell,  all  the  interior  points  of  which  belong  to  H ;  hence  a 
neighbourhood  k'  (<A)  of  Q  can  be  determined  so  that  all  its  points  belong 
to  H.  Thus  the  incomplete  neighbourhood  (h)  of  any  given  set  G  is  an 
open  set. 

(a)  The  complete  neighbourhood  (h),  denoted  by  H,  of  a  closed  (bounded)  set 
G,  is  a  closed  set. 

For,  if  Qlt  Q2>  •••  be  a  sequence  of  points  of  H  which  converges  to  a 
point  Q(xw,  x®),  and  if  Qn  be  denoted  by  (xn(l),  #M(2))>  there  exists  a  point 
Pn(%nM,  £n(2l)>  of  G,  such  that  tfn(1>  -  £„<«  |,  |  xn®  -jn®j  are  both  ^  h.  The  set 
of  points  [Pn]  has  at  least  one  limiting  point  P(|(1),  £ (2)),  which  is  necessarily 
a  point  of  G.  A  sequence  of  increasing  integers  ml}  m.2,  ms,  . . .  can  be  so  chosen 
that  the  sequence  Pmi,  Pmz,  Pms,  ...  converges  to  P  as  its  sole  limiting 
point. 

Sinr>p       v     I1)  —  h  <  £     W  <  T     |J>  4-  h     and    r     {->  —  h  <  £     (2)  <  T     (2'  4-  h 
J"ins      —  li  =  %ms      =  "*"tns      T^  '<)    "     -1    ""ms      —  "•  =  <sms       =•*/«,  ll; 

for  every  value  of  s,  we  have 

xw  -  h  £  f<'>  £  x®  +  h,  and  x®  -h  ^  f  <2>  ^  x®  +  h. 

It  now  follows  that  Q  is  in  the  complete  neighbourhood  of  G.    Therefore  the 
set  H  is  closed. 

(6)  If  G-i,  G2  be  two  (bounded)  closed  sets  which  have  no  point  in  common, 
a  neighbourhood  of  G^  can  be  determined  which  contains  no  points  of  G-2. 

If  d  be  the  distance  between  the  sets  G:,  6r2,  the  complete,  or  the 
incomplete,  neighbourhood  (//.),  where  h<d/*/2,  of  Glt  contains  no  points 
of  G2. 

*  See  his  memoir  "  Sur  les  nombres  d(5riv6s,"  Journ.  de  Math.,  ser.  7,  vol.  i,  1915,  p.  235. 
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The  following  theorem  can  be  deduced : 

(c)  If  the  (bounded)  closed  set  G  consists  entirely  of  interior  points  of  a 
set  H,  a  neighbourhood  (h)  of  G,  complete,  or  incomplete,  can  be  so  determined 
that  it  consists  entirely  of  interior  points  of  H. 

The  points  of  G  have  a  maximum  distance  p  from  the  point  (0,  0). 
Choose  a  number  p  (>  p),  and  let  Hl  be  that  part  of  H  that  consists  of  points 
at  a  distance  ^  p'  from  the  point  (0,  0). 

It  is  clear  that  every  point  of  G  is  an  interior  point  of  Hl.  Let  L  be  the 
set  of  points  on  the  boundary  of  H^ ;  L  is  then  a  closed  set.  A  neighbour- 
hood (h)  of  G  can  be  determined  which,  whether  complete  or  incomplete, 
contains  no  points  of  L.  Every  point  of  this  neighbourhood  (h)  is  an  interior 
point  of  H!,  and  therefore  of  H.  For  any  such  point  Q  is  in  the  neighbour- 
hood (h)  of  some  point  P,  of  G.  If  the  neighbourhood  (h)  of  P  does  not 
consist  entirely  of  interior  points  of  H},  it  must  contain  one  or  more  points 
of  L ;  and  this  is  not  the  case.  Therefore  Q  is  an  interior  point  of  Hf. 

THE  CLASSIFICATION  OF  A  FAMILY  OF  SETS  OF  POINTS. 

114.  It  has  been  shewn,  in  §56,  that  the  set  M(0l}  0.,, ...  On,  ...),  of  points 
each  of  which  belongs  to  one  at  least  of  the  open  sets  01;  02, ...  On,  ...,  is 
itself  an  open  set. 

Let  us  consider  a  set  D(0i,  02,  ...  On,  ...)  which,  when  it  exists,  consists 
of  all  the  points  common  to  the  open  sets  of  the  sequence.  This  set  belongs 
to  a  class  of  sets  which  may  be  denoted  by  0(d} ;  so  that  each  set  of  the  class 
is  obtained  from  some  sequence  of  open  sets.  A  set  which  can  be  exhibited 
as  M(0i(d],  02(d),  ...)  belongs  to  a  class  of  sets  which  may  be  denoted  by  Odm. 
In  the  same  way,  a  set  which  can  be  exhibited  as  D(0^dm\  02(dm), ...)  belongs 
to  a  class  which  may  be  denoted  by  Odind.  By  continuing  this  process  we  can 
obtain  an  indefinite  number  of  classes  of  sets ;  of  these,  the  first  is  the  class  0 
of  open  sets,  the  second  is  the  class  of  sets  0(d\  the  third  is  the  class  of  sets 
0(dm},  the  fourth  the  class  of  sets  0(dmd] ;  and  so  on.  The  class  &>  may  be 
defined  as  the  class  of  sets,  each  of  which  consists  of  the  points  common  to 
all  the  sets  of  a  sequence,  each  member  of  which  belongs  to  one  of  the  classes 
1,  2,  3,  —  The  class  to  4- 1  consists  of  sets  each  of  which  consists  of  the  points 
belonging  to  one  or  more  of  the  sets  belonging  to  a  sequence  of  sets  of 
class  o> ;  the  class  o>  -4-  2  consists  of  sets  each  of  which  consists  of  the  points 
common  to  all  the  sets  of  a  sequence  of  sets  belonging  to  class  to  -f  1. 

Proceeding  in  this  manner,  we  may  contemplate  the  existence  of  a  class 
corresponding  to  any  ordinal  number  of  the  second  class. 

*  See  Hobson,  Proc.  Loud.  Math.  Soc.,  (2),  vol.  xiv.    The  theorem  (c)  was  first  proved  by  Bolza 
in  a  different  manner;  see  Vorlesunyen  ilber  Variationsrechnung,  p.  155. 
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This  classification  has  been  given  by  Hausdorff*,  who  applies  the  term 
Borel-set  to  denote  any  member  of  any  of  these  classes,  all  of  which  he  shews 
exist.  He  has  proved  that  every  Borel-set  is  either  enumerable  (or  finite), 
or  has  the  cardinal  number  of  the  continuum.  This  is  a  generalization  of  the 
theorem  that  every  closed  set,  which  is  an  0(d\  has  one  of  the  cardinal 
numbers  a  and  c.  For  the  whole  class  0(d\  the  theorem  was  established  by 
W.  H.  Young  (see  §  97),  in  connection  with  the  theory  of  ordinary  inner 
limiting  sets.  The  result  obtained  by  Hausdorff  settles  the  question  as  to  the 
possible  cardinal  numbers  of  sets  of  points,  in  the  case  of  sets  belonging  to  a 
very  extensive  class. 

A  detailed  treatment  of  the  possible  peculiarities  of  structure  of  sets  of 
points  in  plane  space,  or  in  space  of  higher  dimensions,  is  of  the  highest 
importance  in  relation  to  the  application  of  the  principles  of  the  theory  of  sets 
of  points  to  abstract  geometry.  As  a  development  of  this  character  is  beyond 
the  scope  of  the  present  work,  reference  is  here  made  to  the  account  given  by 
Schoenflies'f,  where  references  to  the  literature  of  the  subject  will  be  found. 
A  memoir  J  by  W.  H.  and  G.  C.  Young,  "  On  the  internal  structure  of  a  set 
of  points  in  space  of  any  number  of  dimensions,"  may  also  be  here  referred 
to.  Information  on  the  subject  will  also  be  found  in  W.  H.  Young's  treatise 
"  On  the  theory  of  sets  of  points."  Reference  may  also  be  made  to  TO  N. 
Watson's  tract,  "  Complex  integration  and  Cauchy's  theorem,"  where  a  proof 
will  be  found  of  Jordan's  fundamental  theorem,  that  a  plane  is  divided  into 
two  distinct  open  connex  domains,  by  means  of  a  closed  Jordan  curve. 
Another  proof  ||  has  been  given  by  Brouwei; 

SETS  OF  SEQUENCES  OF  INTEGERS. 

115.  A  theory  of  sets  of  sequences  of  integers,  of  which  the  formal  character 
is  similar  to  the  theory  of  sets  of  points  in  any  number  of  dimensions,  has 
been  developed  by  Baire§,  with  a  view  to  application  to  the  Theory  of  Functions. 

A  group  of  integers  («1}  «2,  ...  ap),  of  order  p,  consists  of  a  system  of  p 
positive  integers  arranged  in  a  definite  order. 

The  group  (a1}  «2,  ...  ap),  of  order  p,  is  said  to  be  contained  in  each  of  the 
groups  («!>,  («1:  flf2),  («!,  Ofo,  «s),  ...  («!,  «2,  ...  «p_j)  of  orders  1,  2,  3,  ...  p  -  1, 
respectively. 

A  sequence  of  integers  (al,  «2,  ...  ap,  ...)  consists  of  an  infinite  number  of 
integers,  defined  in  any  manner,  and  arranged  in  an  order  similar  to  the 
sequence  1,  2,  3,  ....  This  sequence  is  said  to  be  contained  in  each  of  the 
groups  (ftj),  («!,  «2),  •••  («i,  «2,  •••  <*p)>  ••••  Let  P  be  a  set  of  such  sequences  of 
integers,  and  let  A  be  any  other  sequence  of  integers ;  then  if,  for  every  n, 

*  Math.  Annalen,  vol.  LXXVII  (1916),  p.  430. 

f  See  his  Entwickehtng  dcrLehrevon  den  Punktmannigfaltigkeiten,  vol.  n,  chap.  iv. 

J  Proc.  Loud.  Math.  Soc.,  (2),  vol.  xvi,  p.  337.          §  Comptes  Rendus,  vol.  cxxix,  1899,  p.  946. 

:;  Math.  Annalen,  vol.  LXIX,  p.  169,  and  vol.  LXXI,  p.  314. 
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there  are  sequences  in  P,  other  than  A  itself,  which  are  contained  in 
the  same  group,  of  order  n,  as  A  itself,  that  is  sequences  having  their  first 
?i  integers  the  same  as  the  first  n  integers  in  A,  then  the  sequence  A  is  said 
to  be  a  limit  of  the  set  of  sequences  P.  The  sequence  A  may,  or  may  not,  itself 
belong  to  P. 

The  set  P  is  said  to  be  closed,  in  case  all  its  limits  belong  to  it.  The  set 
is  said  to  be  perfect  when  it  is  closed,  and  also  every  sequence  in  the  set  is  a 
limit  of  the  set. 

A  set  E,  of  groups  of  integers,  is  said  to  be  complete  if,  when  g  is  any  group 
of  order  p,  belonging  to  E,  the  groups  of  orders  1,  2,  3,  ...  p  —  1,  which  contain 
g,  also  belong  to  E. 

A  complete  set  E,  of  groups  of  integers,  is  said  to  be  closed,  if  every  group 
g,  belonging  to  E,  contains  at  least  one  group  of  higher  order  than  itself,  which 
is  also  contained  in  E. 

Having  given  a  complete  set  of  groups  E,  a  sequence  A  may  exist  such 
that  all  the  groups  containing  A  belong  to  E.  The  set  F,  of  all  sequences  such 
as  A,  is  said  to  be  determined  by  the  set  of  groups  E.  The  set  F,  if  it  exists, 
is  closed. 

Every  closed  set  of  groups  E  determines  a  closed  set  of  sequences  F,  and 
conversely,  every  closed  set  of  sequences  F  is  determined  by  a  unique  closed 
set  of  groups  E.  In  case  F  is  perfect,  E  is  also  said  to  be  perfect.  In  order 
that  E  may  be  perfect,  it  is  necessary  and  sufficient  that  every  group  belonging 
to  E  should  contain  at  least  two  groups  of  one  and  the  same  order  superior  to 
its  own  order,  and  belonging  to  E. 

If  P  is  a  set  of  sequences,  then  the  set  P'  of  those  sequences  which  are 
limits  of  the  set  P  is  said  to  be  the  derived  set  of  P,  and  may  be  denoted  by 
P'.  The  derived  set  P'  is  closed. 

The  successive  derivatives  P",  P'",  ...  PM,  ...  P(a),  of  finite  or  transfinite 
orders,  are  then  defined  as  in  the  theory  of  sets  of  points.  If  P  is  a  closed 
set  of  sequences,  there  exists  a  number  a  of  the  first  or  the  second  class,  such 
that  P(a)  =  P<a+1>.  Unless  P  is  an  enumerable  set,  it  can  be  resolved  into  the 
sum  of  an  enumerable  set  and  a  perfect  set. 

Let  us  consider  a  perfect  set  of  groups  E  determining  a  perfect  set  of 
sequences  F.  A  set  P,  of  sequences  all  belonging  to  F,  is  said  to  be  non-dense 
in  F,  or  in  E,  provided  that  every  group  of  E  contains  at  least  one  group  of  E 
which  contains  no  sequence  of  P. 

A  set  of  sequences  P,  all  belonging  to  F,  is  said  to  be  of  the  first  category, 
relative  to  F,  if  there  exists  an  enumerable  sequence  of  sets  P1}  P.2,  ...  Pn,  ... , 
each  of  which  is  non-dense  in  F,  and  such  that  each  sequence  of  P  is  part  of 
one  at  least  of  the  sets  Pn  P2,  ...  PM,  —  The  set  obtained  by  removing  the 
set  P  from  F  is  said  to  be  of  the  second  category  relative  to  F.  The  same 
generic  distinction  between  sets  of  the  first  and  of  the  second  categories  holds, 
as  in  the  theory  of  sets  of  points. 


CHAPTER  III 

THE  METRIC  PROPERTIES  OF  SETS  OF  POINTS 

116.  IN  Chapter  II  an  account  has  been  given  of  those  properties  of  sets 
of  points  which  are  called  descriptive,  in  order  to  distinguish  them  from  the 
metric  properties  which  we  now  proceed  to  discuss.    In  the  investigation  of 
descriptive  properties  the  notion  of  the  distance  between  a  pair  of  points 

f p  }  - 

x,  x',  defined  arithmetically  by  42  (x  —  x'}*\  ,  has  been  employed,  but,  although 

U 

the  descriptive  properties  of  a  set  are  invariant  for  a  large  class  of  trans- 
formations, the  distance  between  a  pair  of  points  x,  x'  is  not  invariant  in  such 
transformations. 

The  earliest  theory  of  the  metric  properties  of  sets  was  originated,  for  the 
case  of  linear  sets,  by  Hankel*,  and  was  further  developed  by  Harnackf, 
StolzJ,  and  by  Can  tor  §,  who  extended  the  conception  of  the  content  of  a  set 
to  the  case  of  sets  of  points  in  a  domain  of  any  number  of  dimensions. 
Although  this  theory  of  content  has  now  been  almost  entirely  superseded 
by  the  later  theory  of  measure,  developed  by  Borel||  and  Lebesguell,  we 
proceed  to  give  an  account  of  it,  not  only  for  historical  reasons,  but  also  in 
order  to  point  out  the  respects  in  which  it  has  defects  which  are  remedied  in 
the  later  theory  of  the  measure  of  sets  of  points.  A  very  general  theory  of 
measure  has  been  given  by  Carath^odory**. 

THE  CONTENT  OF  A  SET  OF  POINTS. 

117.  Suppose  a  linear  set  of  points  G  to  be  contained  in  the  finite  interval 
(a,  6).   Let  a  system  of  nets  be  fitted  on  to  the  interval  (a,  6).    In  this  case  it 
will  be  convenient  to  take  the  meshes  of  the  nets  to  be  closed  at  both  ends. 
In  the  net  of  order  r,  let  there  be  nr  meshes,  and  of  these  suppose  vr  contain 
points  of  G  within  them,  or  at  their  ends.    Let  Snr  Vf  (£  b  —  a)  denote  the 
sum  of  the  breadths  of  the  vr  meshes.    It  is  clear  that  Snr  Vf  ^  $nr+i,»r+i ;  and 
thus  the  sequence  of  numbers  Sni  Vl,  Sn2  V2, ...  SnTi,,r  ...,  which  are  all  >0,  has 
a  definite  limit  2,  which  is  such  that  0  ^  2  ^  6  —  a.    The  number  8nriVT—  2 
may  be  made  as  small  as  we  please,  by  taking  r  sufficiently  large. 

It  will  be  shewn  that  the  number  S  is  the  same  whatever  closed  system 
of  nets  is  employed.  Suppose  two  systems  of  nets  to  give  the  values  21;  22,  of 
2,  respectively.  We  may  denote  by  $Wr(i),,,ru),  Snr(z)tVr(2)  the  values  of  the  sum 
Snr>Vr  for  the  two  systems  of  nets,  {DnM},  {Dn('2)\. 

*  Math.  Anualen,  vol.  xx,  p.  86.  f  Math.  Annalen,  vol.  xxv,  p.  241. 

J  Math.  Annalen,  vol.  xxm,  p.  152;  see  also  Pasch,  Math.  Annalen,  vol.  xxx,  p.  142. 
§  Math.  Annalen,  vol.  xxm,  p.  473.  ||  See  Lemons  sur  la  Theorie  des  Fonctions. 

11  See  the  memoir  "  Integrate,  longueur,  aire,"  Annali  di  Mat.,  (3),  vol.  vn,  also  Lemons  sur 
V integration.  **  See  Gottinger  Nachrichten,  1914,  p.  404. 
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If  a  net  Dw(1)  of  one  system  be  superimposed  on  a  set  /)OT'(2>  of  the  other 
system,  we  obtain  a  net  which  may  be  denoted  by  A*,™'- 

If  <r  denote  the  sum  of  the  breadths  of  those  meshes  of  Dn^",  n,<*>  that 
contain  points  of  G,  it  is  clear  that  a-  £  8nr(l\  •><>>• 

Let  r  be  so  large  that  <S>nr(»f Vfw  —  2i  <  e,  an  arbitrarily  chosen  positive 

number ;  and  let  s  be  so  large  that  the  greatest  of  the  breadths  of  the  meshes 

in  the  nets  Z)g(2)  is  <  tj,  an  arbitrarily  chosen  positive  number.     Of  the  i/,(a) 

meshes  in-the  sum  Sntw  Vfw  the  number  that  are  not  also  meshes  in  the  sum  a 

.is  at  most  nr  -  1 ;  hence  $»,(2)f  „,»)  <  <r  +  tnr'n-    Therefore 

22  <  SntW,  */2)  <  ^«r(1),  "r(1)  +  nrV  <  2j  +  6  +  Hrr). 

Now  e  and  77  are  arbitrarily  small,  and  it  therefore  follows  that  2a  ^  2,. 
It  can  similarly  be  proved  that  ^  ^  22 ;  and  from  the  two  relations  it  is 
inferred  that  2j  =  22.  Therefore  2  is  a  definite  number,  independent  of 
the  particular  system  of  nets  employed  in  defining  it. 

We  have  now  established  the  following  theorem : 

If  G  be  any  given  set  of  points  in  the  interval  (a,  b),  there  corresponds  to  O 
a  definite  number  2,  which  is  such  that  all  the  points  of  G  are  interior  points  of 
a  definite  number  of  intervals  whose  sum  exceeds  2  by  less  than  an  arbitrarily 
chosen  positive  number  e,  the  number  of  the  intervals  depending  on  e. 

The  number  2  is  called  the  content  of  the  set  G,  and  the  content  may  have 
any  value  in  the  closed  interval  (0,  b  —  a). 

Those  sets  of  points  for  which  the  content  is  zero  are  of  special  importance 
in  the  Theory  of  Functions.  A  set  of  zero  content  is  said  to  be  an  unextended, 
or  a  discrete,  or  an  integrable  set  of  points. 

It  is  clear  that  the  definition,  and  the  proof  of  existence  given  above,  are 
applicable  in  the  case  of  a  bounded  set  of  points  in  space  of  any  number  of 
dimensions,  provided  intervals  are  replaced  by  cells,  and  the  breadth  of  an 
interval  is  replaced  by  the  product  of  the  lengths  of  the  sides  of  a  cell,  which 
is  regarded  as  the  content  of  the  cell.  A  point  of  the  set  that  is  on  the 
boundary  of  a  cell  is  also  in  one  adjacent  cell  at  least. 

118.    The  content  of  a  set  of  points  is  the  same  as  the  content  of  its  derivative. 

Let  2'  be  the  content  of  G'  the  derivative  of  a  set  G  ;  then  the  points  of  G' 
may  be  included  in  a  finite  set  of  intervals,  or  cells,  the  sum  of  whose  contents 
is  less  than  2'  +  S,  where  8  is  an  arbitrarily  chosen  positive  number.  There 
can  be  only  a  finite  number  of  points  of  G  which  do  not  fall  within  the  intervals, 
orinthecells,  that  include  the  points  of  G'',  and  this  finite  number  of  points  may 
be  included  in  intervals,  or  cells,  the  sum  of  the  contents  of  which  is  arbitrarily 
small,  say  e.  All  the  points  of  G  are  now  included  in  a  finite  number  of  intervals, 
or  cells,  of  total  content  less  than  2'  +  8  +  e ;  and  a  series  of  diminishing  values 
may  be  assigned  to  8  and  e ;  each  sequence  having  the  limit  zero.  Therefore 
both  2'  +  8  +  €  and  2'  +  8  converge  to  the  value  2';  which  proves  the  theorem. 
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It  follows,  from  this  theorem,  that  the  content  of  any  set  is  the  same  as 
that  of  any  of  its  successive  derivatives.  In  the  case  of  a  set  which  is  of  the 
first  species,  one  of  the  derivatives  contains  only  a  finite  number  of  points, 
and  consequently  the  set  must  be  of  zero  content. 

119.  A  definition  of  the  content  of  a  linear  set  of  points  has  been  given  by 
Cantor*  which,  though  differing  in  form  from  that  of  Hankel  and  Harnack,  is 
in  reality  equivalent  to  it.  Instead  of  enclosing  the  points  of  the  set  G  in  a 
finite  number  of  intervals,  Cantor  encloses  each  point  of  G  in  an  interval  2/j, 
of  which  the  point  is  the  middle  point,  the  number  p  being  the  same  for  each 
point  of  the  set,  those  parts  of  intervals  2/>  which  do  not  lie  within  (a,  6)  being 
disregarded.  We  have  in  this  manner  obtained  an  infinite  number  of  overlap- 
ping intervals  which  contain  all  the  points  of  G,  and,  as  is  clear,  all  the  points 
of  G',  which  is  a  closed  set.  If  we  replace  this  set  of  intervals  by  the  set  of 
non-overlapping  intervals  with  the  same  interior  points,  each  interval  of  this 
latter  set  is  ^  p.  The  set,  which  is  non-overlapping,  and  equivalent  to  the 
infinite  set,  is  consequently  a  finite  set,  the  sum  of  whose  lengths  may  be 
denoted  by  FI  (p,  G).  When  p  is  diminished  indefinitely,  the  number  FI  (p,  G), 
which  cannot  increase  as  p  is  diminished,  must  have  a  definite  lower  limit, 
which  defines  the  content  of  either  of  the  sets  G  and  G.  Since  the  infinite 
set  of  intervals  which  has  been  employed  only  covers  a  finite  number  of 
detached  lengths,  this  definition  is  equivalent  to  that  of  Hankel  and  Harnack. 
Cantor  applies  this  definition  to  the  case  of  a  set  of  points  in  a  jp-dimensional 
continuum,  by  enclosing  each  point  in  a  "  sphere  "  of  radius  p,  with  its  centre 
at  the  point ;  the  content  is  then  the  lower  limit  of  the  volume  of  the  continuum 
contained  within  the  spheres. 

The  essential  point  in  the  above  definition  of  the  content  of  a  set  of  points 
is  that  all  the  points  are  contained  in  a  finite  number  of  intervals,  or  cells,  which 
therefore  contain  all  the  limiting  points^ ;  and  the  lower  limit  of  the  sum  of 
the  contents  of  these  intervals,  or  cells,  is  taken  as  defining  the  content  of  the 
set.  If  the  points  are  contained,  from  the  commencement,  in  an  infinite  number 
of  intervals,  or  cells,  which  are  of  unequal  span,  in  accordance  with  some  pre- 
scribed law,  and  the  spans  of  these  intervals,  or  cells,  are  then  diminished,  each 
one  in  a  prescribed  manner  tending  to  the  limit  zero,  then  the  limit  of  the  sum 
of  those  parts  of  the  fundamental  interval,  or  cell,  which  are  included  in  the 
infinite  set  of  intervals,  or  cells,  is  not  necessarily  equal  to  the  content,  as  above 
defined.  For  example,  let  us  consider  the  set  of  rational  points  in  the  interval 
(0,  1).  These  points  can  be  arranged  in  enumerable  order  PI,  P2,  P3,  ... :  now 
enclose  PI  in  an  interval  of  length  e/2,  P2  in  an  interval  e/22,  etc.,  Pn  in  an 
interval  of  length  6/2",  and  so  on ;  the  total  content  of  these  intervals 
cannot  exceed  eS  l/2n,  or  e  ;  and  this  has  the  limit  zero,  as  e  is  diminished 

*  Math.  Annalen,  vol.  xxm,  p.  473. 

f  See  Harnack,  Math.  Annahn,  vol.  xxm,  p.  241. 
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towards  zero.  On  the  other  hand,  the  content  of  the  set  of  rational  points  is 
the  same  as  that  of  the  derived  set ;  but  this  consists  of  all  the  points  of  the 
interval  (0, 1),  and  is  therefore  unity.  In  general,  any  enumerable  set  of  points 
can  be  contained  in  an  infinite  number  of  intervals,  or  cells,  which  have  a  total 
content  that  is  arbitrarily  small,  and  has  the  limit  zero ;  whereas  the  content 
of  the  set  is  not  in  general  zero. 

A  completely  satisfactory  definition  of  the  content  of  a  set  of  points  of  the 
most  general  character  should  satisfy  the  condition  of  affording  a  consistent 
generalization  of  the  notion  of  the  length  of  a  continuous  linear  set  of  points, 
or  of  the  notions  of  area  and  volume,  in  the  case  of  sets  of  points  in  two,  or  of 
three,  dimensions.  In  the  case  of  closed  sets,  the  definition  given  above  leaves 
nothing  to  be  desired  in  this  respect ;  but  in  the  case  of  unclosed  sets,  the 
definition  leads  to  consequences  which  are  at  variance  with  the  fundamental 
properties  of  lengths,  areas,  and  volumes,  as  understood  for  the  case  of  con- 
tinuous domains.  If  Glt  G2  are  two  complementary  sets  of  points  in  the  con- 
tinuous interval  (0,  1),  then,  in  order  that  the  contents  of  the  sets  Gl}  G2  may 
accord  with  a  generalization  of  the  notion  of  length,  their  sum  should  be 
unity ;  however,  when  Gl  and  Gz  are  unclosed,  this  condition  is  in  general  not 
satisfied  by  the  definition  given  above.  For  example,  if  Gl  consists  of  the 
rational  points,  and  G2  of  the  irrational  points,  each  of  the  two  sets  Glt  G2  has 
its  content  unity,  the  same  as  that  of  the  interval  (0,  1)  itself.  Again,  let  us 
consider  an  everywhere  dense  set  of  non-overlapping  intervals  contained  in 
(0,  1);  then  the  internal  points  of  these  intervals  form  an  open  set  Gl}  of 
which  the  derivative  consists  of  all  the  points  of  the  interval  (0, 1 ) ;  the  external 
and  the  end-points  of  the  intervals  forming  a  non-dense  closed  set  G2.  It  will 
be  shewn  subsequently  that  the  everywhere  dense  set  of  non-overlapping 
intervals  can  be  so  chosen  that  the  limit  of  the  sum  of  their  lengths  is  an 
arbitrary  number  I,  where  I  is  subject  to  the  condition  0  <  I  £  1 ;  whereas  the 
content  of  the  set  Gl  is,  in  accordance  with  the  definition  given  above,  always 
unity,  and  therefore  may  differ  from  the  sum  of  the  contents  of  the  sets  of 
points  contained  in  the  separate  intervals.  To  obtain  the  content  of  the  closed 
set  G2,  cut  off,  from  each  of  the  intervals  which  define  Glt  the  l/2nth  part  of 
its  length  at  each  end;  the  limiting  sum  of  the  intervals,  so  restricted,  is 
1(1  —  I/ft).  Of  these  restricted  intervals,  a  finite  number  can  be  so  taken  that 
their  sum  is  >  I  (1  —  l/n)  —  e,  and  <  /  (1  —  l/n),  where  e  is  an  arbitrarily  chosen 
positive  number.  All  the  points  of  G2  are  now  enclosed  in  the  finite  set  of 
intervals  which  is  complementary  to  the  finite  set  of  restricted  intervals.  The 
sum  of  these  complementary  intervals  is<  1  —  I  (1  —  l/n)  +  e,  and  >1  —  I  (1  —  l/n); 
the  sum  has  for  its  lower  limit  the  number  1  —  I,  which  is  therefore  the  content 
of  GV  The  sum  of  the  contents  of  Gly  G2  is  therefore  not  equal  to  unity,  which 
is  nevertheless  the  content  of 

0,  + 0.3  (0,1). 
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THE  PROBLEM  OF  MEASURE. 

120.  The  problem  of  assigning  definite  numerical  measures  to  sets  of 
points  defined  in  linear,  plane,  solid,  or  ^-dimensional,  space  is  taken  to 
require  that  the  measure  of  a  set  shall  satisfy  the  following  conditions  : 

(1)  The  measure  of  a  set  shall  be  in  accord  with  the  usual  notions  of 
length,  area,  volume,  of  an  interval,  a  rectangle,  or  a  cell  in  three  or  more 
dimensions. 

Accordingly  there  must  exist  separate  systems  of  measures  for  linear, 
plane,  or  ^-dimensional,  sets  of  points,  corresponding  to  generalizations  of  the 
measures  of  length,  area,  or  volume. 

(2)  The  linear  measure  of  the  set  of  points  in  a  linear  interval  (a,  b)  is 
taken  to  be  b  —  a,  whether  the  set  includes  neither,  one,  or  both,  of  the  end- 
points  a  and  6,  of  the  interval. 

The  ^-dimensional  measure  of  the  set  of  points  in  the  cell 

(a(l),  a(2),  . 
is  taken  to  be  the  product 


whether  the  set  of  points  includes  none,  some,  or  all,  of  the  points  on  the 
boundary  of  the  cell. 

Thus,  when  p  —  2,  the  measure  of  the  set  of  points  in  the  rectangle  is  in 
agreement  with  the  ordinary  measure  of  its  area  ;  when  p  =  3,  the  measure  of 
the  set  of  points  in  the  rectangular  parallelepiped  is  in  agreement  with  the 
ordinary  measure  of  its  volume. 

It  will  be  observed  that,  in  accordance  with  this  postulation,  the  ^-dimen- 
sional measure  of  a  g-dimensional  cell,  when  q  <  p,  is  zero.  Thus  a  linear 
interval  has  the  plane  measure  zero;  and  a  plane  rectangle  has  the  three- 
dimensional  measure  zero. 

(3)  The  measure  of  a  set  of  points  is  to  be  a  number  dependent  on  the 
set,  such  that  the  measure  of  the  sum  of  two  sets,  which  have  no  point  in 
common,  is  the  sum  of  the  measures  of  the  two  sets.  It  then  follows  that  the 
measure  of  the  sum  of  any  finite  number  of  sets,  no  two  of  which  have  a 
point  in  common,  is  the  sum  of  the  measures  of  the  sets.  The  measure  of 
a  set  being  regarded  as  a  function  of  the  set,  is  thus  required  to  be  an 
additive  function,  i.e.  a  function  such  that  its  value  for  the  set  El  4-  E2  is  the 
sum  of  its  values  for  El  and  Ez. 

It  follows  that,  if  a  set  F  be  a  component  of  a  set  E,  the  measure  of 
the  set  E  —  F  is  to  be  the  excess  of  the  measure  of  E  over  that  of  F.  In 
case  E  and  F  have  the  same  measures,  the  set  E  —  F  is  to  have  the  measure 
zero. 
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In  particular,  the  ja-dimensional  measure  of  a  (p  —  l)-dimensional  set  is 
zero.  For  such  a  set  may  be  taken  to  be  on  the  boundary  of  a  ^-dimensional 
cell ;  and,  in  accordance  with  (1),  the  measure  of  the  set  of  points  interior  to 
the  p-dimensional  cell  is  unaltered  by  addition  of  the  (p  —  l)-dirnensional  set 
on  its  boundary  ;  consequently  the  p-dimensional  measure  of  this  last  set  must 
be  zero. 

(4)  The  measure  of  the  sum  of  an  enumerably  infinite  sequence  of  sets, 
no  two  of  which  have  a  point  in  common,  is  to  be  the  limiting  sum  of  the 
measures  of  the  sets,  whenever  that  limiting  sum  exists.     This   may  be 
expressed  as  the  postulation  that  the  measure  of  a  set  shall  be  a  completely 
additive  function  of  the  set. 

(5)  The  measures  of  two  sets  such  as  are  obtained  from  one  another  by  a 
congruent  transformation  are  to  be  identical. 

Congruent  sets  are  such  that,  to  each  pair  of  points  P,  Q,  of  one  of  the  sets  i 
there  corresponds  a  unique  pair  of  points  P',  Q',  of  the  other  set,  such  that  the 
distance  of  P'  from  Q7  is  the  same  as  that  of  P  from  Q. 

In  order  that  a  system  of  measurement  of  sets  may  be  set  up,  for  each 
number  of  dimensions,  which  shall  satisfy  the  above  postulates,  definitions 
will  be  introduced  which  will  apply  to  a  certain  category  of  sets,  called 
measurable  sets,  and  which,  for  sets  of  this  category,  will  be  shewn  to  provide 
a  function  of  the  set  that  satisfies  the  above  postulates.  It  will  appear  that 
the  class  of  measurable  sets,  in  linear,  plane,  or  higher  dimensional,  space 
includes  all  the  sets  which  are  defined  by  certain  prescribed  methods.  The 
possibility  of  extending  the  theory  to  cases  of  non-measurable  sets  will  be  left 
out  of  account*. 

THE   MEASURES   OF   OPEN  AND  CLOSED  SETS. 

121.  A  linear  set  of  points  0,  open  relatively  to  a  linear  interval  (a,  b) 
which  contains  the  set,  is  the  sum  of  a  unique  enumerable  set  of  open 
intervals. 

In  accordance  with  (3),  the  measure  of  0  must  be  taken  to  be  the  limiting 
sum  of  the  measures  of  the  intervals  of  this  set.  That  this  limiting  sum  is 
finite,  and  cannot  exceed  I,  the  measure  of  (a,  6),  is  seen  by  considering  the 
first  n  of  the  intervals,  as  arranged  in  order ;  the  sum  of  these  n  intervals  is 
less  than  I  by  the  sum  of  the  lengths  of  the  finite  set  of  intervals  comple- 
mentary to  them ;  and  this  holds  good  however  large  n  may  be. 

Every  open  interval  may  be  regarded  as  consisting  of  the  points  of  an 
enumerable  set  of  closed  intervals,  no  two  of  which  overlap  one  another, 

*  A  definition  of  a  non-measurable  set  has  been  given  by  Van  Vleck,  Trans.  Amer.  J/V/f/i.  Soc., 
vol.  xxxv,  p.  209,  but  this  definition  requires  an  infinite  number  of  arbitrary  acts  of  choice.  Such 
sets  have  also  been  defined  by  Lebesgue,  Bull,  de  la  Soc.  Math,  de  France,  vol.  ix,  p.  '237. 
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although  they  may  have  an  end-point  in  common  ;  therefore  any  open  set  may 
be  regarded  as  consisting  of  the  points  of  an  enumerable  set  of  non-overlapping 
closed  intervals.  The  measure  of  the  open  set  can  then  be  regarded  as  the 
limiting  sum  of  the  measures  of  such  a  system  of  non  -overlapping  closed 
intervals. 

In  accordance  with  (1),  it  makes  no  difference  that  pairs  of  the  intervals 
may  have  an  end-point  in  common. 

If  G  be  any  closed  linear  set  in  (a,  b),  the  complementary  set  C(G)  is  open 
relatively  to  (a,  b),  and  in  accordance  with  (2),  the  measure  of  G  must  be  given 
by  m(G)  =  l  —  m  {C(G}}  ;  where  the  measures  of  G  and  C(G)  are  denoted  by 
m(G)  and  m  {G  (G)}  respectively. 

122.  It  has  been  shewn  in  §  109  that,  in  space  of  any  number  of  dimensions, 
an  open  set  0,  in  a  cell  (a,  b),  is  constituted  by  the  points  of  an  enumerable 
set  of  closed  cells.  In  accordance  with  (3),  the  measure  of  0  must  be  the 
limiting  sum  of  the  measures  of  these  closed  cells.  That  this  limiting  sum  is 
finite,  and  does  not  exceed  the  measure  I,  of  the  cell  (a,  6),  is  proved  as  in  the 
case  of  a  linear  set. 

If  G  be  a  closed  set,  in  the  cell  (a,  b),  the  measure  of  G  must,  in  accordance 
with  (2),  be  given  \>y  m(G)  =  1  —  m  {C  (G)}  ;  the  set  C(G)  being  open  rela- 
tively to  (a,  b). 

Since  the  set  of  closed  cells,  or  intervals,  which  constitutes  a  given  open 
set  0  is  not  unique,  it  must  be  shewn  that  the  measure  of  0,  defined  as  the 
limiting  sum  of  the  measures  of  the  closed  cells,  or  intervals,  is  independent 
of  the  particular  sets  of  such  cells,  or  intervals.  In  order  to  shew  this,  the 
following  theorem,  somewhat  more  general  than  is  required  for  this  particular 
purpose,  will  be  proved  : 

If  an  open  set  0  consist  of  all  the  points  of  an  enumerable  set  {AMj,  of  non- 
overlapping  closed  cells,  and  if  (An'j  be  any  other  set  of  non-overlapping  closed 
cells  such  that  every  point  of  0  belongs  to  at  least  one  of  the  closed  cells  (An'j,  the 
limiting  sum  of  the  measures  of  |An'}  cannot  be  less  than  that  of  the  measures 


Assume,  if  possible,  that  the  limiting  sum  of  the  measures  of  the  cells  of 
{An}  exceeds  that  of  the  measures  of  {A,/}.  A  finite  set  Aj,  A2,  ...  Am,  of  the 

r  =  wt 

cells  {An},  may  then  be  so  determined  that  2  m(Ar)  is  greater  than  the 

r  =  \ 

limiting  sum  of  the  measures  of  the  cells  {A,/}. 

A  cell  An'  containing  A,/  in  its  interior,  and  with  the  same  centre,  can  be 
defined  such  that  m  (An')  —  m(An')  =  e/2n;  where  e  is  any  chosen  positive 
number;  and  this  can  be  done  for  each  of  the  cells  {  An'}.  The  limiting  sum  of  the 
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measures  of  the  cells  {A,/}  exceeds  that-^qt  tfle  cells  {An'j  by  e.    If  e  be 

r-ro 

sufficiently  small,  S  m  (Ar)  is  greater  than  the  limiting  sum  of  the  measures 

r-l 

of  the  cells  {A,/}.  The  set  of  points  in  Aj,  A.J,  .>.  Am  is  a  closed  set  Gm ',  and 
since  each  point  of  Om  is  interior  to  a  cell  of  {An'j,  from  the  Heine-Borel 
theorem  (§  74)  it  follows  that  a  finite  set  of  the  cells  {A,/)  can  be  determined 
so  as  to  contain  all  the  points  of  Gm.  The  total  measure  of  the  cells  of  this 

r—m 

finite  set  cannot  be  less  than  2  m  (Ar) ;  and  this  is  contrary  to  the  supposition 

r-l 

made  above. 

It  follows  from  this  theorem  that,  if  {A,,},  {A,/}  be  two  sets  of  closed  non- 
overlapping  cells,  each  of  which  constitutes  a  given  open  set  0,the  limiting  sums 
of  the  measures  of  the  cells  of  each  set  must  be  the  same;  for,  by  the  theorem, 
it  is  impossible  that  either  of  these  should  be  less  than  the  other. 

The  unique  measure  of  an  open  bounded  set  0  is  thus  defined*  as  the  limiting 
sum  of  the  measures  of  the  closed  cells,  or  intervals,  of  a  set,  all  the  points  of 
which  constitute  the  set  0. 

The  measure  of  a  bounded  closed  set  G  is  then  defined  to  be  the  excess  of  the 
measure  of  the  fundamental  cell,  or  interval,  in  which  G  is  contained,  over  the 
measure  of  the  open  set  C(G),  which  is  the  complement  of  G  with  regard  to 
the  fundamental  cell,  or  interval. 

All  the  points  of  the  closed  set  G  are  in  the  finite  set  of  cells,  or  intervals, 
complementary  to  the  cells,  or  intervals,  Aj,  A2,  ...  Aw.  Thus  m(G)  is  the 
lower  limit  of  the  sum  of  the  measures  of  this  finite  set,  as  n  is  indefinitely 
increased.  It  is  thus  seen  that  the  measure  of  a  closed  set  as  here  defined  is 
identical  with  the  content  of  such  a  set  as  defined  in  §  117.  It  has  been  seen 
that  any  open  set  0  contains  a  closed  set  whose  measure  is  arbitrarily  little 
less  than  m  (0). 

123.  It  follows,  as  a  particular  case  of  the  theorem  proved  in  §  122,  that, 
if  02  be  an  open  set  which  contains  another  open  set  Olt  m  (02)  =  m  (Oj).  For, 
if  Oj  is  constituted  by  a  set  of  closed  cells  {Art),  and  02  by  a  set  of  closed  cells 
{An'j,  the  limiting  sum  of  the  measures  of  the  cells  A, /  cannot  be  less  than 
that  of  the  measures  of  the  cells  An ;  and  thus  m  (02)  cannot  be  less 
than  m  (Oi). 

If  Ou  02, ...  On, ...  be  a  sequence  of  open  sets,  all  contained  in  a  finite  cell, 
or  interval,  then 

m(01)  +  m(Oa)+...  +  m(On)+...^m[M(01,  0,, ...  On, ...)]. 
It  has  been  shewn  in  §  56  that  M(0l,  0.,,  ...  On,  ...)  is  an  open  set;  let 
{Am}  be  a  sequence  of  closed  cells  which  constitute  it.    Let  On  be  constituted 
by  a  set  of  closed  cells  {Anm},  where  m  =  1,  2,  3, 

*  De  la  Valise  Poussin,  Integrates  de  Lebcsyue,  p.  22. 
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The  set  of  cells  {Anm},  where  n  and  m  have  all  integral  values,  is  an 
enumerable  set  of  cells,  the  limiting  sum  of  whose  measures  is 


By  the  theorem  proved  in  §  122,  this  cannot  be  less  than  the  limiting  sum 
of  the  measures  of  An  A2,  ...  Am,  ...  ;  or  than  the  measure  of 


It  has  here  been  assumed  that  if  {cnm}  is  a  double  sequence  of  positive 

00         00 

numbers,  2    2  cnm  is  unaltered  by  changing  the  order  of  the  terms. 

n=l  ?M=al 

If  Olt  Oz>  ...  On,  ...  be  a  sequence  of  open  sets,  all  contained  in  a  finite  cell, 
or  interval,  and  such  that  no  two  of  them  have  a  point  in  common,  then 

m(01)+m(0,2)+...+m(On)  +  ...  =  m[M(01,  Oa,  ...  On,  ...)]. 
In  this  case,  no  two  of  the  cells  Anm  overlap  one  another,  and  the  set 

M(0lt  02,...0n,...) 

is  constituted  by  the  enumerable  set  of  cells  Anm,  where 
n  =  l,  2,  3,...;  m=l,  2,  3,.... 

00 

Both  2  m(0n)  and  m[M(01>  02,  ...)]  are  equal  to  the  limiting  sum  of  the 

n  =  l 

measures  of  the  cells  Anm. 

It  has  thus  been  shewn  that,  for  open  sets,  the  measure  is  a  completely 
additive  function;  thus  satisfying  the  postulates  (3)  and  (4),  of  §  120. 

THE  CONTENT  OR  MEASURE  OF  A  CLOSED  SET. 

124.  In  a  linear  interval,  of  length  I,  there  can  be  denned  a  set  of  non- 
overlapping  intervals,  everywhere  dense  in  the  interval,  and  such  that  the 
sum  of  the  lengths  of  the  intervals  is  less  than  an  arbitrarily  chosen  number  e. 
To  establish  this  apparently  paradoxical  theorem,  let  us  consider  an  enumerable 
set  of  points  xlt  x2,  ...  xn,  ...  everywhere  dense  in  the  interval;  for  example, 
the  rational  points  of  the  interval.  With  each  point  ocn  as  centre,  an  interval 
of  length  e/2n  may  be  defined  ;  any  part  of  the  interval  which  is  not  in  the 
fundamental  interval  may  be  omitted.  The  overlapping  set  of  open  intervals 
obtained  by  taking  all  the  values  of  n  is  equivalent  to  a  non-overlapping  set 
of  open  intervals  A,  of  totnl  measure  less  than  e,  which  is  everywhere  dense 
in  the  fundamental  interval.  The  points  of  that  interval  complementary  to 
the  set  A  form  a  non-dense  closed  set  of  content,  or  measure,  greater  than 
I  —  e.  This  can  be  made  arbitrarily  near  to  I,  by  taking  e  small  enough. 

The  following  general  theorem  will  be  established  : 

The  content  of  a  non-dense  linear  closed  set  is  zero,  in  case  the  set  is 
enumerable;  and,  in  case  tJie  set  is  ttn  enumerable,  its  content  mny  be  zero,  or  may 
have  any  value  less  than  the  length  of  the  interval  in  which  the  set  is  contained. 
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The  content,  or  measure,  of  a  closed  set  is  the  sum  of  the  measures  of  its 
perfect  component  and  of  its  enumerable  component;  and  this  last  is  zero. 
If  the  set  is  enumerable,  it  has  no  perfect  component,  and  therefore  its  content 
is  zero. 

Let  8,,  82,  8S,  ...  denote  the  lengths  of  the  intervals  contiguous  to  a  non- 
dense  closed  set  G,  in  a  fundamental  interval  of  length  L 

Let  8j  =  Xj  I,   Sa  =  \2  (I  —  $1),   83  =  \3  (I  —  8l  —  8a), . . . 

and  generally,  Sn  =  Xn  (Z  —  Si  —  8a  —  . . .  —  8»_i) ; 

where  the  numbers  X,,  \a, ...  \n, ...  are  all  between  0  and  1. 

We  find  at  once 

8»  =  \n  (1  -  Xi)  (1  -  X,)  ...  (1  -  Xn-0  I 

and  hence     I  -  (8,  +  S2  +  . . .  -I-  8B)  =  (1  -  \x)  (1  -  X,) . ..  (1  -  X»)  I. 

The  content  of  the  set  G  is  therefore*  I  multiplied  by  the  limit  of  the  product 
(l-X1)(l-Xa)...(l-XB). 

The  values  of  \lf  \a, ...  Xn,  ...  can  be  so  chosen  that  the  content  of  the  set 
is  zero ;  for  example  if  Xj  =  X,  =  \3  =  — 

If  \!  =  01,  Xa  =  02/22> . ..  X»  =  ffi/n*,  . . . ;  where  0  <  6  <  1,  the  content  of  the  set 
is  £  sin  (7r6)/Tr0.  By  choosing  a  sufficiently  small  value  of  6,  this  may  be 
made  as  near  as  we  please  to  I.  By  proper  choice  of  0,  the  content  may  have 
any  value  less  than  I. 

125.  Any  linear  closed  set  G  that  has  interior  points  contains  a  non-dense 
closed  set  H,  such  that  m  (G)  —  m  (H)  is  less  than  an  arbitrarily  chosen  number  e. 

The  set  G  may  contain  a  finite,  or  an  enumerable,  set  of  closed  intervals, 

BI,  82, ...  8n, Remove  from  G  all  the  interior  points  of  8n>  except  those 

belonging  to  a  non-dense  closed  set  gn,  of  content  greater  than  m  (8n)  —  e/2n. 
The  set  which  consists  of  G,  with  the  interior  points  of  all  the  intervals  {8n} 
removed,  except  the  points  of  glt  gz,  ...gn,  -.•>  is  a  non-dense  closed  set  H, 
such  that  m(G)  —  m  (H)  <  e. 

Similar  results  hold  for  plane  closed  sets,  or  for  closed  sets  in  space  of  any 
number  of  dimensions.  It  will  be  sufficient  to  consider  the  case  of  plane  closed 
sets. 

In  a  cell  (a(l),  a(2) ;  6(1),  6(2))  an  everywhere  dense  set  of  cells  can  be  defined,  of 
which  the  total  measure  is  either  that  of  the  given  cell,  or  has  any  less  value  >  0. 

In  the  linear  interval  (a(1),  6(1))  an  everywhere  dense  set  of  intervals  {§„} 
can  be  defined,  as  in  §  124,  of  which  the  total  measure  has  a  prescribed  value 
>  0,  and  not  exceeding  6(1)  —  a(I).  Similarly  a  set  of  intervals  {8n'}  may  be  de- 
fined in  the  linear  interval  (tt(;!),  6(2)),  of  total  measure  >  0,  and  not  exceeding 
6(2)  -  a(2). 

*  See  Harnack,  Math.  Annalen,  vol.  xix,  p.  239. 
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Consider  a  cell  for  which  the  projections  of  the  sides  on  the  two  axes  of 
coordinates  are  8n,  SVi  where  n  and  n'  is  any  pair  of  integers.  The  set  of  all 
such  cells  is  everywhere  dense  in  the  fundamental  cell,  and  the  sum  of  their 
measures  has  a  value  >  0,  and  not  exceeding  the  measure  of  the  fundamental 
cell.  The  set  of  cells  may  clearly  be  chosen  so  that  the  total  measure  has  any 
such  prescribed  value.  The  set  of  points  which  is  complementary  to  the  set 
of  open  cells  so  defined  is  a  closed  non-dense  set,  of  which  the  measure 
may  be  zero,  or  may  have  any  prescribed  value  less  than  the  measure  of  the 
fundamental  cell. 

A  closed  plane  set  G,  that  has  interior  points,  contains  a  non-dense  closed 
component  H,  such  that  m(G)  —  m(H)  is  less  than  an  arbitrarily  prescribed 
number  e. 

It  has  been  shewn,  in  §  106,  that  such  a  set  as  G  contains  a  finite,  or  an 
enumerable,  set  of  closed  connex  domains  {Dn}.  The  interior  points  of  Dn  are 
constituted  by  an  enumerable  set  of  closed  cells.  Hence  the  interior  points  of 
all  the  domains  [Dn]  together  consist  of  an  enumerable  set  of  closed  cells  |AW}. 
We  can  remove  from  Aw  all  its  points  except  those  of  a  non-dense  closed  set 
of  measure  >  m  (Aw)  —  e/2w.  When  this  has  been  done  for  all  the  cells  {An},  we 
have  remaining  a  part  H,  of  G,  such  that  m(G)  —  m(H)  is  less  than  e;  for 
the  sets  of  points  removed  have  a  total  measure  <  2e/2n,  or  e.  The  set  H  is 
non-dense  in  the  fundamental  cell. 

EXAMPLES. 

1.  The  perfect  set  of  points  defined  by  x=°^  +  &  +  ••'+£  +  •••  >  where  the  numbers 

o        o  o 

<?!,  c2,  ...  have  each  one  of  the  values  0,  2  (see  Ex.  1,  §  83),  has  the  content  zero.    For  the 
limit  of  the  sum  of  the  lengths  of  the  complementary  intervals  is  unity. 

2.  The  non-dense  closed  set  considered  in  Ex.  3,  §  83,  has  the  content  zero.  For,  after/: 
operations,  the  sum  of  the  exempted  segments  is 


m        m*  m,"  mK  \    m 

When  k  is  increased  indefinitely,  the  limit  of  the  sum  of  the  free  intervals  is  1. 

3.    The  non-dense  closed  set  considered  in  Ex.  4,  §  83,  has  a  content  between  0  and  1. 
After  k  operations,  the  sum  of  the  exempted  segments  is 

!_      m-l      (m-l)(m*-I)  (m-  1)  (m2-!)  ...  (m*"1  -  1) 

m+^T'  m6  +•••- 


The  limit  of  the  sum  of  the  exempted  intervals  is  1  -  n  (  1  --  r  1  .  and  therefore  the  con- 

k=\      mkj 

00      /  1     \ 

tent  of  the  set  of  points  is  n  (  1  --  ;  )  ,  which  is  between  0  and  1,  depending  upon  the  value 

fc=A       m*/ 

of  m.    By  taking  m  sufficiently  great,  the  content  of  the  set  may  be  made  arbitrarily  near 
to  unitv. 
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THE  EXTERIOR  AND  INTERIOR  MEASURES  OF  A  SET. 

126.  Let  0  be  any  linear  set  of  points  in  a  given  interval  (a,  6),  and 
let  a  finite,  or  an  infinite,  set  of  non-overlapping  open  intervals  be  defined, 
such  that  every  point  of  G  is  in  one  of  these  intervals.  The  point  a  is 
regarded  as  an  interior  point  of  any  interval  of  which  it  is  an  end-point ;  and 
a  similar  remark  applies  to  b.  The  set  of  intervals  constitutes  an  open  set 
which  contains  G.  The  sum,  or  limiting  sum,  of  the  lengths  of  the  intervals 
has  a  definite  value,  not  greater  than  I.  The  lower  boundary  of  this  sum  or 
limiting  sum,  for  all  possible  such  sets  of  intervals,  is  a  number  which  is  called 
the  exterior  measure  of  the  set  G;  and  this  may  be  denoted  by  me(G). 

Alternatively,  the  exterior  measure  me(G\  of  the  set  G,  may  be  defined  as 
the  lower  boundary  of  the  sum,  or  limiting  sum,  of  the  lengths  of  the  intervals 
of  a  set  of  non -overlapping  closed  intervals  such  that  every  point  of  P  is  in  a 
closed  interval  of  the  set,  being  either  an  interior  point,  or  an  end-point,  when 
all  such  systems  of  closed  intervals  are  taken  into  account. 

To  shew  the  equivalence  of  the  two  definitions,  let  L1}  L2  denote  the  lower 
boundaries  defined  in  accordance  with  the  first,  and  the  second  definitions  re- 
spectively. A  set  of  non-overlapping  open  intervals  containing  G  can  be 
defined,  such  that  the  sum,  or  limiting  sum,  of  their  lengths  is  <  />j  +  e ;  where 
e  is  an  arbitrarily  chosen  positive  number.  Since  each  open  interval  is  con- 
stituted by  an  enumerable  set  of  non -overlapping  closed  intervals ;  the  set  of 
open  intervals  can  be  replaced  by  a  set  of  closed  intervals  which  contain  G; 
and  the  limiting  sum  of  the  lengths  of  the  intervals  is  the  same  in  the  two 
cases.  This  shews  that  L2  ^  Llt  since  we  have  a  set  of  closed  intervals  satisfying 
the  requisite  condition,  the  limiting  sum  of  whose  lengths  is  <  Ll  +  e.  Again, 
it  is  impossible  that  L2  <  Ll ;  for  if  a  set  of  non-overlapping  closed  intervals 
containing  all  the  points  of  G  existed,  the  limiting  sum  of  whose  lengths  is  <  Ll} 
we  could  enclose  each  of  these  intervals  in  an  open  interval,  of  length  exceeding 
that  of  the  closed  interval  by  less  than  an  arbitrarily  chosen  number,  and  in 
this  way  we  could  obtain  an  overlapping  set  of  open  intervals  containing  G, 
and  such  that  the  limiting  sum  of  their  lengths  would  be  <  Z/lt  This  over- 
lapping set  might  be  replaced  by  the  equivalent  set  of  non-overlapping  intervals, 
and  the  limiting  sum  of  the  latter  intervals  cannot  exceed  that  of  the  former, 
as  has  been  shewn  in  the  first  theorem  of  §123.  We  should  therefore  have  a 
set  of  non-overlapping  open  intervals,  enclosing  G,  the  limiting  sum  of  whose 
lengths  would  be  <  Llt  contrary  to  the  fact  that  Ll  is  the  lower  boundary 
of  all  such  limiting  sums. 

Let  C(G)  be  the  complementary  set  of  G,  relatively  to  the  interval  (a,  b) 
of  length  I,  in  which  G  is  contained.  If  me  [C(G)}  denote  the  exterior  measure 
of  C(G),  the  number  l  —  me{C(G)}  is  taken  to  define  the  interior  measure  of 
the  set  G ;  and  this  interior  measure  may  be  denoted  by  ?«,•((?). 
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127.  In  the  case  of  a  set  G  contained  in  a  finite  cell  of  any  number  of 
dimensions,  a  connex  open  domain  takes  the  place  of  an  open  interval  ;  and 
thus  the  exterior  measure  me  (G)  is  defined  as  the  lower  boundary  of  the 
limiting  sums  of  the  measures  of  connex  open  domains  which  form  a  finite,  or  an 
infinite,  sequence  containing  all  the  points  of  G,  when  all  such  sets  of  connex 
open  domains  are  taken  into  account.  Again,  since  any  open  set  is  constituted 
by  a  sequence  of  closed  cells,  the  exterior  measure  me(G)  may  also  be  defined  as 
the  lower  boundary  of  the  limiting  sums  of  the  measures  of  sequences  of  closed 
cells,  each  sequence  containing  all  the  points  of  G  in  the  interior  and  on  the 
boundaries  of  the  cells,  when  all  such  sequences  are  taken  into  account.  That 
this  second  form  of  the  definition  is  equivalent  with  the  first  may  be  shewn  in 
exactly  the  same  manner  as  in  the  case  of  a  linear  set  ;  for  every  sequence  of 
closed  cells  forms  part  of  a  sequence  of  open  cells,  such  that  the  difference  of 
the  sums  of  the  measures  of  the  cells  in  the  latter  sequence  and  in  the 
former  is  arbitrarily  small. 

As  before,  the  interior  measure  m{  (G)  of  G  is  defined  by 


where  I  is  the  measure  of  the  cell  in  which  G  is  contained. 

The  above  definitions  of  the  exterior  and  interior  measures  of  a  bounded 
set  of  points  in  any  number  of  dimensions  are  equivalent  to  the  following 

statements  : 

The  exterior*  measure  me(G)  of  a  bounded  set  G  is  the  lower  boundary  of 
the  measures  of  open  sets  which  contain  G.  The  interior  -\  measure  mi(G)  of  a 
bounded  set  G  is  the  upper  boundary  of  the  measures  of  closed  sets  contained  in  G. 

For,  as  the  exterior  measure  me  {C  (G)}  is  the  lower  boundary  of  open  sets 
which  contain  C  (G),  it  follows  that  l  —  me{C(G)}  is  the  upper  boundary  of 
the  measures  of  the  closed  sets  complementary  to  these  open  sets. 

If  a  set  G!  contains  a  set  G2,  it  is  clear  from  the  definition  that 

m.  (0.)  £«•«?•)• 

MEASURABLE  SETS  OF  POINTS. 

128.  It  appears  from  the  above  definitions  that  every  bounded  set  of 
points  has  definite  exterior  and  interior  measures. 

When  the  exterior  and  interior  measures  of  a  set  G,  of  points  inp  dimensions, 
are  equal  to  one  another,  the  set  G  is  said  to  be  measurable,  and  the  number 
me(G)  =  m{(G)  is  defined  to  be  the  measure  of  G.  When  G  is  measurable  its 
measure  is  denoted  by  m  (G). 

It  is  clear  from  this  definition  that,  if  G  is  measurable,  so  also  is  C(G). 

*  This  form  of  the  definition  is  due  to  De  la  Valise  Poussin,  see  Integrates  dc  Lebesgue,  p.  22. 
f  See  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  n,  p.  28. 
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It  will  be  shewn  that  this  definition  satisfies  the  conditions  stated  in  §  120, 
whenever  it  is  applicable.  If  a  is  a  set  of  non-overlapping  cells,  or  intervals, 
enclosing  a  set  of  points  G,  measurable,  or  not,  and  ft  is  a  similar  set  for  C(0), 
then  all  the  points  of  the  fundamental  cell,  or  interval,  are  enclosed  in  the  set 
made  up  of  «  and  ft.  It  follows  that  m(a)  +  m(ft)  ^  I;  and  since  me(G), 
mf{C(G)}  are  the  lower  boundaries  of  m(ci),  m(ft),  respectively,  we  have 
me  (G)  +  me  {G  (G)}  £  I,  and  therefore  me  (G)  £  m<  (G). 

The  condition  that  a  set  G  is  measurable,  in  the  sense  above  defined,  may 
be  stated  in  either  of  the  following  forms  : 

A  set  of  points  G  is  measurable  if  an  open  set  0,  containing  G,  and  a  closed 
set  H,  contained  in  G,  can  be  so  determined  that  m(0)  —  m  (H)  is  less  than  an 
arbitrarily  prescribed  positive  number  e. 

A  set  of  points  G  is  measurable  if  its  points  can  be  enclosed  in  a  finite,  or  an 
infinite,  set  a,  of  open  intervals  (in  the  case  of  linear  sets),  or  of  open  connex 
domains  (in  the  case  of  sets  in  space  of  higher  dimensions),  and  if  C(G)  can  be 
similarly  contained  in  a  set  ft,  such  that  the  sum,  or  limiting  sum,  of  the  measures 
of  the  open  intervals,  or  connex  domains,  which  contain  all  those  points  which  are 
common  to  a.  and  ft  is  arbitrarily  small. 

The  set  of  points  D(a,  0)  forms  an  open  set  (§  56)  which,  if  the  condition 
be  satisfied,  may  be  taken  to  have  its  measure  <  6.  As  is  easily  seen  by  con- 
sidering the  three  sets  a,  ft,  D  (a,  ft),  we  have  m  (a)  +  m(ft)  =  m{D  (a,  ft)}  +  /  ; 
where  I  is  the  measure  of  the  cell,  or  interval,  in  which  the  given  set  is 
contained.  It  then  follows  that 

{m(a)-mi(G)}  +  {m(ft)-me[C(G)]}<6. 

Since  m(a)-mi(G),  m(ft)  —  me[C  (G)]  are  not  negative,  it  follows  that  each 
of  them  is  <  e  ;  also  m  (a)  ^  me  (G)  ^  mt  (G),  hence  me  (G)  -  m,i  (G)  <  e.  Since 
this  holds  for  every  value  of  e,  we  have  me  (G)  =  raj  (G)  ;  and  thus  the  condition 
stated  is  sufficient  that  the  set  G  may  be  measurable.  If  G  be  assumed  to  be 
measurable,  a.  and  ft  can  be  so  chosen  that  m  (a)  —  m  (G)  <  |e,  and 


we  then  have  m  (a)  +  m  (ft)  <  e  +  1,  which  is  equivalent  to  the  condition  that 
m  {D  (a,  ft)}  <  e  ;  and  thus  the  condition  is  necessary. 

129.  It  must  be  shewn  that  the  definitions,  given  in  §§120  and  122,  of  the 
measures  of  open,  closed,  or  partly  closed,  cells  or  intervals,  and  of  open  and  closed 
sets,  are  consistent  with  the  definition  of  a  measurable  set  given  in  §  1*28,  that  is. 
that  these  sets  have  their  exterior  and  interior  measures  equal  to  one  another. 

The  cell  (a(1),  a(2),  ...  a(p);  bw,  b(2),  ...  bw)  is  contained  in  the  cell 


-  -  a,  a 


™  -  ~     (2)  - 


_L      (D  _  a(D)    fli'i 


2n 
and  contains  the  cell  obtained  by  writing  —  n  instead  of  n. 
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/      i\p     /      I\P 

The  measures  of  these  cells  are     1  +  - 1  I,    I  1 )  I,  respectively,  and 

\        n)        \       n) 

thus  differ  from  one  another  by  <  e,  if  n  be  chosen  sufficiently  large.  There- 
fore the  exterior  and  interior  measures  of  a  cell  are  identical  with  I,  the 
measure  of  the  cell,  as  denned  in  §  120. 

It  is  clear  that  the  exterior  measure  of  an  open  set  0  is  identical  with  the 
measure  of  the  set,  as  defined  in  §  121,  for  it  has  been  shewn  that  no  open  set 
of  measure  less  than  that  of  the  set  can  contain  0.  Also  0  contains  the  closed 
set  formed  by  a  finite  number  of  the  closed  cells,  or  intervals,  which  constitute 
0,  and  the  sum  of  the  measures  of  these  cells,  or  intervals,  converges  to  the 
measure  of  0 ;  thus  0  is  contained  in  an  open  set,  consisting  of  0  itself,  and 
contains  a  closed  set,  such  that  the  difference  of  the  measures  of  the  open  and 
closed  sets  is  arbitrarily  small ;  therefore  both  the  exterior  and  interior 
measures  of  0  are  identical  with  the  measure  of  0,  as  defined  in  §  121.  Since  a 
closed  set  is  the  complement  of  an  open  set  0,  it  follows  that  the  closed  set  is 
measurable.  It  has  thus  been  proved  that : 

Every  bounded  open  set,  and  every  bounded  closed  set,  is  measurable. 

It  will  now  be  shewn  that : 

Every  enumerable  set  of  points  is  measurable,  and  its  measure  is  zero. 

Let  P,,  P2, ...  Pr,  ...  denote  the  points  of  the  set.  Each  point  Pr  can  be 
enclosed  in  an  open  cell,  or  interval,  with  Pr  as  centre,  and  of  measure  e/2r. 
In  accordance  with  a  theorem  proved  in  §  123,  the, open  set  of  points  con- 
sisting of  all  the  points  belonging  to  one  or  more^of  these  cells,  or  intervals, 

00 

has  its  measure  £  S  e/2r,  or  ^  e.    It  follows  that  the  exterior  measure  of  the 

r=l 

set  is  zero,  and  thus  that  the  interior  measure  is  also  zero,  and  that  the  set  is 
measurable,  with  measure  equal  to  zero. 

130.  IfGi,G.2,...Gn,...is  an  enumerable  (or  a  finite)  sequence  of  measurable 
sets,  all  contained  in  a  finite  cell,  or  interval,  the  set  M  (G^ ,  G2,  ...  Gn,  . . .),  which 
consists  of  all  points  that  belong  to  one  or  more  of  the  sets,  is  measurable. 

Let  Gj,  and  C(Gl)  be  contained  in  open  connex  domains,  or  open  intervals, 
of  sets  «!,  @i,  of  such  domains,  or  intervals,  so  defined  that  the  measure  of 
D  («! ,  /3j)  is  <  £e.  Let  «2,  /32  be  similar  sets  of  connex  open  domains,  or  intervals, 
which  contain  G2  and  C  (G2)  respectively,  and  are  such  that  m  [D  (a2,  /32)]  <  e/22. 
Let  a2'  =  D(a2, /Si),  and  @2' =  D  (j32,  fti).  For  G3  and  C (G3)  we  similarly  con- 
sider sets  «3,  (33,  of  open  domains,  or  intervals,  such  that  m[D(as,  /33)]<e/23; 
and  let  a3'  =  D(a3,  /32'),  @3'  =  D((3t,  /32') ;  and  so  on,  for  all  the  sets  G4,  G5  .... 
The  points  of  M(Gl,  G.,,  G3>  ...)  are  all  contained  in  the  sets  of  domains,  or 
intervals,  alt  a2',  a3',  ...  ;  and  C{M(G1,  G2,  ...)}  is  contained  in  /3/,  whatever 
value  i  may  have. 
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The  two  sets  of  domains,  or  intervals,  which  enclose  M  ((?,,  G9,  ...)  and  its 
complement,  have  in  common  a  set  of  points  of  which  the  measure  is  less  than 

e/2  +  e/2'  +  . . .  +  e/2<  +  m  (a'i+1)  +  m  (a'i+a)  +  •  •  •  • 

The  series  Sm  (a')  is  convergent,  since  each  term  is  positive,  and  the  sum  of 
any  number  of  terms  is  less  than  a  fixed  finite  number.  Therefore  the  number 
t  can  be  so  chosen  that  m  (a',-f ,)  +  m  (a't+2)  + . . .  is  less  than  e. 

Now  M (Glt  GZ,  ...)  and  its  complement  have  been  enclosed  in  sets  of 
intervals,  or  cells,  such  that  the  set  common  to  both  seta  has  measure  <  2e. 
Since  e  is  arbitrary,  it  has  been  proved  that  M  (Glt  G2,  ..•)  satisfies  the  criterion 
that  it  is  measurable. 

If  Glt  GZ,  . . .  Gn,  . . .  be  measurable  sets  in  linear  or  higher  dimensional  space, 
all  contained  in  a  bounded  domain,  and  no  two  of  the  sets  have  a  point  in  common, 
the  measure  of  G^  +  Gz  +  . . .  is  the  limiting  sum  of  the  measures  of  the  sets. 

That  GI  +  G2  +  . . .  is  measurable  is  a  particular  case  of  the  last  theorem. 

In  accordance  with  what  has  been  shewn  above  m(G1  +  Gz  +  ...)  differs 
from  the  measure  of  the  set  consisting  of  all  the  sets  aly  a2',  a3', ...  by  less  than 
e.  Also  GZ,  G3,  ...  are  all  parts  of  C(G1);  so  that  G±  is  enclosed  in  a1}  G2  in 
o2',  G3  in  a3', . . . ;  hence  m  (G^  differs  from  m  («i)  by  less  than  £e,  m  (G2)  differs 

from  m  (a/)  by  less  than  ^  e,  and  in  general  m  (Gi)  differs  from  m  (a/)  by  less 

00 

than  e/2'.    Therefore  2m((rt-)  differs  from  m  («1)  +  m(a2/)  +  •••  +m  (a/)  +  ... 
t=i 

00 

by  less  than  e.    It  follows  that  2  m(GV)  and  m(Gl-\-  G2+  ...)  differ  from  one 

i  =  l 

another  by  less  than  2e;  and,  since  e  is  arbitrarily  small,  the  theorem  is 
established.       • 

It  should  be  observed  that  although,  for  each  set  Gi,  a  pair  of  sets  of 
intervals  at-,  &  can  be  so  determined  that  the  measure  of  D(a{,  &•)  is  less  than 
the  prescribed  number  e/2*,  there  are  an  infinite  number  of  such  pairs  at-,  & 
which  satisfy  this  condition,  and  one  such  pair  is  selected  out  of  this  infinite 
number.  It  is  however  assumed,  in  the  above  proof,  that  an  indefinite  number  of 
such  selections  is  made,  namely  one  for  each  value  of  i.  It  may  not  be  possible  to 
give  a  law  which  determines  how  this  selection  is  to  be  made  for  every  value 
oft,  unless  the  given  sets  are  of  some  particular  nature  such  that  we  are  able  to 
assign  such  a  law.  Consequently,  the  existence  has  been  assumed  of  the  sets 
ai,  j3j,  for  i=  1,  2,  3,  ...,  independently  of  whether  we  are  able  to  define  these 
sets  or  not.  This  assumption  is  a  particular  case  of  an  axiom  which  will  be 
discussed  in  Chapter  iv,  and  which  is  known  as  Zermelo's  axiom,  or  as  the 
general  principle  of  selection.  A  corresponding  remark  is  applicable  to  another 
proof  of  the  above  theorem  which  has  been  given*  by  de  la  Valle'e  Poussin. 
This  proof  depends  upon  the  fact  that  a  measurable  set  Gn  is  the  sum  of  a 

*  See  Integrates  de  Lebesgue,  pp.  22 — 25. 
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closed  set  Fn  and  of  a  set  en,  of  which  the  exterior  measure  is  less  than  an 
arbitrarily  small  number  en.  For  an  enumerable  sequence  of  sets,  the  choice 
of  the  sets  Fn  requires  the  use  of  Zermelo's  axiom  (see  §  197). 

131.    If  a  measurable  set  Gl  contains  another  measurable  set  Gz,  the  set 
GI  —  G2  is  measurable,  and  its  measure  is  m  (Gi)  —  m  (G2). 

The  complement  of  G1  —  G2  consists  of  G2  together  with  C  (6^),  hence 
Gl  —  G2  is  measurable.    Further,  since  G1  =  (Gl  —  G2)  +  6r2,  we  have 


If  GI,  G2,  ...  Gn,  ...  are  all  measurable  sets,  the  set  D(Gl,  G2,  •••  Gn,  ...)  of 
points  common  to  all  the  given  sets  is  also  'measurable. 

For  the  complement  of  D(G1}  G2,  ...)  is  M{C(G1),  C(G2),  ...};  and  since 
C  (G^),  C  (G2),  ...  are  all  measurable,  it  follows  that  the  complement  of 
•D(Gi,  G2,  ...)  is  measurable,  and  therefore  that  the  set  itself  is  measurable. 

If*  H  is  the  set  of  points  each  of  which  belongs  to  an  infinite  number  of  the 
measurable  sets  G1}  G2,  G3,  ...,  the  set  H  is  measurable. 

For  the  set  C  (H)  consists  of  those  points  which  belong  to  none,  or  only  to 
a  finite  number,  of  the  sets  G1}  G2,  ...  ;  and  hence  C(H}  consists  of  the  points 
belonging  to  one  or  more  of  the  sets  Ll}  L2,  ...  Ln,  ...,  where  Ln  denotes  the 
set  D  [C(Gn\  C  (Gn+l),  ...}.  The  sets  Ln  are  all  measurable,  and  hence  C  (H) 
is  measurable  ;  and  therefore  H  is  measurable. 

If*  K  is  the  set  of  points  each  of  which  belongs  to  all  the  measurable  sets 
Gn<  Gn+l,  ....  where  n  has  a  definite  value  for  each  point  of  K,  the  set  K  is 
measurable. 

For  the  set  C(K)  is  the  set  of  points  each  of  which  belongs  to  an  infinite 
number  of  the  measurable  sets  C  (G^),  C  (G2),  ...,  and  hence,  by  the  last  theorem, 
C  (K)  is  measurable.  Therefore  K  is  a  measurable  set. 

If  GM  is  the  inner  limiting  set  of  a  sequence  {Gn}  of  measurable  sets  Gn, 
each  of  which  contains  the  next,  m  (G,,)  =  lim  m  (Gn). 

w~oo 

For  (G,  -  G.2)  +  (GZ-G3)  +  ...  =  £,-£„ 

and  m  (Gn  -  Gn+l)  =  m  (Gn)  -  m  (Gn+1), 

hence  m  (G^)  —  lim  m  (Gn)  =  m(Gl—  GM)  =  m  (G^)  —  m  (Gu), 

n~  oo 

and  therefore  m  (6rw)  =  lim  m  (Gn). 

HrvtD 

If  GM  is  the  outer  limiting  set  of  a  sequence  [Gn]  of  measurable  sets  Gn,each 
of  which  is  contained  in  the  next,  and  if  GM  is  a  bounded  set,  m  (G^)  =  lim  m  (Gn). 

n  ^  oo 

*  See  Borel's  Lejon*  sur  lesfonctions  de  variables  rtelles,  p.  18.  The  set  His  named  by  Borel  the 
"  ensemble  limite  complet,"  and  the  set  K  the  "  ensemble  limite  restraint  "  of  the  given  sequence 
of  sets. 
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For  0,  +  (08-  0,)  +  (0,-  0,)+  ...  =  Gu 

and  therefore  m  (Gu)  =  lim  m  (Gn). 

n~« 

If*  0,,  0a,  ...  0n,  ...  is  a  sequence  of  sets  stick  that 

\imM(Gn,  Gn+l,  ...)  =  limD(0n,  On+l,  ...), 

n  ~  '  a  ^  * 

then,  if  either  of  these  limits  be  denoted  by  Ga,  m(Gv)  =  lim  m(Gn). 

n  ~«> 

The  sets  M(Gn,  Gn+l,  .  .  .)  are  such  that  each  contains  the  next,  and  therefore 
m(Gw)  =  lim  m[M(Gn,  0W+1,  ...)]  =  limm(0B);  and  the  sets  D(Gn,  0n+1,  ...) 

«~oo  M~ao 

are  such  that  each  is  contained  in  the  next,  and  therefore 

0»+1,  ...)]*  lim  m(0B); 


we  thus  conclude  that  m  ((•?„)  =  lim  m  (Gn). 

n~oo 

In  general  lim  M  (Gn,  Gn+1,  ...)  is  called  the  upper  limit  of  Gn,  and  may 

n~  oo 

be  denoted  by  lim  0B  ;  and  lim  D  (Gn,  Gn+1>  .  .  .)  is  called  the  lower  limit  of  Gn, 

n  ~  oo  n  ~  oo 

and  may  be  denoted  by  lim  Gn.    In  the  case  contemplated  in  the  theorem,  Gn 

n  ~  «> 

has  a  unique  limit  denoted  by  lim  6rn,  or  6rw. 

M~00 

SETS  THAT  ARE  MEASURABLE  (B). 

132.  All  the  sets  which  have  been  shewn  to  be  measurable,  in  accordance 
with  the  definition  of  a  measurable  set  given  in  §  128,  are  obtained  from  the 
single  point,  the  single  interval,  or  cell,  open  or  closed,  by  taking  a  finite,  or 
enumerably  infinite,  set  of  such  fundamental  sets,  and  by  taking  the  set  common 
to  a  finite,  oran  enumerably  infinite,  number  of  the  sets  so  obtained,  or  by  taking 
the  complements  of  the  sets  so  obtained.  All  sets  defined  in  this  manner  are 
said  to  be  measurable  (B),  since  they  were  the  only  kind  of  measurable  sets 
contemplated  by  Borel  in  his  original  treatment  of  metric  properties.  Thus 
the  complement  of  any  set  that  is  measurable  (B)  is  so  also  ;  the  set  which  is 
common  to  any  finite,  or  enumerably  infinite,  number  of  sets  all  measurable  (B) 
is  also  measurable  (B)  ;  also  closed  and  completely  open  sets  are  all  measurable 
(B).  It  can  be  shewn  however  that  measurable  sets  may  exist  which  are  not 
measurable  (B).  For  example,  any  component  whatever  of  a  perfect  set,  of 
measure  zero,  has  its  external  measure  zero,  and  is  therefore  measurable.  The 
question  whether  a  valid  definition  can  be  given  of  a  set  which  is  not  measurable 
will  not  be  here  discussed,  as  the  question  of  the  validity  of  such  definitions 
depends  upon  debatable  questions  in  the  abstract  theory  of  aggregates.  The 
relation  of  measurable  sets  in  general  to  those  which  are  measurable  (B)  is 
contained  in  the  following  theorem  : 

*  See  de  la  Vallee  Poussin,  Amer.  Journal  of  Math.,  vol.  xvr,  p.  437. 
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Any  measurable  set  G  is  contained  in  a  set  Gl  which  is  measurable  (B),  and 
such  that  m  (G^  =  m  (G) ;  and  G  contains  a  set  G2  which  is  measurable  (B),  and 
such  that  m  (6r2)  =  m  (G). 

If  |en]  denote  a  sequence  of  decreasing  positive  numbers  converging  to  zero, 
the  set  G  can  be  enclosed  in  a  set  of  non-overlapping  intervals,  or  open  domains, 
{an}  of  total  measure  <  m  (G)  +  en. 

The  inner  limiting  set  Gl  of  the  sequence  of  sets  of  intervals,  or  domains,  is 
measurable  (B),  and  its  measure  is  m(G),  as  has  been  shewn  in  §  131;  also  G! 
contains  G  as  a  component.  The  set  Gl  —  G  has  measure  zero,  and  its  points 
can  be  enclosed  in  a  set  of  intervals,  or  domains,  @n,  contained  in  an,  and  there- 
fore of  measure  <  en.  The  inner  limiting  set  H,  of  all  the  set  /9n,  is  measurable 
(B),  and  its  measure  is  zero ;  and  the  set  Gl  —  H  =  G2  is  measurable  (B),  and 
has  m(G)  for  its  measure.  Moreover  6r2  is  contained  in  G. 

A  classification  of  sets  of  measure  zero  has,  on  account  of  the  fundamental 
importance  of  such  sets  in  the  theory  of  functions,  been  undertaken*  by  Borel. 

CONGRUENT  SETS. 

133.  The  definition  of  the  measure  of  a  set,  given  in  §  128,  having  been 
shewn  to  apply  to  the  class  of  measurable  sets,  it  has  been  shewn  that,  so  far 
as  such  measurable  sets  are  concerned, wehaveametric  system  which  satisfiesthe 
postulates  (1),  (2),  (3),  (4),  of  §  120,  namely  that  the  measure  of  such  a  set  is  a 
completely  additive  system,  and  that  it  is  in  agreement  with  the  elementary 
theory  of  the  measure  of  lengths,  areas,  and  volumes,  of  intervals  and  cells.  It 
only  remains  to  be  shewn  that  (5),  of  §  120,  is  satisfied  by  the  definition  which 
has  been  introduced  and  developed. 

A  congruent  transformation  of  a  set  of  points  is,  expressed  in  geometrical 
language,  equivalent  to  a  translation  together  with  a  rotation.  That  a  trans- 
lation does  not  affect  the  metric  properties  is  obvious  from  the  fact  that  it 
only  consists  of  an  addition  of  fixed  numbers  h1}  hz,  ...  hp  to  the  numbers 
xl,  xZ)  ...  xp  which  represent  a  point;  and  in  all  the  definitions  and  proofs 
connected  with  the  metric  properties  of  a  set,  only  the  differences  #/ —  xl} 
xs  ~  xn  •••  f°r  pairs  of  points  are  involved.  So  far  as  a  transformation  by  rotation 
is  concerned,  it  is  only  necessary  to  shew  that  a  cell  when  rotated  has  a  measure 
which  is  equal  to  that  of  the  original  cell.  It  will  be  observed  that  all  the 
cells  employed  in  the  descriptive  properties  of  sets,  and  so  far  in  the  metric 
theory,  are  orientated  alike.  We  cannot  therefore,  ab  initio,  consider  the  set 
of  points  of  a  cell,  as  having  its  measure  unaltered  when  the  cell  is  rotated,  and 
when  it  is  therefore  no  longer  a  cell  in  the  original  sense  of  the  term.  For 
simplicity  the  case  of  the  rectangle,  or  plane  cell,  will  here  be  considered. 

*  See  Hull,  de  la  soc.  math,  de  France,  vol.  XLI,  p.  1. 
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The  original  cell  may  be  taken  to  be  the  set  of  points  (x,y)  for  which 
—  a  £  x<a,    —  b  £  y  £.b. 

Corresponding  to  (x,  y)  we  have  in  the  congruent  set.  obtained  by  rotation 
through  an  angle  a,x'  =  x  cos  a  +  y  sin  a,  y'=  —  x  sin  a  +  y  cos  a.  Thus  the  points 
of  the  cell  correspond  to  the  points  of  the  three  sets  defined  by 

—  sec  a  (a  +  y  sin  a)  ^  x  £  cosec  a(b  +  y  cos  a) 

a  sin  a  +  b  cos  a£  y<  asina  —  b  cos  a, 

—  sec  a  (a  +  y  sin  a)  ^  a  ^  sec  a  (a  —  y  sin  a) 

a  sin  a  —  6  cos  a£y<b  cos  a  —  a  sin  a, 

6  cos  a  —  asma£y  £  asina  +  b  cos  a 

—  cosec  a(b  —  y  cos  a)  ^  x  £  sec  a(a  —  y  sin  a), 
provided  tan  a  <  b/a. 

In  the  first  of  the  above  sets,  divide  the  interval  of  y  into  n  equal  parts ; 
and  consider  the  part  for  which  y  is  in  one  of  these  parts  (—  cr,  —  cr+1),  where 
cr  <  cr+l.  The  corresponding  part  of  the  set  includes  the  rectangle  of  which  the 
measure  is 

(cr+1  —  cr)  {a  sec  a  +  b  cosec  a  —  c,.+1  (tan  a  +  cot  a)}, 
and  is  included  in  the  rectangle  of  which  the  area  is 

(cr+1  —  cr)  {a  sec  a  +  b  cosec  a  —  cr  (tan  a  +  cot  a)}. 

The  difference  of  these  is  (cr+1  —  cr)-  (cot  a  +  tan  a). 

The  sum  of  all  such  differences  is  less  than  n~*  (cot  a  +  tan  a)  4a2  sin2  a,  which 
can  be  made  arbitrarily  small  by  sufficiently  increasing  n.  The  measure  of  the 
set  is  then  easily  shewn  to  be  2a'2  tan  a.  Similarly  it  can  be  shewn  that  the 
measures  of  the  other  two  sets  are  (b  cos  a  —  a  sin  a)  4a  sec  a,  and  2a2  tan  a ;  thus 
the  sum  of  the  measures  of  the  three  parts  is  4>ab,  which  is  the  measure  of 
the  original  cell. 

The  cases  in  which  tan  a  has  values  >  b/a  may  be  considered  in  a  similar 
manner.  The  method  here  adopted  is  clearly  capable  of  extension  to  the  case 
of  a  cell  in  any  number  of  dimensions. 

THE  MEASURE  OF  UNBOUNDED  SETS. 

134.  Let  G  be  an  unbounded  set  of  p  dimensions,  and  let  it  be  such  that 
the  component  "of  G  which  is  in  any  finite  ^-dimensional  cell  is  measurable. 
Consider  a  sequence  of  such  cells,  each  of  which  is  contained  in  the  next,  and 
such  that  if  (an(1),  an(-\  . . .  an(l>) ;  bn(l),  bn(-\  . . .  bn(p})  is  the  nth  cell  of  the  sequence, 
On(1),  ««(2)>  •••  «n(p)  all  have  the  limit  -  oo  ,  and  6B{1),  bn(-\  ...  bH{"}  all  have  the 
limit  +  oo  ,  as  n  is  indefinitely  increased.  If  Gn  be  the  component  of  G  in  this 
cell,  the  sequence  of  numbers  m(Gn)  is  non-diminishing.  Consequently 
m  (Gn)  either  converges  to  a  finite  limit,  or  becomes  indefinitely  great,  as  n 
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is  indefinitely  increased.    If  lim  m  (Gn)  is  finite,  the  set  0  is  regarded  as 

tt~oo 

measurable,  and  m(G)  is  defined  to  be  equal  to  the  limit.  If  the  limit  is  in- 
definitely great,  G  is  measurable,  but  of  infinite  measure.  If  H  be  any  bounded 
measurable  set  of  points,  it  is  contained  in  the  nth  cell,  if  n  be  sufficiently 
great.  The  set  D  (G,  H)  is  the  same  as  D  (Gn>  H),  and  this  set  is  measurable. 
Also  m  [D  (G,  HJ]  £  m  (Gn).  If  H  be  identified  successively  with  the  sets  Hr 
of  a  sequence,  such  that  Hr  is  contained  in  Hr+1,  we  have 

m  [D  (G,  Hry\  £  m(Gnr)  £  m  (G), 

where  nr  is  an  integer  corresponding  to  r.  The  sequence  of  numbers 
m  [D  (G,Hr)]  is  monotone,  non-decreasing,  and  has  a  limit  ^  m  (G).  This  holds 
good,  even  if  lim  m  (Hr)  is  indefinitely  great.  Thus  the  part  of  G  that  is 

r—oo 

contained  in  any  measurable  set  H  is  measurable,  if  G  be  so,  and  its  measure 
does  not  exceed  m  (G). 

The  theorem  m  (6?i)  +  m  (G2)  -f  ...  —  m  (Gi  +  G2  +...)>  where  Gl9  G2,  ...  are 
measurable  sets,  no  two  of  which  have  a  point  in  common,  and  such  that 
G1  +  GZ+  ...  has  a  finite  measure,  holds  good  when  Glt  6r2,  ...  are  unbounded 
measurable  sets.  For,  if  An  denote  the  cell  employed  above,  we  have 

m  [D  (An,  G,)]  +  m  [D  (An,  0,)]  +...  =  m[D  (A7l,  G,  +  G2  +  .  .  .)]. 
If  r  be  so  large  that  m  (G>+])  +  m  (Gr+z)  +  .  .  .  <  e,  we  have  also 

m  [D  (AB,  £r+1)]  +  m  [D  (An,  Gr+1)]  +  ...<€, 
for  every  value  of  n. 
Hence,  we  have 
m 


where  6n  is  such  that  0  <  6n  <  1. 

It  follows,  by  letting  n  increase  indefinitely,  that 

m  (G,}  +  m  ((?,)  +  ...  +  m  (Gr)  +  0e  =  m  (G,  +  G2  +...), 
where  9  is  such  that  0  ^  6  £  1.    Since  e  is  arbitrary,  it  follows  that 

m(Gl)  +  m(G2)  +  ... 
converges  to  m  (Gj  +  G.2  +...)• 

The  other  theorems  relating  to  bounded  measurable  sets  may  be  extended 
to  unbounded  measurable  sets  of  finite  measure. 

THE  MEASURE  OF  SETS  RELATED  TO  A  SYSTEM  OF  SETS. 

135.  It  has  been  shewn  in  §  131  that,  for  a  sequence  [Gn]  of  measurable  sets, 
each  of  which  contains  the  next,  if  m  (Gn)  is,  for  every  value  of  //,  greater  than 
a  fixed  positive  number  G,  the  measure  of  the  inner  limiting  set  is  =  C. 
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In  case  the  sets  £,  ,  Ot,  •  .  .  ,  each  of  which  contains  the  next,  are  not  assumed 
to  be  measurable,  the  corresponding  result  holds  as  regards  the  interior  measures 
of  the  sets.  We  can  obtain  the  following  theorem  : 

If*  Glt  Gz,  ...  be  a  sequence  of  sets  of  points,  each  of  which  contains  the 
next,  and  if  the  interior  measure  of  each  set  is  greater  than  a  fixed  positive 
number  C,  then  the  interior  measure  of  the  inner  limiting  set  is  =  C. 

Closed  components  P,,  Q2,  Q3,  ...  of  the  sets  Glt  Ga,  G3,  ...  can  be  so  de- 
termined that 


where  e  is  an  arbitrarily  chosen  positive  number.  The  set  Q2  has  a  closed 
component  P3  =  D(P1}  Q2),  of  measure  m  (P2)  =  m  (Px)  +  m  (Q2)  -  m  [M(Plt  Q2)]. 
Since  M(P1}  Q2)  is  a  closed  set  contained  in  G1}  its  measure  does  not  exceed 
W<((TI);  hence  we  have 

m  (P2)  £  m  (Px)  +  m  (Q,)  -  m{  (GJ  Z  m<  (0.)  -    i  +  1 


Next,  by  considering  P3=  D(P2,  Q3),  it  can  be  shewn  as  before  that 


and  so  on.  We  have  now  a  sequence  of  closed  sets  Pu  P2)  P3,  .  .  .  ,  each  of  which 
contains  the  next,  and  such  that  the  measure  of  each  of  them  is  >  (7  —  e. 
Therefore  the  measure  of  Pw,  their  inner  limiting  set,  is  ^  C  —  e;  and  Pu  is  a 
component  of  G^  It  follows,  since  e  is  arbitrarily  small,  that  mt  (G»}  ^  C. 

136.    We  are  now  in  a  position  to  establish  the  following  general  theorem  : 

If*  Glt  GV,  ...  Gn,  ...  is  a  sequence  of  sets  of  points,  each  of  which  sets  is  a 
component  of  a  closed  set,  of  finite  content.  I,  and  if  the  interior  measure  of  each 
of  the  sets  Glt  Gz,  ...  Gn,  ...  is  greater  than  a  fixed  number  C,  then  there  exists 
a  set  of  points,  of  interior  measure  ^  C,  and  of  the  power  of  the  continuum,  such 
that  each  point  of  the  set  belongs  to  an  infinite  number  of  the  given  sets. 

Choose  a  closed  component  of  each  of  the  given  sets,  of  content  >  C  ; 
let  these  components  be  Qlt  Q2,  ...  Qn,  ....  Choose  an  integer  m  such  that 
mC  £  I  <  (m  +  1)  C,  and  let  us  consider  the  first  n  (>  m  +  1)  of  the  sets 
Qu  Q2,  ....  The  points  common  to  any  pair  of  these  closed  sets  form  a  closed 
set,  and  the  set  which  contains  all  the  points  which  belong  to  at  least  two  of 
the  n  closed  sets  is  also  a  closed  set  Q1>w  of  content  Iln.  Those  points  of  Qi  „ 
which  belong  to  Ql  form  a  closed  set  of  content  ^  /m;  hence  there  is  a  set  of 
points  of  Qj,  of  measure  ^  /(Qi)  —  /m,  which  do  not  belong  to  any  of  the  sets 
Qa,  Qs,  •••  Qn',  and  the  measure  of  this  set  is*>  C—  fln.  Similarly,  each  of  the 

*  W.  H.  Young,  Proc.  Land.  Matlt.  Soc.  (2),  vol.  n,  p.  '25. 
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sets  Q2,  Q3>  ...  Qn  has  a  component  of  measure  >C  —  Iln,  consisting  of  points 
which  do  not  belong  to  any  of  the  other  sets,  or  to  Q,.  The  measures  of  all 
these  sets  added  together  is  >n(C  —  7m);  and  it  must  be  less  than  I,  since 
the  sets  do  not  contain  any  points  common  to  two  of  them,  and  they  are  all 
enclosed  in  a  set  of  measure  I.  Hence 

n(C-Iin)<(m  +  l)C,  or  Iln  >  (l  -  ^-^)  C. 


It  has  thus  been  shewn  that  the  closed  set  Ql>n  has  the  power  of  the  continuum, 
since  its  content  is  proved  to  be  positive  ;  and  this  holds  for  every  value  of  n 
which  is  >m  +  l.  Considering  now  the  next  n  sets  Qn+i,  Qn+2,  •••  Qm>  there 

is  a  closed  set  of  content  >  (  1  —  -  ~}@>  consisting  of  points  each  of  which 
belongs  to  two  at  least  of  the  sets  ;  and  a  similar  result  holds  for  each  system 

Of  71  SetS  Qrn+i,  Qrn+2>  "•  Q(r+i)n- 

We  have  now  an  infinite  sequence  of  closed  sets  QljH,  Q2jn,  Qs>n,  ...  Qr,n,  ••• 

each  of  which  has  content  >  (  1  --      -  )  C,  and  the  points  of  each  of  them 

V  n     ) 

belong  to  two  at  least  of  the  given  sets.  By  applying  similar  reasoning,  and 
taking  n  sets  at  a  time,  we  see  that  there  are  an  infinite  number  of  sets  each 

of  content  >  (  1  --  -     ~  )  (  1  --  r-  I  C,  and  such  that  each  point  of  any  one 
V  n     /  \  n     J 

of  them  belongs  to  four  at  least  of  the  given  sets.  Proceeding  in  this  manner, 
we  obtain  sets  of  points,  each  of  content 


and  such  that  each  point  of  each  set  belongs  to  at  least  2S+1  of  the  given  sets. 
Now  let  n,  n,  ...  n(s)  be  so  chosen,  that 

m  +  l      1       m+l      1  m  +  l        1 

~rT<2€'   ~~ri~<4ie'   '     ~W<&+i€ 

then  the  content  of  each  of  the  sets  which  contains  points  belonging  to  28+1 
at  least  of  the  given  sets  is 


The  process  can  be  carried  on  without  limit ;  and  we  see  that  the  set  which 
consists  of  all  points  belonging  to  2m  at  least  of  the  given  sets  contains  closed 
components  of  content  >  (7(1  —  e).  Considering  the  sequence  P1}P.2,  ...  of  sets 
such  that  PI  contains  all  points  that  belong  to  two  at  least  of  the  given  sets, 
P2  contains  all  points  that  belong  to  22  at  least  of  the  given  sets,  and  so  on,  it 
is  clear  that  Pl  contains  P2,  and  P2  contains  P3,  etc.  But  the  interior  measure 
of  each  set  is  >  (7(1  —  e);  hence,  in  accordance  with  the  theorem  of  §  135,  there 
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exists  a  set  of  points  common  to  all  the  sets  P,,  I\,  ...,  of  interior  measure 
^  C(l  —  e).  This  set  consists  of  points  which  belong  each  to  an  infinite  number 
of  the  given  sets ;  and  its  interior  measure  is  ^  C,  since  e  is  arbitrarily  small. 
The  set  has  the  power  of  the  continuum,  since  it  contains  closed  components 
of  content  greater  than  zero.  It  should  be  observed  that  the  employment  of 
the  sets  {Qn}  involves  an  infinite  number  of  acts  of  choice. 

The  theorem  that  has  been  now  established  is  of  considerable  importance 
on  account  of  the  applications  of  it  which  can  be  made  in  various  parts  of  the 
theory  of  functions ;  it  is  due  *  to  W  H.  Young.  That  particular  case  of  the 
theorem  in  which  the  sets  are  all  measurable  was  first  stated  -f",  without  proof, 
by  Borel. 

An  important  case  of  the  theorem  arises  if  we  suppose  each  of  the  sets  to 
consist  of  a  finite,  or  an  enumerably  infinite,  set  of  closed  intervals ;  in  which 
case  the  sets  are  all  measurable.  The  theorem  may  then  be  stated  as  follows: 

If  there  be  given  an  infinite  number  of  sets  of  intervals,  in  a  finite  segment,  each 
set  consisting  of  a  finite,  or  an  enumerably  infinite,  number  of  non-overlapping 
intervals,  and  if  the  measure  of  each  set  of  intervals  is  greater  than  some  fixed 
positive  number  C,  then  there  exists  a  set  of  points,  having  the  power  of  the  con- 
tinuum, and  of  measure  =  C,  such  that  each  point  of  the  set  belongs  to  an  in- 
finite number  of  the  given  sets  of  intervals, 

This  theorem  contains  the  completion,  and  generalization,  of  a  theorem  due 
to  ArzelaJ  which  is  stated  by  him  as  follows : 

Let  2/0  be  a  limiting  point  of  any  set  of  numbers  (y),  and  let 

G0  =  (yi,y*,  •••) 

be  a  sequence  of  numbers  of  (y)  which  converges  to  the  limit  y0.  Assuming 
the  variables  to  be  orthogonal  coordinates  of  a  point  in  a  plane,  let  the  set  of 
straight  lines  y  =  yi,  y  =  yz>  y  —  y^  ••••>  be  drawn,  and  let  a  set  of  intervals  be 
taken  on  the  portion  of  each  of  these  straight  lines  which  is  in  the  interval 
(a,  b),  of  x.  Suppose  that  each  set  of  intervals  is  finite  in  number,  and  that 
this  number  is  variable  from  one  straight  line  to  another,  but  increases  in- 
definitely with  the  index  s,  of  ys.  Let  the  sum  of  the  intervals  SltS,  S.,)S). ..  £„,, 
on  the  line  y  =  yK>  be  ds.  If  for  every  value  of  s,  ds  is  greater  than  C,  a 
determinate  positive  number,  there  necessarily  exists  at  least  one  point  .r0 
in  the  interval  (a,  b),  such  that  the  straight  line  x  —  x0  intersects  an  infinite 
number  of  the  intervals  8. 

Arzela  subsequently  removed  the  condition  that  each  set  of  intervals  is;  a 
finite  one. 

*  Proc.  Land.  Math.  Soc.  (2),  vol.  n,  p.  26. 
t  Comptes  Rendus,  December  1903. 

J  Rend.  delV  Ac.  del  Lincei  (4)  1,  (1885),  p.  037  ;  a  second  proof,  which  is  however  not  rigorous. 
has  been  given  by  Arzeli  in  the  Memorie  delta  R.  Acc.d.  SV.  di  JSolwjna,  (5),  vol.  vin,  181M*. 

H.  1-2 
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THE  METRIC  DENSITY  OF  A  SET  OF  POINTS. 

137.  Let  P  be  any  point  not  necessarily  belonging  to  a  measurable  set  G 
in  linear,  plane,  or  p-dimensional  space. 

Suppose  the  interval,  or  cell,  D  to  contain  the  point  P  within  it,  and 
let  0D  be  the  part  of  G  contained  in  D.  If  a  definite  number  pp(G)  exists, 


such  that 


pP(G)—      Tr      <  6>   where   e  is   an  arbitrarily  chosen  positive 


number,  for  all  such  cells  D,  of  which  the  span  is  less  than  a  number  T;€,  dependent 
on  e,  then  the  number  pp  (G)  is  said  to  be  the  metric  density  of  the  set  G  at  the 
point  P.  It  is  clear  that,  unless  P  is  a  limiting  point  of  G,  whether  it  belong  to 
the  set  or  not,  the  metric  density  at  P  is  zero. 

This  definition*  is  equivalent  to  the  statement  that  the  metric  density, 
when  it  exists,  at  P,  is  the  limit  of  the  ratio  m  (GD)lm  (D)  for  any  sequence  of 
intervals,  or  cells,  containing  P  within  them,  such  that  the  breadth  of  the 
intervals,  or  the  span  of  the  cells,  converges  to  zero. 

It  may  however  happen  that  no  such  unique  limit  exists  ;  it  may  in  fact 
be  possible  to  define  different  sequences  of  the  intervals,  or  cells,  for  which  the 
above  ratio  converges  to  different  limits.  In  that  case  the  limit  ofm(GD)/m  (D) 
is  said  to  be  indeterminate.  There  is  then  an  upper  and  a  lower  limib  of  the 
numbers  to  which  m  (GD)jm  (D)  converges,  for  all  possible  sequences  of  D  for 
which  the  breadth,  or  diagonal,  of  D,  and  therefore  m  (D),  converges  to  zero. 
These  two  numbers  are  both  in  the  closed  interval  (0,  1),  and  the  greater  of  them 
is  called  the  upper  metric  density  of  G,  at  the  point  P,  and  may  be  denoted 
bypp((r);  the  smaller  of  the  numbers  is  called  the  lower  metric  density  of  G,  at  P, 
and  may  be  denoted  by  pp(G).  Thus,  at  every  point  P,  of  G  or  G,  pp(G}  =  pP(G)', 
when  the  sign  of  equality  holds,  there  is  a  metric  density  pP(G),  at  P. 

If  C  (G)  be  the  complement  of  G  relatively  to  an  interval,  or  cell,  in  which 
G  is  contained,  we  see  that 

m(0D)      m[C(GD)}  = 
m(D)          m(D) 
and  thence  it  follows  that 

pp  (G)  +  P_P  {C  (G)}  =  1,    pP  (G)  +  pp  {C  (G)}  =  1  ; 

and  in  case  the  metric  density  of  G,  at  P,  exists,  that  of  C(G),  at  P,  also  exists, 
and  the  sum  of  the  two  is  unity.  It  is  clear  that,  if  pP(G)  =  0,  then  also 
pP(G)  exists,  and  has  the  value  zero. 

A  set  G  is  said  to  be  metrically  dense  at  a  point  P,  whether  or  not  P 
belongs  to  G,  if,  in  every  cell,  or  interval,  according  to  the  dimensions  of  the 
space  in  which  the  set  is  defined,  which  contains  P  in  its  interior,  there  is  a 
set  of  points  of  G  of  measure  >  0.  A  point  at  which  G  is  metrically  dense  must 
belong  to  G'. 

*  See  Lebesgue,  Annales  de  Vecole  norm.  (3),  vol.  xxvu,  p.  406. 
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The  metrical  density  of  G  may  be  zero  at  a  point  at  which  G  is  metrically 
dense ;  for  m(GD)  may  be  >  0  for  every  cell  D ;  and  yet  the  limit  m  (GD)/m  (D) 
may  exist,  and  have  the  value  zero. 

The  terms  metrical  density,  metrically  dense  are  employed  because  the 
terms  dense,  and  non-dense  have  already  been  used  in  a  descriptive  sense.  A  set 
may  be  everywhere  dense,  and  yet  nowhere  metrically  dense,  as  is  the  case, 
for  example,  for  the  set  of  rational  points  in  the  linear  interval  (0, 1). 

The  set  of  all  points  at  which  a  set  G  is  metrically  dense  is  closed. 

For  if,  in  every  neighbourhood  of  a  point  P,  there  are  points  at  which  G  is 
metrically  dense,  it  is  clear  that,  at  the  point  P,  G  is  metrically  dense. 

138.  Those  points  of  a  measurable  set  G  at  which  the  set  is  not  metrically 
dense  form  a  component  of  G,  of  measure  zero,  provided  that  component  exists. 

Apply  a  system  of  nets  to  the  cell,  or  interval,  in  which  G  is  contained. 
Let  Z)MI  be  the  first  of  the  nets  which  has  one  or  more  meshes  each  of  which 
contains  a  part  of  G  of  measure  zero ;  let  Dni'  denote  those  meshes  of  Dni  for 
which  this  is  the  case.  Let  Dnz  (n2  >  %)  be  the  first  net  which  has  meshes, 
not  contained  in  Dni',  in  each  of  which  the  component  of  G  has  measure  zero ; 
let  D^  denote  these  meshes.  Proceeding  in  this  manner,  we  define  a  sequence 
Dni',  Dn2',  . . .  each  of  which  consists  of  a  set  of  meshes  belonging  to  Dni ,  D^ ,  ... 
respectively,  and  such  that  Dnr'  does  not  belong  to  Dnf_^,  for  any  value  of  r. 
Every  point  P,  belonging  to  G,  at  which  G  is  not  metrically  dense  belongs  to 
one  of  the  meshes  of  the  sets  Dni,  Dnz,  ...,  each  of  which  contains  an  enumerable 
set  of  points  of  G.  Since  the  set  of  all  points  of  G  contained  in  all  these  sets 
of  meshes  is  enumerable,  it  follows  that  the  component  of  G,  at  each  point  of 
which  G  is  not  metrically  dense,  has  measure  zero. 

A  set  of  points  is  said  to  be  metrically  dense  in  itself  when  every  point  of 
the  set  is  a  point  at  which  the  set  is  metrically  dense. 

If,  from  any  set  G,  such  that  m  (G)  >  0,  those  points  at  which  the  set  is  not 
metrically  dense  be  removed,  there  remains  a  set  H  such  that  m  (H )  =  m  (G), 
which  is  metrically  dense  in  itself.  For,  in  the  arbitrarily  small  neighbour- 
hood of  any  point  of  H,  there  is  a  set  of  points  of  G  of  measure  >  0,  and  therefore 
a  set  of  points  of  H  of  measureNzero.  The  following  theorem  has  therefore 
been  established : 

Any  measurable  set,  of  measure  >  0,  is  the  sum  of  a  set  of  measure  zero, 
which  tnay  be  non-existent,  and  of  a  set  that  is  metrically  dense  in  itself. 

In  case  the  set  G  is  closed,  we  may  add  to  H  its  limiting  points,  all  of 
which  belong  to  G ;  we  thus  obtain  a  closed  component  H  of  G,  where 
m  (H)  =  m  (G).  The  set  H  has  the  same  property  as  H ;  for,  in  the  neighbour- 
hood of  a  limiting  point  of  H,  there  must  be  a  set  of  points  of  H  of  measure 
>  0.  It  is  clear  that  H  has  no  isolated  points,  hence  it  is  a  perfect  set.  It  has 
thus  been  proved  that : 

12—2 
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A  closed  set  G,  of  measure  >  0,  is  the  sum  of  a  perfect  set  H,  of  measure  equal 
to  that  of  G,  and  metrically  dense  in  itself,  and  of  a  set  of  measure  zero. 

Any  measurable  set  G  contains  a  non-dense  closed  component,  of  measure 
arbitrarily  less  than  that  of  G  (§125).  This  non-dense  closed  component  con- 
tains a  perfect  set,  also  non-dense,  which  is  metrically  dense  in  itself,  and  of 
measure  equal  to  that  of  the  component.  We  have  thus  proved  the  following 
theorem,  due  to  Lusin  : 

Any  measurable  set,  of  measure  >  0,  contains  a  non-dense  perfect  set, 
metrically  dense  in  itself,  and  of  measure  differing  by  an  arbitrarily  small 
amount  from  that  of  the  given  set. 

139.  The  fundamental  theorem  in  the  theory  of  metrical  density  is  that, 
at  every  point  of  a  measurable  set,  with  the  possible  exception  of  the  points 
which  belong  to  a  component,  of  which  the  measure  is  zero,  the  metrical  density 
of  the  set  exists,  and  is  equal  to  unity. 

Before  proceeding  to  the  proof  of  this  theorem,  it  is  useful  to  introduce  the 
notion  of  the  metrical  density  of  a  set  relatively  to  a  given  system  of  nets. 

This  is  obtained  by  restricting  the  cells,  or  intervals,  D,  employed  in  the 
definition  given  in  §  137,  all  to  belong  to  a  given  system  of  nets. 

If  E  and  F  are  two  measurable  sets,  and  those  points  of  E  at  which  the  upper 
metrical  density  of  F  is  greater  than  some  positive  number  c  form  a  component 
E},  of  E,  such  that  rm(El}  >0,  then  a  system  of  nets  can  be  so  determined  that 
those  points  of  E  in  which  the  upper  metric  density  of  F  relatively  to  the  system 
of  nets  is  >  c  ( >  0)  form  a  set  E2  such  that  m  (E2)  >  0. 

If  p  be  the  number  of  dimensions  of  the  space  in  which  E  and  F  are 
situated,  we  can  define  a  symmetrical  system  of  nets  in  which  each  net  Dn+i 
is  obtained  from  Dn  by  dividing  each  of  the  meshes  of  Dn  into  3^  equal  parts. 
We  may  suppose  each  mesh  of  Dn  to  have  all  its  edges  of  equal  length.  Next 
let  us  consider  a  second  symmetrical  system  of  nets  such  that  the  boundaries 
of  its  meshes  contain  the  centres  of  the  meshes  of  the  nets  of  the  first  system. 
The  second  system  of  nets  is  thus  obtained  by  a  translational  displacement  of 
the  first  system,  such  that  one  of  the  corners  of  each  mesh  of  Dn  is  moved  to 
the  centre  of  the  mesh.  Two  such  systems  of  nets  have  been  termed  conjugate 
nets  by  de  la  Vallee  Poussin,  who  introduced  the  conception. 

Every  cell,  of  equal  sides  a,  is  contained  in  the  interior  of  a  mesh  of  one  or 
other  of  the  two  systems  of  nets,  provided  a  be  sufficiently  small ;  and  it  may 
be  contained  in  the  interior  of  each  of  two  meshes,  one  belonging  to  each 
system.  For,  in  the  linear  case,  the  two  sets  of  meshes  of  nets  of  the  same 
order,  belonging  to  the  conjugate  systems,  divide  the  fundamental  interval  into 
equal  parts  of  lengths  say  ft.  Now  consider  any  interval  of  length  <  /3 ;  if  it 
contains  an  end-point  of  a  mesh,  of  length  2/3,  of  one  of  the  two  systems,  it 
must  be  contained  completely  in  a  mesh  of  the  other  system,  also  of  length  2/3. 
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In  the  case  of  a  two-dimensional  system,  consider  a  square  of  sides  <  /8 ;  if 
this  square  contains  a  corner  of  a  mesh,  of  sides  2/8,  of  one  of  the  conjugate 
systems,  it  is  contained  in  the  interior  of  a  mesh,  of  sides  2/9,  of  the  other 
system.  Similarly,  it  is  seen  that  this  holds  in  the  case  of  nets  of  any  number 
of  dimensions.  The  same  result  clearly  holds  as  regards  a  cell  with  unequal 
edges,  the  greatest  of  which  is  <  ft. 

Suppose  that,  at  a  point  P,  ofE,  the  upper  metric  density  of  F  is  >  c  ;  then 
a  sequence  of  cells,  each  containing  the  next,  and  converging  to  the  single 
point  P,  exists,  such  that,  in  each  cell,  the  ratio  of  the  measure  of  the  part  of 
F  in  the  cell  to  the  measure  of  the  cell  is  >  c.  From  and  after  some  particular 
cell  of  the  sequence,  every  one  of  the  cells  is  in  the  interior  of  a  mesh  of  one 
of  the  two  conjugate  symmetrical  systems  of  cells  Nl}  N2.  It  follows  that 
there  must  be  an  infinite  number  of  the  cells  of  the  sequence  interior  to  meshes 
of  one  and  the  same  of  the  two  conjugate  systems  N1}  N2. 

At  each  point  P,  of  El ,  the  upper  metric  density  of  F  is  >  c,  relatively  to 
one  of  the  two  systems  of  nets  Nlt  N2.  Since  the  measure  of  El  is  >  0,  there 
must  be  a  part  E2,  of  El}  of  measure  >  0,  such  that  the  upper  metric  density 
of  F  relatively  to  one  of  the  systems  of  nets  N1}  Nz,  is  at  all  points  of  Ez 
greater  than  c.  That  one  of  the  systems  N1 ,  N2  for  which  this  is  the  case  is  a 
system  such  as  is  required. 

140.    The  following  fundamental  theorem  will  now  be  established : 

The  metric  density  of  a  measurable  set  E  exists,  and  is  equal  to  unity,  at  all 
points  of  E,  with  the  possible  exception  of  the  points  of  a  component  of  which  the 
measure  is  zero ;  and  the  metric  density  of  E  exists,  and  is  equal  to  zero,  at  all 
points  of  C  (E},  with  the  possible  exception  of  points  of  a  set  of  measure  zero. 

We  shall  first  suppose  that  E  is  a  bounded  set.  It  will  be  shewn  that  the 
points  of  E  at  which  the  upper  metric  density  of  C(E)  is  >  0  form  a  set  of 
measure  zero. 

Let  us  suppose,  if  possible,  that  the  measure  of  the  component  of  E,  at  each 
point  of  which  the  upper  metric  density  ofC(E)  is  greater  than  a  fixed  positive 
number  a,  is  >  0.  A  system  of  nets  can  then  be  so  determined  that  the  upper 
metric  density  of  C(E},  relatively  to  the  system  of  nets,  is  >  a,  at  all  points  of 
a  component  Ea,  of  E,  such  that  m(Ea}>  0.  Let  O  be  an  open  set  of  points 
which  contains  Ea,  and  is  such  that  m(£l)  —  m(Ea)  is  less  than  an  arbitrarily 
chosen  positive  number  ??. 

Any  point  P,  of  Ea,  is  contained  in  a  unique  sequence  of  meshes  of  the  nets, 

one  in  each  of  the  nets  Dl}  D.,,  . . .  Dn, From  and  after  some  value  s,  of  n,  all 

these  meshes  are  contained  in  fl.  Of  these  meshes  dg,  ds+l,  ...,  there  must  be 
one  of  lowest  rank  np  such  that  the  measure  of  the  part  of  C(E)  that  it  con- 
tains is  >  am  (dnp).  When  dnp  has  been  determined,  we  may  suppose  it  to 
correspond  to  each  of  the  points  P,  of  Ea ,  which  it  contains.  To  any  other  point 
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P',  of  Ea,  not  contained  in  dnp,  there  corresponds  a  similar  mesh  dn^.  An 
enumerable  set  of  such  meshes  will  contain  all  the  points  of  Ea,  and  it  is  con- 
tained in  H.  The  measure  of  the  part  of  C(E)  contained  in  this  enumerable 
set  of  meshes  is  >  am(Ea).  But  the  measure  of  the  set  of  meshes  is  <  m(Ea)  +  rj, 
and  therefore  the  measure  of  the  part  of  C  (E)  contained  in  the  set  of  meshes 
is  <  77.  Now  77  can  be  so  chosen  as  to  be  <  am  (Ea) ;  hence  the  assumption 
that  m  (Ea)  >  0  leads  to  contradiction.  It  follows  that  m  (Ea)  =  0.  Taking  for 
a  the  values  in  a  decreasing  sequence  of  numbers  that  converges  to  zero,  the 
set  of  points  of  E  at  which  the  upper  metric  density  of  C  (E)  is  >  0  is  the 
outer  limiting  set  of  the  sets  Ea,  all  of  which  have  measure  zero.  Therefore 
those  points  of  E  at  which  the  upper  metric  density  of  C(E)  is  >  0  form  a  set 
of  measure  zero.  With  the  exception  of  this  set,  at  every  point  of  E  the  metric 
density  of  C(E)  is  zero. 

It  then  follows  that,  at  every  point  of  E  not  belonging  to  a  component  of 
measure  zero,  the  metric  density  of  E  is  unity.  Since  E  and  C  (E)  can  be 
interchanged,  we  see  that  the  density  of  E  at  all  points  of  C  (E)  is  zero, 
except  at  points  of  a  component  of  G(E)  of  measure  zero. 

In  case  the  set  E  is  unbounded,  we  may  consider  the  parts  of  it  in  cells  of 
a  sequence  each  of  which  contains  the  preceding  one,  and  of  which  the  measures 
increase  indefinitely.  Each  of  the  cells  may  contain  an  exceptional  set  of 
measure  zero,  at  which  the  metric  density  of  the  component  of  E  in  the  cell 
either  does  not  exist,  or  is  not  equal  to  unity.  The  outer  limiting  set  of  these 
exceptional  sets  has  also  the  measure  zero ;  hence  the  theorem  holds  for  the 
unbounded  set  E. 

The  above  theorem  was  first  established,  for  the  case  of  linear  sets,  by 
Lebesgue,  who  employed  the  theory  of  integration.  Other  proofs,  independent 
of  the  theory  of  integration,  have  been  given,  for  linear  sets,  by  Denjoy*,  and 
by  Lusin  and  Sierpinski  f.  The  above  proof  is  founded  on  the  treatment  of  the 
subject  by  de  la  Vallee  Poussin,  who  established  the  theorem  for  the  case  of 
sets  of  points  in  any  number  of  dimensions. 

THE  RESOLUTION  OF  SETS  OF  POINTS  IN  ACCORDANCE  WITH 
METRICAL  PROPERTIES. 

141.  It  was  shewn  in  §  91  that  every  set  of  points  can  be  expressed  as  the 
sum  of  an  enumerable  (or  finite)  series  of  sets  which  have  certain  descriptive 
properties.  We  proceed  to  analyse  a  given  measurable  set  into  the  sum  of 
parts  which  have  certain  metrical  properties. 

It  will  be  shewn  that : 

*  Journal  de  Mat.  (7),  vol.  i,  p.  132. 

t  Rend.  d.  Circ.  Mat.  di  Palermo,  vol.  XLII,  p.  167. 
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Any  bounded  measurable  set  can  be  expressed  as  the  sum  of  an  enumerable 
sequence  of  perfect  sets,  together  with  a  set  of  which  the  measure  is  zero.  More- 
over, the  perfect  sets  can  be  so  chosen  as  to  be  all  non-dense,  and  such  that  each 
of  them  is  metrically  dense  in  itself. 

If  S  be  a  measurable  set,  it  has  been  shewn  in  §  125  that  it  contains  a 
perfect  set  H}  such  that  m  (S)  —  m  (H^  <  e/4.  If  HI  is  not  non-dense  it  has  a 
non-dense  perfect  component  GI,  such  that  m  (#,)  —  m(Gi)  <  e/4;  therefore 
m  (S)  —  m  ((?i)  <  e/2.  The  perfect  set  6r,  has  a  perfect  component  (§  138)  Llt 
of  the  same  measure  as  Ol}  and  metrically  dense  in  itself.  We  have  now 
S^L1  +  M1'1  where  m  (S)  -  m  (L^  <  e/2  ;  or  m  (A/,)  <  e/2.  The  set  Ml  may 
be  similarly  resolved  into  7>2  +  M2 ;  where  Z2  is  perfect,  non-dense,  and  metrically 
dense  in  itself,  and  where  m  (M2)  <  e/22.  This  procedure  may  be  carried  on 
successively,  and  terminates  only  if  one  of  the  sets  Mn  is  either  absent  or  has 
the  measure  zero.  In  case  Mn  exists  for  every  value  of  n,  its  inner  limiting 
set  is  M^,  which  has  the  measure  zero,  since  lim  m  (Mn)  =  0.  We  have  there- 

n~  oo 

fore,  in  this  case,  S  =  L^  +  L.2  +  . . .  +  Ln  +  . . .  +  Mu,  where  the  series  LltL2, ... 
of  perfect  sets,  all  non-dense  and  metrically  dense  in  themselves,  is  either 
finite,  or  forms  an  unending  series. 

It  will  be  observed  that  the  set  Ma  is  a  residual  set,  relatively  to  S, 
although  it  has  the  measure  zero.  In  particular,  the  set  of  all  the  points  of  a 
cell  can  be  resolved  in  this  manner  into  a  sum  of  sets ;  the  residual  $„  being 
everywhere  dense,  and  of  cardinal  number  c,  although  its  measure  is  zero. 

JORDAN'S  MEASURE  OF  A  SET  OF  POINTS. 

142.  A  definition  has  been  employed  by  Jordan*,  and  by  Peanof,  of  the 
measure  of  a  set  of  points,  which  differs  from  that,  developed  by  Borel  and 
Lebesgue,  which  has  been  employed  in  the  present  Chapter.  It  is  applicable 
to  sets  of  points  in  space  of  any  number  of  dimensions,  and  is  of  utility  in 
the  theory  of  quadratures. 

Let  a  set  G  be  in  the  interior  of  a  closed  cell  (or  interval)  A ;  and  let  a 
system  of  nets  with  closed  meshes  be  fitted  on  to  A.  Let  En(1)  be  the  sum  of 
the  measures  of  those  meshes  of  Dn  which  are  such  that  every  point  of  each 
of  them  is  an  interior  point  of  G,  and  let  2J-'  be  the  sum  of  those  meshes  of 
Dn  which  are  such  that  each  of  them  contains  either  an  interior  point  of  G  or 
a  point  on  the  frontier  of  G  and  C  (G).  It  can  be  shewn  that  -,,(1)  converges, 
as  ?i~oo  ,  to  a  number  8lt  and  that  Sn(>2)  converges  to  a  number  8.2 ;  where  8l 
and  S^  are  independent  of  the  particular  system  of  nets  fitted  on  to  A. 

The  number  Sl  is  called  the  interior  extent  of  the  set  G,  and  &,  is  called 

*  Journal  de  Mat.  (4),  vol.  via ;  also  fours  d' Analyse,  vol.  i,  p.  '2.S. 
t  Applicazioni  geom.  del.  calc.  infinit.  (1887),  p.  153. 
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the  exterior  extent  of  G.  When  Sl  =  Sz,  the  set  is  said  to  be  measurable  (/), 
and  the  number  Sl  =  S2  is  said  to  be  its  measure  (J).  The  exterior  extent  of 
a  set  is  identical  with  its  content,  as  defined  by  Harnack  and  Cantor  (§  118). 

In  accordance  with  this  definition,  any  set  which  contains  no  interior 
points  has  its  interior  extent  zero.  For  example,  the  interior  extent  of  the 
irrational  points  of  the  linear  interval  (0,  1)  is  zero,  and  its  exterior  extent  is  1. 

A  set  G  consists,  in  general,  of  interior  points  forming  a  set  /,  and  of  points 
Fl  ,  all  of  which  belong  to  the  frontier  F.  The  interior  extent  of  G  is  identical 
with  the  measure  of  the  open  set  /,  and  the  exterior  extent  of  G  is  identical 
with  the  measure  of  the  closed  set  /  +  F  ;  the  measures  of  these  sets  being 
defined  as  in  §  122.  It  is  then  clear  that  the  necessary  and  sufficient  condition 
that  a  set  G  should  be  measurable  (J)  is  that  its  frontier,  which  is  a  closed 
set,  should  have  the  measure  zero.  Accordingly,  a  set  that  is  measurable  (J") 
is  also  measurable  in  accordance  with  the  definition  here  adopted  (§  128),  but 
the  converse  does  not  hold. 
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143.    If  G  be  any  closed  set  of  points  in  a  rectangle  A  BCD,  and  through 
the  points  P,  of  AB,  straight  lines  PP'  are  drawn  perpendicular  to  AB,  and 


p' 


P,  Q 


FIG.  2. 


if/(P)  denote  the  linear  content  of  the  linear  component  of  G  which  is  on  the 
straight  line  PP',  and  which  may  be  termed  the  section  of  G  by  PP',  then 
the  set  of  points  P,  on  AB,  which  is  such  that  f(P}  =  cr,  is  a  closed  set;  a 
denoting  any  positive  number. 

Let  Pj  be  a  limiting  point  of  the  set;  and  if  possible,  let  the  linear  content 
of  that  component  of  G  which  is  on  P^Y  be  <  cr ;  we  can  then  determine  a 
finite  number  of  intervals  8lf  &>,  ...  Sr>  on  P^Y,  whose  sum  is  >  AD  —  cr,  and 
which  are  free  in  their  interiors  and  at  their  ends  from  points  of  G.  On  each 
of  these  intervals  £  we  can  describe  a  rectangle  which  contains  no  points  of  G 
within  it  or  on  its  boundaries:  this  may  bo  done  on  either  side  of  PjP/; 
for  each  point  of  8  can  be  enclosed  in  a  rectangle  free  from  points  of  G ',  and 
by  the  Heine-Borel  theorem,  a  finite  number  of  these  rectangles,  enclosing 
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all  the  points  of  B,  exists.  Take  a  point  Q  belonging  to  the  set  of  points  for 
which  f(Q)  =  cr,  and  let  PjQ  be  less  than  the  breadth  of  all  the  rectangles 
described  on  the  intervals  {8}  on  one  side  of  PiP/.  On  QQ'  there  is  a  finite 
number  of  intervals  free  from  points  of  G,  whose  sum  is  >AD  —  a,  by  the 
assumption  as  to  P^PJ ;  hence  the  linear  content  of  the  component  of  G 
which  is  on  QQ'  must  be  <  cr,  which  is  contrary  to  the  hypothesis.  It  follows 
that /(Pi)  =  cr ;  hence  the  set  of  points  on  AB  is  closed. 

It  will  now  be  shewn  that,  for  a  closed  set  of  points  G,  if  for  every  position 
of  P,  on  AB,  the  linear  content  of  the  section  of  G  by  PP  is  <  <r,  then  the 
content  of  G  is  «r .  AB. 
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Taking  any  point  P,  of  AB;  on  PP',  a  finite  number  of  intervals,  whose 
sum  is  >  AD  —  a-,  can  be  found  which  are  free  from  points  of  G ;  and  on  each 
of  these  intervals  a  rectangle  can  be  drawn  on  each  side  of  PP',  containing 
no  points  of  G  in  its  interior  or  on  its  boundary.  We  can  now  draw  two 
straight  lines  pp',  qq',  one  on  each  side  of  P,  so  that  each  of  them  passes 
through  the  interiors  of  all  the  rectangles  so  described.  We  have  now  found 
an  interval  pq  containing  P,  such  that  in  pqq'p'  there  is  an  area 

>pq(AD  -  a) 

free  from  points  of  G.  Corresponding  to  each  point  P,  of  AB,  such  an  interval 
pq  can  be  found ;  and,  in  accordance  with  the  Heine-Borel  theorem,  a  finite 
number  of  these  intervals  can  be  selected,  such  that  every  point  of  AB  is  in 
the  interior  of  one  at  least  of  them.  The  end-points  of  these  intervals  divide 
AB  into  a  finite  number  of  parts,  such  that,  above  any  one  part,  of  length  a, 
there  is  an  area  >a(AD  —  a)  free  from  points  of  G ;  and  hence  there  is 
altogether  an  area  >  AB  (AD  —  a-)  free  from  points  of  G.  It  follows  therefore 
that  the  content  of  G  is  <  AB .  cr. 

We  shall  now  establish  the  following  theorem,  which  is  of  importance  in 
the  theory  of  double  integration  : 

If  G  be  a  closed  set,  and  if  the  linear  content  of  the  set  of  points  P,  on  AB, 
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for  which  the  linear  content  of  the  section  of  0  by  PP'  is  =  cr,  have  the  value 
zero,  for  every  positive  value  of  cr,  then  the  set  G  is  of  zero  content. 

The  points  on  AB,  for  which /(P)  ^  cr,  can  be  enclosed  in  a  finite  number 
of  intervals  whose  sum  is  <  e,  where  e  is  an  arbitrarily  small  number ;  and  in 
each  of  the  remaining  parts  of  AB,  the  value  of /(P)  is  <  cr ;  hence  by  the 
foregoing  theorem  the  content  of  G  is  <  cr(AB  —  e)  +  e.  AD;  and  since  this 
holds  for  arbitrarily  small  values  of  a  and  e,  it  follows  that  the  content  of  G 
must  be  zero. 

Conversely,  it  may  be  shewn  that : 

If  G  be  a  closed  set,  of  plane  content  zero,  the  set  of  points  P,  on  AB,for 
which  the  linear  content  of  the  section  of  G  by  PP'  is  =  cr,  has,  for  every  positive 
value  of  <r,  content  zero.  The  linear  measure  of  the  set  of  points  P  for  which 
the  linear  content  of  the  section  of  G  by  PP'  is  greater  than  zero  is  zero. 

Let  /  denote  the  linear  content  of  the  set  (P)  for  which /(P)  ^  cr ;  divide 
AB  into  n  equal  parts,  and  AD  also  into  n  equal  parts,  and  through  the  end- 
points  of  these  parts  draw  straight  lines  dividing  the  rectangle  into  equal  parts 

each  of  area  —  .  AB .  AD.   Then  the  sum  of  those  parts  of  AB  which  contain 

n2 

points  of  the  set  (P)  is  always  greater  than  /;  and  in  each  such  part  there  is 
at  least  one  point  P,  such  that  the  sum  of  the  parts  of  PP'  which  contain 
points  of  G  is  >  cr.  It  follows  that  the  sum  of  those  rectangular  portions 
which  contain  points  of  G  is  >  cr/,  however  great  n  may  be ;  and  hence  that 
the  content  of  G  is  =  cr/.  Therefore  it  follows  that  G  cannot  have  zero 
content  unless  /  is  zero. 

The  second  part  of  the  theorem  is  proved  by  giving  to  cr  the  values  in  a 
sequence  {crn}  which  converges  steadily  to  zero.  We  have  then  only  to 
consider  the  outer  limiting  set  of  the  sequence  of  sets  which  correspond  to 
the  values  of  a  in  the  sequence  |crn}. 

EXAMPLES. 

1.  Let  a  set  of  points  (x,  y)  in  the  rectangle  for  which  O^^^l,  O^y^l,  be  denned 
as  follows*  : — The  numbers  x,  y  are  expressed  in  the  dyad  scale,  and  only  those  values  of  x 
and  y  are  taken  which  are  expressed  by  terminating  radix-fractions,  the  number  of  digits 
being  the  same  for  x  as  for  y.  If  x  denotes  a  terminating  radix-fraction,  there  is  only  a 
finite  number  of  points  (x' ' ,  y'}  of  the  set  on  the  straight  line  x  =  x' ;  similarly  if  y'  denotes 
a  terminating  radix-fraction,  there  is  only  a  finite  number  of  points  of  the  set  on  the 
straight  line  y=y'.  The  two-dimensional  set  is  however  everywhere  dense  ;  for,  considering 
a  straight  line  y  =  x  +  a,  where  a  is  a  positive,  or  a  negative,  radix-fraction  with  a  finite 
number  of  digits,  we  see  that,  corresponding  to  any  number  x  expressed  by  a  finite  number 
of  digits  greater  than  the  number  of  digits  by  which  a  is  expressed,  there  is  a  point  (x,  y} 
on  the  straight  line  belonging  to  the  set.  The  component  of  the  set  on  the  straight  line 
y  =  x  +  a,  being  everywhere  dense,  and  the  values  of  a  being  everywhere  dense  in  the  interval 
(-  1,  1),  it  follows  that  the  set  is  everywhere  dense  in  the  rectangle. 

*  Pringsheim,  Sitzungsberichte  d.  Milnch.  Akad.,  vol.  xxix,  p.  48. 
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This  example  shews  that  an  everywhere  dense  two-dimensional  set  may  be  linearly 
non-dense  on  each  straight  line  belonging  to  two  parallel  sets.  It  also  shews  that  a  two- 
dimensional  set  may  exist  which  is  extended,  but  is  unextended  on  straight  lines  belonging 
to  either  of  two  parallel  sets. 

2.  Let  a  cross*  formed  by  two  pairs  of  straight  lines  parallel  to  the  pairs  of  sides  of 
a  square  be  constructed,  and  so  that  the  remainder  of  the  square  consists  of  four  equal 
squares  at  the  corners.  Let  the  interior  points  of  the  cross  be  removed  from  the  square, 
and  then  let  a  similar  cross  be  removed  from  each  of  the  remaining  four  squares.  Pro- 
ceeding in  this  manner,  let  the  crosses  be  so  chosen  that  the  area  of  each  square  after  the 

_  £. 
with  stage  of  the  process  is  ab   2™  times  the  area  of  each  square  after  the  preceding  stage. 

The  sum  of  the  areas  of  the  squares  which  remain  after  the  with  stage  is 


where  Q  is  the  area  of  the  original  square.  A  non-dense  closed  set  of  points  is  denned 
as  the  points  which  remain  when  this  process  is  carried  on  indefinitely.  The  limit  of  the 
sum  of  the  crosses  is  that  of 


\Q; 

and  this  is  Q,  or  <  Q,  according  as  a  <  £  ;  it  follows  that  the  closed  set  has  content  zero,  if 
a  <  £  ;  but  if  a  =  £,  the  content  is  b~p  ty. 

*  Veltmann,  Schldmilch's  Zeitsch.,  vol.  xxvn,  pp.  178,  314. 


CHAPTER  IV 

TRANSFINITE  NUMBERS  AND  ORDER-TYPES 

144.  A  PRELIMINARY  account  has  been  given,  in  Chapter  u,  of  the  theory 
of  transfinite  ordinal  and  cardinal  numbers ;  it  was  shewn  that  the  introduction 
of  such  numbers  was  suggested  by  the  exigencies  of  the  theory  of  linear  sets  of 
points,  and  that,  in  particular,  the  necessity  for  the  use  of  transfinite  ordinal 
numbers  arises  whenever  a  convergent  sequence  of  points  is  transcended  by 
adjoining  to  the  points  of  the  sequence  their  limiting  point  and  any  further 
points  which  it  may  be  desirable  to  regard  as  belonging  to  the  same  set  as  the 
points  of  the  sequence.  The  fundamental  discovery  of  G.  Cantor,  that  the 
rational  points  of  an  interval  form  an  enumerable  set,  whereas  the  set  of  points 
of  the  continuum  is  unenumerable,  by  establishing  the  existence  of  a  distinction 
between  the  characters  of  two  infinite  sets,  suggests  the  development  of  a 
general  theory  of  cardinal  numbers  of  infinite  aggregates.  The  procedure  we 
adopted,  of  introducing  the  fundamental  notions  of  transfinite  ordinal  and 
cardinal  numbers  in  connection  with  the  theory  of  sets  of  points,  is  in  accord 
with  the  historical  order  in  which  the  whole  theory  of  transfinite  numbers  and 
order-types  was  developed.  The  account  of  the  theory  of  transfinite  numbers 
given  in  Chapter  II  is  in  general  agreement  with  Cantor's  earlier  presentation* 
of  his  ideas  ;  his  later f,  and  more  abstract,  treatment  of  the  subject  is  the  one 
upon  which  the  account  given  in  the  present  Chapter  is  founded. 

In  order  that  the  reader  may  be  put  into  a  position  to  form  his  own  con- 
clusions as  to  the  validity  of  a  scheme  which  must  be  regarded  as  still,  to  some 
extent  at  least,  in  the  controversial  stage,  it  has  been  thought  best  to  postpone 
any  discussion  of  the  difficulties  of  the  theory,  until  after  the  conclusion  of  the 
detailed  account  of  the  theory  in  its  constructive  aspect. 

In  the  latter  part  of  the  Chapter,  some  critical  remarks  upon  the  logical 
basis  of  the  theory  will  be  made;  these  must  necessarily  be  of  an  incomplete 
character,  partly  from  considerations  of  space,  and  also  because  any  complete 
criticism  of  such  a  scheme  as  Cantor's  theory  of  transfinite  numbers  would 
involve  the  consideration  of  questions  of  an  epistemological  character,  which 
for  obvious  reasons  cannot  be  adequately  dealt  with  in  a  work  of  a  professedly 

*  See  his  "  Grundlagen  einer  allgemeinen  Mannigfaltigkeitslehre,"  Leipzig,  1883,  or  Math. 
Annalen,  vol.  xxi ;  see  also  Xeitschrift  fUr  Phil,  und  phil.  Kritik,  vols.  LXXXVIII,  xci  and  xcn. 
Cantor's  ideas  were  foreshadowed  in  a  paradoxical  form  by  Bolzano  in  his  Paradoxien  des 
Unendlichen,  Leipzig,  18(51 ;  and  although  infinite  numbers  had  been  discussed  by  earlier 
writers,  Bolzano  is  the  only  real  predecessor  of  Cantor  in  this  department  of  thought. 

t  This  is  contained  in  the  two  articles  "  Beitrage  zur  Begriindung  der  transfiniten  Mengen- 
ehre,"  in  the  Math.  Annalen,  vol.  XLVI  (1895),  and  vol.  XLIX  (1897). 


144, 145]  Cardinal  numbers  189 

mathematical  complexion.  Objections  which  may  be  urged  against  some  parts 
of  the  theory  will  however  be  fully  stated.  Some  consideration  will  also  be 
given  to  the  question,  whether,  and  how  far,  the  theory  is  indispensable  as  a 
logical  basis  of  continuous  Analysis. 


THE   CARDINAL   NUMBER   OF   AN    AGGREGATE. 

145.  A  collection*  of  definite  distinct  objects  which  is  regarded  as  a  single 
whole  is  called  an  aggregate. 

An  aggregate  may  be  denoted  symbolically  by  a  large  letter  M,  the 
elements  of  the  aggregate  by  small  letters  m\  and  the  constitution  of  the 
aggregate  may  be  denoted  by  the  equation  M  =  {m}. 

The  consideration  of  questions  which  arise  in  connection  with  this 
definition,  as  to  the  mode  in  which  the  objects  of  the  aggregate  must  be 
specified  in  order  that  the  aggregate  may  be  adequately  defined,  and  as 
regards  the  conditions,  if  any,  which  must  be  satisfied  in  order  that  a  col- 
lection may  be  regarded  as  a  whole,  or  aggregate,  of  such  a  character  that  it 
can  be  an  object  of  mathematical  thought,  will  be  postponed.  For  the 
present,  it  is  sufficient  to  remark  that  an  adequate  definition  of  any  par- 
ticular aggregate,  which  is  not  necessarily  finite,  must  contain,  as  a  minimum, 
a  set  of  rules  or  specifications  by  means  of  which  it  is  theoretically  deter- 
minate, in  respect  of  any  object  whatever,  whether  such  object  does,  or  does 
not,  belong  to  the  aggregate.  The  set  of  prime  numbers,  for  example,  is  re- 
garded as  an  aggregate  although,  when  a  particular  number  is  presented  to 
us,  we  may  be  practically  unable  to  decide  whether  that  number  is  prime  or 
not.  In  this  case,  however,  a  finite  number  of  processes  will  suffice  to  decide 
the  question.  If  however,  we  take  the  case  of  the  algebraical  numbers,  the 
state  of  things  is  different;  for  we  are  not  in  possession  of  any  general  method 
which  enables  us  to  decide  whether  a.  given  number  is  algebraic  or  not. 
Nevertheless,  the  question  being  regarded  as  having  a  logically  determinate 
answer,  the  algebraical  numbers  are  regarded  as  forming  an  aggregate,  in  the 
sense  here  employed. 

An  aggregate  does  not  depend,  for  its  validity  as  a  mathematical  entity, 
upon  the  possibility  of  producing  all  its  members,  successively  or  otherwise, 
but  upon  the  sufficiency  of  the  rules  by  which  its  elements  are  to  be  dis- 
tinguished, as  belonging  to  it,  in  that  particular  kind  of  objects  to  which  they 
belong;  that  is,  upon  the  sufficiency,  in  this  direction,  of  its  definition  of 
membership. 

*  This  definition  is  given  by  Cantor,  Math.  Annalen,  vol.  XLVI,  p.  481,  as  follows: — "  Unter 
eiuer  '  Menge '  verstehen  wir  jede  Zusammenfassung  I/  von  bestimmten  \vohl  unterschiedenen 
Objecten  m  uuserer  Anschauung  oder  un seres  Denkens  (welche  die  '  Elemente '  von  M  genuimt 
werden)  zu  einem  Ganzen." 
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Two  aggregates  M,  N  are  said  to  be  equivalent  to  one  another  when 
they  are  such  that  a  law  or^  correspondence  can  be  established  between  the 
elements  of  one  aggregate  and  those  of  the  other,  so  that,  to  each  element 
of  one  of  the  aggregates,  there  corresponds  one,  and  only  one,  element  of  the 
other  aggregate. 

This  relation  of  equivalence  between  two  aggregates  M  and  N  may  be 
expressed  symbolically  by  M  ~  N,  or  N  **  M. 

It  is  clear  that,  if  each  of  two  aggregates  is  equivalent  to  a  third,  the  two 
aggregates  are  equivalent  to  one  another. 

Aggregates  which  are  equivalent  to  one  another  are  said  to  have  the  same 
power,  or  cardinal  number. 

A  cardinal  number  is  accordingly  characteristic  of  a  class  of  equivalent 
aggregates. 

The  question  whether  two  defined  aggregates  have,  or  have  not,  the  same 
cardinal  number,  is  thus  equivalent  to  the  question  whether  it  is,  or  is  not, 
possible  to  establish  a  systematic  (1,  1)  correspondence  between  the  elements 
of  the  two  aggregates,  in  accordance  with  the  above  definition  of  equivalence. 

A  particular  aggregate  can  ordinarily  be  shewn  to  be  equivalent  to  itself. 
The  law  of  correspondence  between  an  element  and  another  element  which 
can  be  set  up  is  in  general  of  a  character  which  admits  of  a  certain 
arbitrariness.  The  cardinal  number  is  accordingly  regarded  as  independent 
of  the  notion  of  order  in  the  aggregate. 

The  power,  or  cardinal  number,  of  an  aggregate  M  has  been  defined  by 
Cantor  as  the  concept  which  is  obtained  by  abstraction  when  the  nature 
of  the  elements  of  M,  and  the  order  in  which  they  are  given,  are  entirely 
disregarded. 

Cantor  regards  the  fact  that  equivalent  aggregates  have  the  same  cardinal 
number  as  a  deduction  from  this  definition. 

The  cardinal  number  of  M  is  a  characteristic  of  M  which  may  be  denoted 
by  M,  to  indicate  that  both  the  order  of  the  elements,  and  their  precise 
individual  nature,  are  irrelevant  as  regards  the  cardinal -number. 

The  relation  of  equivalence  M~N,  between  two  aggregates,  implies  the 
equality  M  =  JV;  and  this  equation  expresses  the  necessary  and  sufficient  con- 
dition for  the  equivalence  of  M  and  N. 

Since  Cantor  regards  the  cardinal  number  of  M  as  independent  of  the 
precise  nature  of  the  elements  of  M,  we  may,  in  accordance  with  this  view, 
substitute  for  each  element  the  number  unity.  We  have  thus  a  new 
aggregate  which  is  a  collection  of  elements  each  of  which  is  the  number  1, 
and  is  equivalent  to  M;  and  this  new  aggregate  is  regarded  by  Cantor 
as  a  symbolical  representation  of  the  cardinal  number  M. 
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THE   RELATIVE   ORDER   OF  CARDINAL   NUMBERS. 

146.  Every  aggregate  Mlt  which  is  such  that  all  its  elements  are  also 
elements  of  M,  is  called  a  part,  or  sub-aggregate,  of  M. 

If  MX  is  a  part  of  Ml ,  and  3/,  is  a  part  of  M,  then  Mz  is  a  part  of  M. 

A  finite  aggregate  cannot  be  equivalent  to  any  of  its  sub-aggregates;  but 
as  will  be  seen  in  detail  further  on,  an  infinite  aggregate  always  possesses 
sub-aggregates  which  are  equivalent  to  itself.  This  is  the  characteristic 
distinction  between  finite  and  infinite  aggregates,  and  has  in  fact  been 
employed  by  Dedekind  and  others  to  define  an  infinite  aggregate  as  one 
which  is  equivalent  to  one  of  its  parts. 

If  two  aggregates  M,  N,  with  the  cardinal  numbers  a  =  M,  ft  =  N,  are 
such  that,  (1)  there  exists  no  part  of  M  which  is  equivalent  to  N,  and 
(2)  there  exists  a  part  NI  ,  of  N,  which  is  equivalent  to  M,  it  is  clear  that  the 
corresponding  conditions  are  satisfied  for  any  two  aggregates  which  are 
equivalent  to  M,  N  respectively;  and  thus  the  two  conditions  characterise 
a  relation  between  the  cardinal  numbers  a,  /8  of  the  two  aggregates.  When 
the  above  conditions  are  satisfied  we  say  that  a  is  less  than  /3,  and  that  /3  is 
greater  than  a;  which  is  expressed  symbolically  by  a  <  /3,  /3  >  a.  This  is  the 
definition  of  inequality  for  two  cardinal  numbers,  and  of  the  relations  greater 
and  less,  in  the  purely  ordinal  sense  in  which  they  are  here  used. 

The  condition  contained  in  the  definition  is  inconsistent  with  the  relation 
of  equality  between  a  and  /3  being  satisfied.  For  if  a  =  /3,  then  M  *»  N ',  hence 
since  N^  ~*  M,  we  have  N^  ~  N:  therefore,  since  M~  N,  there  must  be  a  part 
of  M,  say  M1}  such  that  M1~M,  which  would  involve  Ml^N;  but  this  is 
contrary  to  one  of  the  conditions  contained  in  the  definition  of  inequality. 

It  is  easily  seen  that  if  a  <  /3,  and  /3  <  7,  then  a  <  7. 

147.  It  has  been  seen  that  the  three  relations  a  =  /3,  a</3,  a>/3  are 
mutually  exclusive;  but  the  question  arises  whether  any  two  cardinal  numbers 
a,  0  whatever  must  satisfy  one  of  these  relations.     An  affirmative  answer  to 
this  question  would  be  required  before  it  could  be  maintained  that  all  cardinal 
numbers  can  be  regarded  as  being  alike  capable   of   having  relative  rank 
assigned  to  them,  in  a  single  ordered  aggregate. 

Two  aggregates  M,  N,  of  which  we  may  denote  parts  by  Ml}  Nlt  must 
satisfy  one,  and  only  one,  of  the  following  four  conditions: 

(1)  M,  N  have  parts  M^  NJt  such  that  Ml  ~  N,  and  jVj  ~  M. 

(2)  M  has  a  part  Ml,  such  that  M1  ~  N;  but  no  part  of  N  exists  which 
is  equivalent  to  M. 

(3)  There  is  no  M^  which  is  equivalent  to  N;  but  there  is  an  Nj  which 
is  equivalent  to  M. 

(4)  There  exists  no  Ml  equivalent  to  N',  and  also  no  NI  equivalent  to  J/. 
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It  will  be  proved  that,  if  the  condition  (1)  is  satisfied,  then  M  =•  N.  The 
condition  (2)  expresses  the  relation  defined  as  M>  N.  The  condition  (3) 
expresses  the  relation  defined  as  -M  <  N. 

In  case  M  is  a  part  of  N,  M  is  itself  equivalent  to  a  part  of  N,  and  thus 
either  (1)  or  (2)  is  satisfied;  and  therefore  M ^  N. 

It  has  not  been  proved  that  the  relation  (4)  is  an  impossible  one;  except 
that,  in  the  case  of  finite  aggregates,  it  may  be  easily  seen  that  it  involves 
M=  N.  Until  this  point  is  cleared  up,  it  cannot  be  maintained  as  an  estab- 
lished fact  that  the  cardinal  numbers  a,  ft  of  any  two  aggregates  whatever 
satisfy  one  of  the  three  relations  a  =  ft,  a.  >  ft,  a.  <  ft. 

Two  aggregates  which  are  such  that  their  cardinal  numbers  a,  ft  stand  to 
one  another  in  one  of  the  relations  a  =  ft,  a  >  ft,  or  a  <  ft,  may  be  said  to  be 
comparable  with  one  another.  Otherwise  they  are  incomparable  with  one 
another. 

THE   ADDITION   AND    MULTIPLICATION   OF   CARDINAL   NUMBERS. 

148.  If  M,  N  are  two  aggregates  which  have  no  element  in  common,  then 
the  aggregate  which  has  for  its  elements  all  those  of  M  and  all  those  of  N  is 
called  the  sum  of  the  two  aggregates  M,  N,  and  may  be  denoted  by  (M,  N). 
A  similar  definition  applies  to  the  case  of  the  sum  of  any  number  of  aggre- 
gates, no  two  of  which  have  an  element  in  common. 

If  M',  N'  are  two  other  aggregates  with  no  element  in  common,  such 
that  M~M',  N~N',  it  is  clear  that  (M,  N)~(M',N')\  and  thus  the 
cardinal  number  of  (M,  N)  depends  only  on  those  of  M  and  N. 

If  M  =  a,  N  =  ft,  we  define  the  result  of  the  operation  of  addition  of  a  and  ft 
to  be  (MTN). 

1    From  the  independence  of  cardinal  numbers  of  the  order  of  elements,  we4 
deduce 

a  +  ft  =  ft+a,     a +08 +  7)  =  («  +  £) +  7; 

thus  the  operation  of  addition  of  cardinal  numbers  obeys  the  commutative 
and  associative  laws. 

149.  If  an  element  m,  of  M,  be  associated  with  an  element  n,  of  N,  so  as  to 
form  a  new  element  (m,  n),  the  aggregate  of  all  possible  elements  which  can 
be  formed  in  this  way  is  called  the  product  of  M  and  N,  and  may  be  denoted 
by(M.N). 

If  M  ~  M' ,  N  ~  N',  it  is  clear  that,  to  each  element  (m,  n),  of  (M .  N),  there 
is  a  corresponding  element  of  (M' .  N'},  hence  (M .  N)  ~  (M' .  N')  ;  and  thus 

( M .  N)  depends  only  on  M  and  N. 
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The  cardinal  number  of  the  product-aggregate  (M .  N)  defines  the  product  of 
the  cardinal  numbers  of  M  and  N. 

The  product  of  M  and  N  may  also  be  defined  as  the  cardinal  number  of  the 
aggregate  which  is  obtained  by  substituting  for  each  element  of  N  an  aggregate 
which  is  equivalent  to  M. 

It  is  seen  on  reflection  that  this  definition  is  equivalent  to  the  first  one. 
Since,  as  can  be  shewn  from  the  definition, 

(M.N)~(N.  M),    (M.(N.R))~((M.N) .  R) 
and  (M .  (N,  R))  ~  ((M .  N),  (M .  R)), 

we  see  that  cardinal  numbers  satisfy  the  relations 

a.tf  =  /3.a;  a  .  (/3  .  7)  =  (a  . /3)7;  a  (/3  +  7)  =  a/3  +  ay. 

It  has  thus  been  shewn  that  the  multiplication  of  cardinal  numbers  obeys 
the  commutative,  associative,  and  distributive  laws. 

The  definition  of  multiplication  may  be  extended*  to  the  case  in  which 
the  number  of  factors  is  not  necessarily  finite.  Let  us  consider  a  class  of 
aggregates  M,  where  the  class  contains  either  a  finite,  or  an  infinite,  number 
of  aggregates,  and  suppose  no  two  of  the  aggregates  have  an  element  in 
common.  Let  a  new  object  consist  of  an  association  of  elements  of  the  aggre- 
gates in  the  given  class,  one  element  belonging  to  each  of  those  aggregates. 
All  such  objects  may  be  regarded  as  the  elements  of  a  new  aggregate.  This 
new  aggregate  is  said  to  be  the  product-aggregate  of  the  given  class  of  aggre- 
gates, and  its  cardinal  number  defines  the  product  of  the  cardinal  numbers 
of  all  the  aggregates  of  the  given  class. 

CARDINAL   NUMBERS   AS   EXPONENTS. 

150.  If  we  have  two  finite  aggregates  M,  N,  containing  x  and  y  elements 
respectively,  we  may  suppose  that,  to  each  of  the  y  elements  of  JV,  one  element 
of  M  is  made  to  correspond,  so  that  the  same  element  of  M  may  be  used  any 
number  of  times;  any  such  particular  correspondence  we  call  a  distribution 
of  N  upon  M.  The  total  number  of  ways  of  distributing  N  upon  M  is  x'J. 
To  put  the  matter  in  a  concrete  form,  the  total  number  of  ways  of  distributing 
y  things  among  x  persons,  where  any  number  of  the  y  things  may  be  given  to 
one  person,  is  3&\  any  particular  mode  of  distribution  is  what  we  have  called 
a  mode  of  distributing  the  aggregate  of?/  things  upon  the  aggregate  of  a;  persons. 

The  definition  of  distributing  an  aggregate  N  upon  an  aggregate  M  is 
immediately  extensible  to  the  case  of  infinite  aggregates.  As  before,  the 
distribution  denotes  any  system  by  which,  to  each  element  of  N,  is  made  to 

*  See  Whitehead,  American  Journal  of  Math.  vol.  xxiv,  p.  383,  where  the  theory  of  cardinal 
numbers  is  treated  by  the  Peano-Russell  symbolical  method. 
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correspond  a  particular  element  of  M,  the  same  element  of  M  being  employed 
any  number  of  times,  or  not  at  all.  Denoting  by  NjM  each  particular  mode 
of  distributing  N  upon  M,  we  thus  form  the  new  aggregate  (N/M)  which  con- 
tains as  its  elements  all  such  distributions. 

It  is  seen  at  once  that,  if  M~  M',  N~N',  then  (N/M)  ~  (N'/M').  Thus 
the  cardinal  number  of  (N'/M)  depends  only  on  the  cardinal  numbers  of  M 
and  N. 

The  cardinal  number  of  the  aggregate  (N/M),  each  element  of  which  is  a 
distribution  of  N  on  M,  and  in  which  every  possible  mode  of  such  distribution 
occurs  as  an  element,  is  denoted  by  the  symbol  a?,  where  a  =  M,  /9  =  N',  thus 
a?==(N/M). 

It  is  easy  to  shew  that 

((N/M)  .  (RIM))  ~  ((N,  E)/M) 


_ 

Hence  if  M  =  a,  N  =  /3,  R  =  7,  we  see  that,  in  accordance  with  the  above 
definition  of  exponentials, 

a*  .  a*  =  a***,     a*  .  /S»  =  (a  .  #)>,     (a*)*  =  a?-Y', 

and  thus  the  same  laws  hold  as  for  exponents  in  which  only  finite  cardinal 
numbers  are  involved. 

THE    SMALLEST   TRANSFINITE    CARDINAL   NUMBER. 

151.    The  cardinal  number  of  the  aggregate  of  all  the  finite  integers 
1,  2.  3,  ...  n,  ......  is  called  Alef-zero,  and  is  denoted  by  N0;   thus  N0  =  {w}. 

The  number  N0  is  identical  with    the  number  which  has  previously  been 
denoted  by  a. 

If  we  add  to  {n}  a  new  element  e,  we  obtain  the  sum-aggregate  ({n},  e), 
and  this  is  equivalent  to  {n},  for  we  may  make  e  in  the  first  of  these  aggre- 
gates correspond  to  1  in  the  second,  and  in  general  n  to  n  +  1  ;  and  thus 
({n},  e)~  {n}.  From  this,  we  obtain  N0  +  1  =  N0>  a  relation  which  differentiates 
K0  from  all  the  finite  cardinal  numbers. 

The  cardinal  number  N0  is  greater  than  all  the  finite  cardinal  numbers,  and 
no  transfinite  cardinal  number  exists  that  is  less  than  K0. 

Since  the  finite  aggregate  (1,  2,  3,  ...  k)  is  a  part  of  [n\,  but  no  part  of  the 
finite  aggregate  is  equivalent  to  {n},  we  have,  by  the  definition  of  inequality, 
N0>&. 

Let  us  suppose,  if  possible,  that  a  =  [M]  <  Xo  ;  then  there  is  no  part  of  {M} 
equivalent  to  {n\,  but  there  is  a  part  of  {n}  equivalent  to  \M}.  Now  every  part 
of  {n}  is  either  finite,  or  else  is  equivalent  to  {«}.  For,  consider  a  part  N1}  of 
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{«};  this  must  contain  some  number  n  ;  either  n  is  the  number  of  lowest  rank 
in  Nlt  or  there  are  in  JV,  numbers  of  lower  rank  than  n.  In  either  case  N} 
contains  a  part  N*  which  is  a  part  of  the  finite  aggregate  1,  2,  3,  ...  ?5.  In  ac- 
cordance with  the  definition  in  §  2,  N2  has  a  number  nn  of  lower  rank  than 
all  the  other  numbers  in  1,  2,  3,  ...  n  ;  this  number  Jij  has  lower  rank  than  all 
the  other  numbers  in  Nl.  Removing  nt  from  Nlt  we  now  have  a  part  Nit  of 
{n},  and  it  may  be  proved  as  before  that  there  exists  a  number  n2,  of  lower  rank 
than  all  the  other  numbers  in  N2.  Proceeding  in  this  manner,  we  obtain  a  set 
Wi,  ih,  n»,  •••  °f  numbers  all  belonging  to  Nit  each  one  of  which  is  of  lower  rank 
than  the  next.  If  this  sequence  terminates  at  some  number  np,  N!  is  a  finite 
aggregate  ;  if  it  does  not  terminate,  it  is  a  sequence  equivalent  to  {n}.  Since 
every  element  of  NI  must  occur  as  one  of  the  numbers  {np},  it  thus  follows  that 
iVj  is  equivalent  to  {n}.  It  now  follows  that,  either  [M  }  is  a  finite  aggregate, 
in  which  case  its  cardinal  number  is  <  N0,  or  else  {M  }  is  equivalent  to  {n},  in 
which  case  its  cardinal  number  is  tf0,  contrary  to  the  hypothesis  that  it  is 
<  N0.  It  has  thus  been  shewn  that  the  cardinal  number  of  any  infinite 
aggregate  is  either  =  K0,  or  else  that  it  is  incomparable  with  N0. 

The  following  more  stringent  theorem  was  proved  by  Cantor  : 

The  cardinal  number  K0  is  less  than  any  other  transfinite  cardinal  number. 

The  proof  of  this  theorem,  unlike  that  of  the  foregoing  one,  requires  the 
employment  of  an  infinite  process  of  choosing  particular  elements  from  an 
aggregate.  It  is  equivalent  to  the  theorem  that  every  non-finite  aggregate 
contains  apart  which  is  equivalent  to  {n},  the  aggregate  of  finite  numbers,  that 
is,  a  part  whose  cardinal  number  is  K0  . 

If  M  be  any  non-finite  aggregate,  the  process  of  choosing  a  first,  second, 
.  .  .  nth  element  from  the  aggregate  can  be  continued  indefinitely  without  limit. 
Thus  M  contains  a  part  that  is  equivalent  to  {n},  and  therefore  (§  147)  its 
cardinal  number  is  =  K0.  Thus  every  cardinal  number  that  is  not  finite,  and 
is  different  from  N0,  is  >N0. 

152.  It  has  been  shewn  that  N0  4-  1  =  K0  :  a  similar  proof  would  shew  that 
K0  +  n  =  K0  ,  where  n  is  any  finite  integer. 

In  accordance  with  the  definition  of  addition,  N0  +  K0  is  the  cardinal  number 
of  the  aggregate  (1,  3,  5,  ...  2,  4,  6,  ...),  for  K0  is  the  cardinal  number  of  each 
of  the  aggregates  (1,  3,  5,  .  .  .)  (2,  4,  6,  .  .  .)  ;  hence,  since  the  cardinal  number  of 
(1,  3,  5,  ...  2,  4,  6,  ...)  is  the  same  as  that  of  {n},  we  have  N0+  X0=  K0,  which 
we  may  write  as 

N0.2  =  2.K0  =  N0. 

From  this  relation,  by  repeated  addition  of  N(,  to  both  sides  of  the  identity,  we 
find 
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In  order  to  express  the  product  K0  •  K»  we  form  the  aggregate  {(n,  n')}  of 
which  the  elements  (n,  n')  consist  of  every  pair  of  finite  cardinal  numbers. 
Let  n  +  n'  =  s,  then  s  has  the  values  2,  3,  4,  .  .  .  ;  and  for  any  fixed  value  of  s 
the  numbers  n,  n  have  a  definite  number  of  sets  of  values.  Let  s  =  2  ;  we 
then  have  one  element  (1,  1):  let  s  =  3,  we  then  have  two  elements  (1,  2), 
(2,  1):  for  s  =  4,  we  have  (1,  3),  (2,  2),  (3,  1),  and  so  on.  The  elements  of 
{(n,  n')}  may  thus  be  arranged  in  order  so  that  the  element  (n,  n)  is  at  the 
pih  place,  where  y  =  n  +  $  (n  +  n'  —  1)  (n  +  n'  —  2);  thus  the  aggregate  {(n,  n')} 
is  equivalent  to  [p],  which  has  the  cardinal  number  N0. 

It  has  now  been  proved  that  N0.N0=X0)  or  K02  =  N0;  and  from  this  the 
theorem  N0n  =  N0  follows,  by  repeated  multiplication  by  N0. 

The  theorems  n  .  K0  =  N0,  N02  =  N0  express  in  a  symbolical  form  the  results 
which  have  been  proved  in  §  58,  that  a  finite,  or  an  enumerably  infinite,  set 
of  enumerable  aggregates  makes  an  enumerable  aggregate. 

THE  EQUIVALENCE  THEOREM. 

153.  The  proof  referred  to  in  §  147  will  now  be  given,  that,  if  M,  N  are 
any  two  aggregates  such  that  M  contains  a  part  M-^  which  is  equivalent  to  N, 
and  N  contains  a  part  JVj  equivalent  to  M,  then  M  =  N.  This  theorem,  which 
may  be  called  the  equivalence  theorem,  was  first  proved  by  Schroder*  and 
independently  by  Bernstein  f  ;  but  the  form  in  which  the  proof  is  here  given 
is  due  to  ZermeloJ. 

Lemma  I.  If  a  cardinal  number  a  remains  unaltered  by  the  addition  of 
any  one  of  the  enumerable  set  of  cardinal  numbers  pi,p2,  •••  pn,  •  •  •  5  it  remains 
unaltered  if  all  these  cardinal  numbers  p  are  added  to  it  at  once. 

Let  M,  PI,  P2,  ...  Pn,  ...  be  aggregates  of  which  the  cardinal  numbers  are 
«>  PI>  PZ,  •••  pn>  •  ••  ;  and  such  that  Pj,  P2,  ...  Pn,  ...  are  all  parts  of  M.  We 
have  then,  M=(Pl}  Ml)  =  (P2,  JQ  =  ...  =(Pn,  Mn)  ...;  where  Ml}  M2,  ...  are 
all  parts  of  M,  and  in  virtue  of  the  hypothesis  made  in  the  statement  of  the 
theorem, 

M~Ml  =  M*=...=Mn... 
for  pn  +  M  =  M  =  pn  +  Mn. 

We  may  denote  the  (1,  1)  correspondence  which  can  be  set  up  (see  §  145) 
between  M  and  Mn,  by  Mn  =  (j>nM  ;  and  this  for  every  n.  Now  it  is  clear  that 
this  relation  of  correspondence  is  such  that 


*  See  Jahresbericht  d.  Deutsch.  Math.  Vereinigung,  vol.  v,  p.  81  (1896);  also  Nova  Acta  Leop., 
vol.  LXXI,  p.  303(1898). 

t  See  Borel's  Lemons  sur  la  theorie  dcs  fonctions,  p.  103. 

£  Gottinger  Nachrichtf.ii,  1901,  p.  34,  "  Ueber  die  Addition  transfiniter  Cardinalzablen."  Fox- 
remarks  upon  these  proofs  see  Korselt,  Math.  Annalen,  vol.  LXX,  p.  294. 
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Hence  M=(Pl,Ml\ 

M,  =  </>! 3f- (<kP2,  fr M3)  =  (P2',  A/,'), 

where  P2',  3fa'  are  those  aggregates  which  correspond  to  P2,  M2,  respectively, 
in  the  correspondence  denoted  by  ^. 

Also,  with  a  similar  notation, 

f-to^p,,  $ifcJf,)-(P,',  .a//), 


^/r-i  =  ^1</>2...<#>r-l^=(<#>102...^-1Pr,  &&  .  .  .  <f>r-iMr)  =  (Pr',  Mr'}. 

From  these  results  we  deduce 

^=(p1,p.;)ps',...p/)^;); 

and  no  two  of  the  parts  P:,  P2',  P3',  ...  P/,  of  J/,  have  elements  in  common. 
This  process  of  division  of  M  can  be  continued  indefinitely  ;  and  we  then  have 

M  _/p    P'   P'        M  "> 

JU    —  \-L.l,  -t   2  )  •*  8  >   ••'   m<a  )> 

where  P/,  for  every  r,  is  included,  and  MJ  consists  of  those  elements  which 
belong  to  Mr'  for  every  value  of  r.   From  this  we  see  that 


where  a'  is  the  cardinal  number  of  MJ. 

Let  us  now  consider  the  special  case  of  the  lemma  which  arises  when 
Pi,pv,  ...  are  all  equal,  say  to  p.  In  this  case,  we  see  that,  from  the  hypothesis 
a  =  p  +  a,  the  result  a  =  K0p  +  a!  follows,  where  K0  denotes  the  cardinal  number 
of  the  series  of  finite  integers. 

Now  since  K0=2K0,  we  have  K0p  +  a  =  2$0p  +  a!  =  K0p  +  a  ;  it  has  thus 
been  shewn  that,  if  a  =  a  +  p,  then  a  =  a  +  tf  0£>. 

Returning  to  the  general  case,  we  have 

a  =  a  4-  tf  Opi  =  a  +  N0p2  =  .  .  .  ; 

it  now  follows  that  a  =  #0pi  +  &opz  +  •••  +  a", 

where  a"  is  the  value  which  a'  takes  when  N0/>i,  K0p2,  •••  are  substituted  for 
Pi,pz>  .... 

Hence  we  have 

a  =  2K0  (p,  +  p.2  +  .  .  .  )  +  a"  =  a  +  K0  (p,  +  p,  +  .  .  .  ) 


and  therefore  the  Lemma  has  been  established. 

Lemma  II.    If  the  sum  of  two  cardinal  numbers  p  and  q,  when  added  to  a, 
leaves  a  unaltered,  then  a  is  unaltered  by  the  addition  of  either  p  or  q. 
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For  if  a  =  a  +  p  +  q, 

we  have  seen  that  a  =  a  +  K0  (p  +  q)  ', 

hence  a  =  a  +  (K0  +  1)  ^  +  N0<?, 

and  also  a  =  a  +  N0p  +  (No  +  1)  q  ', 

from  these  equalities  we  have 

a  =  a  +  p,  and  a  =  a  +  ^. 

We  are  now  in  a  position  to  prove  the  equivalence  theorem.    If 

a  =  ft  +  p,  and  @  =  a  +  q, 
we  have  a  =  a  +  jt)  +  <?, 

and  hence,  by  Lemma  II,     a  =  ct  +  p  —  a  +  q  =  /3; 
therefore,  if  M  has  a  part  equivalent  to  N,  and  N  has  a  part  equivalent  to  M, 

it  follows  that  M  =  a  +  p  =  a  +  q  =  N',  where  a  is  the  cardinal  number  of  the 
part  of  M  that  is  equivalent  to  N,  and  /3  is  that  of  the  part  of  N  that  is 
equivalent  to  M. 

In  case  M  is  itself  equivalent  to  N,  which  will  in  particular  happen  if  M 
is  a  part  of  N,  we  have  p  =  0,  and  the  theorem  holds  good. 

In  case  the  condition  a  =  /3  +  p  holds,  but  there  is  no  corresponding  con- 
dition /3  =  a  +  q,  we  have  in  accordance  with  the  definition  in  §  147,  a  >  /3. 
It  follows  that  the  sum  of  two  or  more  cardinal  numbers  is  greater  than,  or 
equal  to,  any  one  of  the  cardinal  numbers. 

The  following  theorem  may  be  established  : 

If  the  cardinal  number  a.  is  unaltered  by  the  addition  of  p,  and  if  /3  =  a, 
the  cardinal  number  ft  is  unaltered  by  the  addition  of  q,  where  q  £  p. 

For  let  ft  =  a  +  7,  p  =  q  +  r  ;  then,  from  a  =  a  +  p  =  a  +  q  +  r,  we  deduce  that 
a  =  a.  +  q.  It  then  follows  that 


154.  A  proof  has  been  given  by  Cantor*  that,  if  an  aggregate  exists  of 
which  the  cardinal  number  is  a,  then  an  aggregate  exists  of  which  the  cardinal 
number  is  greater  than  a. 

The  proof  is  a  generalization  of  the  second  proof,  given  in  §  60,  that  the 
cardinal  number  c,  of  the  continuum,  is  greater  than  that  of  the  rational  num- 
bers. The  proof  may  be  put  into  the  following  form  : 

Suppose  M—  [m]  to  be  an  aggregate,  of  cardinal  number  a  ;  this  aggregate 
M  maybe  supposed  to  be  simply  ordered(see  §157)in  any  manner.  In  M  let  each 
element  m  be  replaced  either  by  A  or  by  B,  where  A,  B  are  two  given  objects  ; 
then  M  is  replaced  by  a  similar  aggregate  (see  §  157),  in  which  each  element  is 

*  See  Jahresbericht  d.  Deutsch.  Math.  Vereinigung,  1897. 
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either  A  or  B.  An  infinity  of  such  aggregates  will  be  obtained,  differing  from  one 
another  in  respect  of  whether  A  or  B  has  been  put  in  the  place  of  each  element 
of  M ;  denoting  the  aggregate  of  all  such  possible  aggregates  MAB,  by  [MAB], 
it  will  be  shewn  that  the  cardinal  number  of  [MAB]  is  greater  than  that  of  M. 
In  the  first  place,  it  can  be  seen  that  the  cardinal  number  of  [MAB]  is  equal  to, 
or  greater  than,  that  of  {m} ;  for,  taking  any  one  element  m0,  of  {m},  replace  it 
by  A,  and  all  the  other  elements  by  B;  we  have  then  an  element  of  [MAB], 
and  there  is  such  an  element  corresponding  to  each  element  ma,  of  {m} ;  thus 
those  elements  of  [MAB]  in  which  there  is  only  one  A,  form  an  aggregate 
of  cardinal  number  equal  to  that  of  {m}.  Next,  let  us  assume  that,  if  possible, 
all  the  elements  of  [MAB]  are  placed  in  (1,  1)  correspondence  with  those  of 
{m} ;  it  will  then  be  shewn  that  an  MAB  can  always  be  defined  which  is  not 
included  in  the  correspondence.  Each  M°AB,  in  [MAB],  now  corresponds  to  a 
definite  m0,  in  {m}  ;  form  a  new  aggregate  M'AB  in  the  following  manner: — 
For  each  element  M°AB,  in  {MAB},  in  which  A  takes  the  place  of  m0,  in  {m}, 
substitute  B ;  and  for  each  element  M°AB,  in  {MAB},  in  which  B  takes  the  place 
of  ra0,  in  {m},  substitute  A  ;  in  this  manner  we  form  an  aggregate  M'AB  in 
which  each  element  is  either  A  or  B,  which  is  similar  to  {in},  and  which  is  not 
identical  with  any  MAB  that  occurs  in  the  correspondence  between  [MAB]  and 
{m}.  It  has  thus  been  shewn  that  the  cardinal  number  of  the  aggregate  of 
all* the  MAB  is  greater  than  that  of  M.  If  M  is  the  aggregate  al,a2,a3, ...  an, ... 
which  is  similar  to  the  aggregate  of  integral  numbers,  and  if,  for  A  and  B,  we 
take  0  and  1  respectively,  then  the  aggregate  {Mm}  may  be  interpreted  as  the 
aggregate  of  all  the  rational  and  irrational  numbers  between  0  and  1,  in  the 
binary  scale ;  and  this  aggregate  is  thus  shewn  to  be  unenumerable. 

Instead  of  replacing  the  elements  of  {m}  by  two  letters  A,  B,  we  might  have 
taken  any  finite  number  of  letters,  without  altering  the  principle  of  the  proof. 
In  §  60,  the  ten  digits  0,  1,  ...  9  were  taken  instead  of  A  and  B.  It  will  be 
observed  that,  even  if  {m}  is  normally  ordered  (see  §  165),  the  new  aggregate 
{M}  is  not  given  as  a  normally  ordered  aggregate ;  and  in  default  of  proof  it 
cannot  be  assumed  that  it  is  capable  of  being  arranged  in  normal  order. 

To  replace  all  the  elements  of  an  aggregate  either  by  A,  or  by  B,  is 
equivalent  to  taking  a  part*  of  the  given  aggregate.  The  theorem  has  thus 
been  established  that,  the  cardinal  number  of  the  aggregate,  each  element  of 
which  is  a  part  of  a  given  aggregate,  is  greater  than  the  cardinal  number  of  the 
given  aggregate,  all  possible  parts  being  contained  in  the  new  aggregate. 

155.  An  important  question  relating  to  cardinal  numbers  in  general  arises 
as  regards  the  sum,  a  +  /3,  of  two  transfinite  cardinal  numbers. 

If  a.  +  &  =  7,  it  may  be  the  case  that  y  is  greater  than  both  the  numbers 
a,  /3,  or  that  it  is  equal  to  one  of  them ;  it  being  assumed  that  a,  /?  are  com- 

*  See  Borel,  Lemons  mir  la  theorie  des  fond  ions,  p.  108. 
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parable  cardinal  numbers.  Whether  the  first  case  can  arise,  or  not,  has  not  been 
definitely  settled*,  even  in  the  case  in  which  a  =  /8;  thus  it  is  not  known 
whether  2a  is  necessarily  equal  to  a,  as  is  the  case  when  a  =  c,  or  when  a  =  N0. 

In  case  a  =  2a  =  a  +  a,  it  then  follows,  by  applying  Lemma  I  of  §  153,  that 
a  =  a  +  K0a  =  (K0  +  1) a;  and  thus  that  a  =  X0a. 

The  special  case  in  which  a  is  the  cardinal  number  of  a  set  of  points  in  space 
of  any  number  of  dimensions  has  been  referred  to  in  §  90,  where  the  theorem 
has  been  assumed  that,  if  x  be  the  cardinal  number  of  such  set,  then  ax  =  x. 
That  this  holds  good,  involves  the  assumption  that  2a?  =  x ;  which  is  equivalent 
to  the  assertion  that,  if  there  exists  a  set,  of  cardinal  number  x,  in  each  of  two 
non-overlapping  cells,  the  cardinal  number  of  the  combined  set  is  also  x. 

DIVISION  OF  CARDINAL  NUMBERS  BY  FINITE  NUMBERS. 

156.  If  two  aggregates  have  the  same  cardinal  number,  and  if  each  of  the 
two  aggregates  be  divided  into  the  same  finite  number  n,  of  parts,  such  that 
the  n  parts  of  the  first  aggregate  all  have  the  same  cardinal  number,  and  also 
the  n  parts  of  the  second  all  have  the  same  cardinal  number,  then  it  can  be 
proved  that  the  cardinal  number  of  one  of  the  parts  of  the  first  aggregate  is 
the  same  as  that  of  one  of  the  parts  of  the  second  aggregate.  Symbolically, 
the  theorem  may  be  stated  in  the  form  : — if  a,  /3  are  cardinal  numbers  such 
that  no.  —  w/3,  then  a  =  /3. 

This  theorem  has  been  proved  by  Bernstein f.  It  will  be  sufficient  to  give 
the  detailed  proof  in  the  case  n  =  2,  as  the  proof  in  the  general  case  is  obtained 
by  generalization  of  that  employed  in  the  particular  case. 

Since  an  aggregate  is  equivalent  to  itself,  any  special  mode  of  exhibiting 
such  equivalence,  by  which  each  element  is  made  to  correspond  to  a  definite 
other  element,  may  be  called  a  transformation  of  the  system  into  itself.  As 
regards  all  such  possible  transformations  the  following  propositions  may  be 
seen  to  hold: 

(1)  The  transformations  of  an  aggregate  M  into  itself  form  a  group  <f>M. 

(2)  Let  1,  x^>  Xz>  Xs>  •••  denote  a  sequence  of  transformations  of  M  into 
itself,  1  denoting  the  identical  transformation,  and  let  this  sequence  form  a 
group  which  is  necessarily  a  sub-group  of  <f>M',  then  the  condition  that  the 
sequence  forms  a  group  is  that,  corresponding  to  any  two  integers  m,  n,  there 
is  a  third  r,  such  that  %7n%n  =  Xr-    Further,  let  us  suppose  that,  to  every  Xn 
there  corresponds  a  definite  x'n,  such  that  XnXn  =  1-    If  m  be  an  element  of 
M,  such  that  m  ^  Xn  (w)>  f°r  w  =  1,  2,  3,  . . . ,  then  Xn  (m)  ¥*  Xn'  (m)>  where  n  and 
n'  are  any  unequal  integers. 

*  See  Schoenflies,  DieEntwickelung... ,  vol.  n,  p.  9;  also  Jourdain,  Phil.  Mag.  (7),  vol.  vi,  p.  73. 
t  Inaugural  Dissertation,   "  Untersuchungen  aus  der  Meugeulehre,"  Halle,  1901.      This  is 
reproduced  in  Math.  Annalen,  vol.  LXI,  p.  117. 
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(3)  If  m  and  m'  are  any  two  distinct  elements  of  M,  and  if  m 

for  n=l,  2,  3,  ...,  then  %n  (m)  ^  ^n.  (m  ').  For  if  ^n  (rn)  =  %n>  (m'),  we  should 
deduce  that  w  =  X  X*  (m)  **  X'  X*  (m/)  =  #>»"  (w')>  which  is  contrary  to  the 
hypothesis  made. 

(4)  If  7\,  Tg,  ...  are  parts  of  M,  such  that  each  T  has  no  element  in 
common  with  another  T,  we  may  say  that  the  T's  form  a  system  of  separate 
parts  of  M. 

If  T=~\t}  is  a  part  of  M,  and  if  t£<f>n(t'),  for  n  =  1,  2,  3,  ....  then  the 
equivalent  aggregates  T,  ^i  (71),  ^2  (21),  .  .  .  form  a  system  of  separate  parts  of  M. 

(5)  If  T  is  a  part  of  M  which  satisfies  the  condition  stated  in  (4),  then 


For  T,  fa  (T),  x-t  (T),  •••  are  all  parts  of  M  having  the  cardinal  number  T; 
and  if  R  is  the  part  of  M  which  remains  when  all  these  separate  parts  are 
removed,  we  have 

lf  =  I  +  N0.F; 

hence  M+  f  =  S+  (N0  +  1)  T 

=  I  +  K0.f  =£ 
After  these  remarks  we  can  proceed  to  the  proof  of  the  theorem  :  —  Let 

(a)  M  =  x  i  +  x  2  =  Hc3  +  #4  , 

(b)  Xi  =  x2, 

(c)  xs  =  xt; 

then  it  is  required  to  shew  that  x^  —  a~3;  which  involves  x2  =  x4. 

The  three  equations  (a),  (6),  (c)  may  be  regarded  as  denoting  that  there 
are  three  reversible  transformations  of  the  aggregate  M  into  itself,  which  may 
be  denoted  by  <f>a,  <^,  <£c  respectively  ;  the  reversibility  of  these  transformations 
is  expressed  by  <£a2  =  </>62  =  <f)c-  =  1. 

The  transformation  <f>a  involves  a;1  =  (a;18,  #14),  where  x13  are  those  elements 
of  xl  which  are  transformed  into  elements  of  xs,  and  #14  those  which  are  trans- 
formed into  elements  of  #4  ;  on  the  whole  we  have 


(6) 


where  x^  =  x^f. 


If  Tl  is  any  part  of  xlt  and  T2  an  equivalent  part  of  x.>,  we  may  denote  by 
Xi*,  x2*  the  aggregates  obtained  by  interchanging  those  elements  of  xl  which 
belong  to  2\  with  those  of  x.2  which  belong  to  T» ;  we  have  then  a  similar  set 
of  equations  to  (6)  for  the  new  starred  aggregates,  and  xl*  =  xl,  x.2 *  =  x.,, 
x3*  =  x3>  xf  —  xt.  Thus  if  the  theorem  be  proved  for  the  starred  aggregates, 
it  holds  for  the  original  ones. 
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We  have  to  shew  that,  after  suitable  transformations,  a  system  of  division 
of  the  aggregates  into  parts,  of  the  form  in  (6),  can  be  found,  such  that 
^13  +  ^14  =  ^14,  and  131  +  132  =  ^32-  For,  from  these  equations,  we  deduce 
x1  =  x2  =  xli,  xs  =  x4  =  x2Z,  and  then  the  aggregates  x2,  x4  are  such  that  each 
has  a  part  which  is  equivalent  to  the  other  ;  and  consequently,  in  accordance 
with  the  equivalence  theorem,  x2,  x4  are  equivalent  to  one  another;  or  12  =  #4. 
It  has  in  fact  to  be  shewn  that  x13  can  be  so  chosen  that  it  is  negligible  with 
respect  to  cardinal  number,  in  comparison  both  with  xu  and  with  x^. 

We  form  the  systems  of  transformations 


each  transformation  ^  in  this  system  has  one  inverse,  given  by  the  scheme 
X^Xm+i  =  !>  %4w+2;\>+2  =  1,  ;\>+3%4n+3  =  1 ;  thus  the  transformations  %  form  a 
group  of  reversible  transformations  of  M  (=  (xl ,  x2))  into  itself. 

An  element  els,  of  xl3,  is  either,  (i)  transformed  into  an  element  of  ^  by  a 
transformation  %,  with  finite  index,  or  else,  (ii)  e13  is  not  transformed  into  an 
element  of  x24  by  any  of  the  transformations  ^.  Suppose  then  that,  for  every 
element  el3,  of  xls,  the  second  of,  these  cases  arises,  then  %2r,  x&+i  transform 
the  elements  of  xls  into  aggregates  which  are  respectively  in  x^  and  xu,  and 
in  them  these  aggregates  form  an  enumerable  system  of  separate  parts  of  each. 
For,  in  the  case  contemplated,  ^2  transforms  x13  into  a  part  of  x^;  by  %4,  the 
elements  of  xri  become  elements  of  xw  or  x^,  consequently,  in  accordance  with 
(ii),  %4(#J3)  is  a  part  of  <%.  In  this  manner  it  is  seen  that  xl3  is  transformed, 
by  every  ^2n,  into  a  part  of  x23,  and  by  every  %2w+i>  into  a  part  of  #14.  It  then 
follows,  by  proposition  (5),  that  1J3  +  xu  =  xu,  x13  +  lr2s  =  x23,  and  the  theorem  is 
thus  completely  established.  The  remainder  of  the  proof  consists  in  shewing 
that,  by  an  exchange  of  elements  of  xl3  with  elements  of  #24,  it  is  possible  to 
arrange  so  that  the  case  just  considered  always  arises. 

Suppose  x:3  are  those  elements  of  xl3  which  are  transformed  by  %2  into 
elements  of  ^J  let  x13"  denote  those  elements,  different  from  xl3,  which  are 
transformed,  by  ^3,  into  elements  of  #24  which  were  not  affected  by  %2,  and  so 
on ;  we  thus  obtain  the  scheme 

v,  (xit)  in  XOA, 


2  XK"  ±  x13"     ^  (^is')  ±  X*  Ois")  ^  X*  (^""O  in 

3'  ±...±  x,3^      x*  W)  *  %3  W)  •  •  •  ^  Xn  («18(nl)     in 
We  take  now  the  equivalent  sums 

00  OC 

[>   1  _  •<  r   (»)    r,nr]   Tr    1  _  V  -,         (,r   (n)\ 
\_XK\  —  •*  *13     >  ana  L^24j  —  —  XH+I  V^is     ^j 
n=\  n=l 

and  we  carry  out  an  exchange  of  [#i3]  with  [#24]  ;  we  then  have 
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When  the  exchange  has  been  made,  of  the  elements  of  [#,3]  with  those  of 
[#«]>  we  denote  the  new  aggregates  by  starring  the  original  ones ;  we  have 
then,  in  accordance  with  the  formulae  (6),  expressions  for  x*,  #2*,  #8*,  ac*, 
and  we  can,  as  has  been  shewn  above,  attend  to  these,  instead  of  to  the 
original  #,,  xZl  #j,  #4.  Now  no  element  of  x13*  is  transformed  into  an  element 
of  irw*  by  any  of  the  transformations  %,  it  being  understood  that  the  trans- 
formations %  are  not  to  affect  the  substituted  elements;  and  thus,  by  the 
reasoning  which  has  been  given  above,  for  the  case  in  which  no  element  of 
a;13  is  transformed  into  an  element  of  #M,  the  theorem  is  established.  Bernstein 
has  also  proved  that,  if  2«  =  a  +  $,  where  a,  ft  are  cardinal  numbers,  then  a  =  /3. 

THE  ORDER-TYPE  OF  SIMPLY  ORDERED  AGGREGATES. 

157.  An  aggregate  M  is  said  to  be  a  simply  ordered  aggregate  when  each 
element  m  has  a  dejinite  rank  relatively  to  the  other  elements  of  M,  so  that,  of 
any  two  elements  m,  m'  whatever,  it  is  known  which  has  the  higher  and  which 
has  the  lower  rank. 

If  m  has  a  lower  rank  than  m',  the  fact  is  denoted  symbolically  by  m  <  m' ; 
and  if  a  higher  rank,  by  m  >  m'. 

If  an  aggregate  is  given,  at  first  unordered,  it  may  be  possible  to  order  the 
aggregate  in  a  variety  of  essentially  distinct  ways.  If  the  aggregate  is  finite, 
the  ordering  of  it  may  be  accomplished  by  arbitrarily  assigning  to  each  element 
its  rank  relatively  to  the  others.  In  case  the  aggregate  is  an  infinite  one,  the 
ordering  of  it  consists  in  the  setting  up  of  some  general  rule  which  suffices 
logically  to  assign  the  relative  order  of  any  two  elements. 

Besides  simply  ordered  aggregates  there  exist  also  doubly,  or  trebly,  ordered 
aggregates,  or  also  aggregates  with  higher  degrees  of  multiplicity  *  of  order. 
Each  element  of  such  an  aggregate  possesses  two,  three,  or  more  distinct 
characteristics  of  an  ordinal  character.  Only  simply  ordered  aggregates  will 
be  considered  here. 

Two  simply  ordered  aggregates  M,  N  are  said  to  be  similar,  when  a  (1,1) 
correspondence  can  be  established,  in  accordance  with  some  law,  such  that,  to  any 
two  dejinite  elements  m,  m,  of  M,  there  correspond  two  dejinite  elements  n,  n',  of 
N,  in  such  a  manner  that  the  relative  order  ofmf'm',  in  M,  is  the  same  as  that 
of  the  corresponding  elements  n,  n',  in  N. 

This  relation  of  similarity  may  be  represented  symbolically  by  M—N. 
Every  simply  ordered  aggregate  is  similar  to  itself. 

Two  simply  ordered  aggregates  which  are  similar  to  a  third  are  similar  to 
one  another. 

All  simply  ordered  aggregates  which  are  similar  to  one  another  are  said  to 
have  the  same  order-type. 

An  order-type  is  accordingly  characteristic  of  a  class  of  similar  aggregates. 
*  Multiple  order-types  have  been  considered  by  F.  Riesz,  Math.  AnnaJen,  vol.  LXI,  p.  406. 
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The  order-type  of  a  simply  ordered  aggregate  M  is  defined  by  Cantor  as 
the  concept  which  is  obtained  by  abstraction  when  the  nature  of  the  elements 
of  M  is  disregarded,  their  order  being  alone  retained.  The  order-type  of  M  is 
then  denoted  by  M.  This  definition  will  be  further  discussed  in  §  191.  That 
similar  aggregates  have  the  same  order-type  is  regarded  by  Cantor  as  a  de- 
duction from  this  definition. 

If,  in  M,  we  further  disregard  the  order  of  the  elements,  we  obtain  M,  the 
cardinal  number  of  M. 

The  order-type  of  M  is,  from  Cantor's  point  of  view,  regarded  as  a  simply 
ordered  aggregate  similar  to  M,  such  that  each  element  is  the  number  1.  If 
any  order-type  be  denoted  by  a,  the  corresponding  cardinal  number  is  denoted 
by  o. 

Corresponding  to  any  given  transfinite  cardinal  number,  there  is  a  multi- 
plicity of  order-types  which  form  a  class;  each  such  class  of  order-types  is 
characterised  by  the  common  cardinal  number  of  all  the  order-types  of  the 
class. 

The  order-types  which  belong  to  the  class  corresponding  to  a  cardinal  number 
a  form  an  aggregate  which  has  a  cardinal  number  a'.  It  will  appear  that  a'  is 
always  greater  than  a. 

If  the  order  of  every  pair  of  elements  in  a  simply  ordered  aggregate  M  be 
reversed,  the  aggregate  in  the  new  order  is  denoted  by  *M. 

If  M=a,  then  the  order-type  *M  is  denoted  by  *a. 

The  order-type  of  the  aggregate  of  all  the  finite  integers,  in  their  natural 
order  (1,  2,  3,  ...),  is  denoted  by  CD.  This  is  therefore  the  order-type  of  every 
aggregate  (al5  a2,  ...  an  ...)  which  is  similar  to  (1,  2,  3,  ...). 

The  aggregate  (...  an  ...  a3,  a2)  a^  has  the  order-type  *«. 

THE  ADDITION  AND  MULTIPLICATION  OF  ORDER-TYPES. 

158.  If  M .  N  denote  two  simply  ordered  aggregates,  and  if  the  aggregate 
(M,  N)  be  formed,  in  which  all  the  elements  of  both  M  and  N  occur,  and  which 
is  such  that  any  two  elements  of  M  have  the  same  relative  order  as  in  M,  and 
that  any  two  elements  of  N  have  the  same  relative  order  as  in  N,  and  further 
that  each  element  of  M  has  a  lower  rank  than  all  the  elements  of  N,  then 
the  new  simply  ordered  aggregate  (M,  N)  is  said  to  be  the  sum  of  the  two 
simply  ordered  aggregates  M  and  N.  It  is  clear  that  if  M  —  M',  X—  N',  then 
(M,  N)  -  (M',  N'),  and  thus  that  the  order-type  of  (M,  N)  depends  only  on  the 
order-types  of  M  and  N. 

If  M=a,  iV=/8,  the  sum  a  +  ft  is  defined  to  be  the  order-type  of  the  sum 
(M,  N)  of  the  two  simply  ordered  aggregates,  as  defined  above. 
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This  defines  the  operation  of  addition  of  order-types.  It  will  be  seen  that 
the  addition  of  order-types  does  not  obey  the  commutative  law.  For  if  a  =  M, 
£=iV,  then  a  +  ft  =  (M,N):  but  ft  +  a  =  (N,M);  and  the  two  order-types 


(M,  iV),  (N,  M)  are  in  general  different  from  one  another. 

If  n  denotes  a  finite  integer,  o>  +  n  is  the  order-type  of  the  ordered 
aggregate  (eti  es,  e3,  ...  /,,  /„  .../„),  whereas  n  +  a>  is  the  order-type  of 
(/i»/a.'  •••  /»»  «n  «2>  $»  •••)•  It  ig  c^ear  that  the  firet  °f  these  aggregates  is  not 
similar  to  (0,  ,  #,,  #,,  .  .  .  ),  but  if  we  make/1  ,/2)  .  .  .  /„  correspond  to  #,,  £2,  #,,...#„, 
then  ^  to  #n+i,  e2  to  <jrn+2,  ...  and  in  general  em  to  gn+m>  it  is  seen  that  the 
second  of  the  above  order-types  is  similar  to  (glt  g2,  gs,  ...).  It  thus  appears 
that  n  +  a)  =  (a,  but  &>  +  n  =^=  o>. 

159.  In  the  simply  ordered  aggregate  N,  let  us  suppose  that,  in  the  place  of 
each  element,  there  is  substituted  a  simply  ordered  aggregate  similar  to  M, 
whereby  a  new  simply  ordered  aggregate  is  formed  ;  this  may  be  denoted 
by  M.N.  It  is  clear  that  if  M^M',  N-N',  then  M.N^M'.N';  thus  the 
order-  type  of  M.  N  depends  only  on  the  order-  types  of  M  and  N. 


'If  a  =  M,  ft  =  N,  the  product  a. .  ft  is  defined  to  be  M  .N,  the  order-type  of 
M .  N,  cis  just  defined. 

It  will  be  seen  that  the  product  a .  ft  is  in  general  different  from  ft .  a,  and 
thus  that  the  multiplication  of  order-types  does  not  obey  the  commutative 
law.  For  example,  to  .  2  is  the  order-type  of  the  aggregate  formed  by  sub- 
stituting in  (ffj ,  a2)  for  each  of  the  two  elements  an  aggregate  of  type  &> ; 
o>  .2  is  therefore  the  order-type  of  (bl}  62,  b3,  ...  d,  C2,  C3,  ...),  in  which  there 
is  no  last  element,  and  no  element  immediately  preceding  d.  On  the  other 
hand,  2  .  &>  is  the  order-type  obtained  by  substituting  for  each  element  in 
(al}  a«,  a3,  ...),  an  aggregate  consisting  of  two  elements;  and  2  .  <o  is  thus 
the  order-type  of  the  enumerable  aggregate  (au,  alz,  a21,  a^,  a31,  a^, ...),  which 
is  similar  to  (blt  b»,  b3,  ...),  as  may  be  seen  by  making  anl  correspond  to  62»-i 
and  an2  to  b2n.  It  has  thus  been  shewn  that  2  .  o>  =  &>,  but  o> .  2  ^  w. 

THE  STRUCTURE  OF  SIMPLY  ORDERED  AGGREGATES. 

160.  An  examination  of  the  structure  of  a  simply  ordered  aggregate  M  can, 
in  general,  only  be  attempted  by  considering  the  nature  of  those  aggregates 
which  are  its  parts,  and  in  each  of  which  parts  the  order  of  the  elements  is  the 
same  as  that  of  the  same  elements  in  the  whole  aggregate.  The  simplest 
transfimte  part  of  an  ordered  aggregate  is  that  which  has  one  of  the  types 
&),  *o>.  Such  parts  we  speak  of  as  ascending  sequences,  and  descending  sequences. 
respectively,  contained  in  M. 

Two  ascending  sequences  {«„},  {a,,'j,  contained  in  M,  are  said  to  be  related 
to  one  another,  provided  that,  corresponding  to  any  element  a,,,  of  the  first, 
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there  are  elements  a'n',  of  the  second,  such  that  an<a'n>;  and  provided  also 
that,  corresponding  to  any  element  an',  of  the  second,  there  are  elements  an>, 
of  the  first  sequence,  such  that  an'  <  an'. 

Two  descending  sequences  {bn},  {&„'},  contained  in  M,  are  said  to  be  related 
to  one  another,  provided  that,  corresponding  to  any  element  bn  of  the  first 
sequence,  there  are  elements  b'n',  of  the  second,  such  that  bn  >  b'n> ;  and 
provided  also  that,  corresponding  to  any  element  bn',  of  the  second,  there 
are  elements  bn>,  of  the  first  sequence,  such  that  bn'  >  6n<. 

An  ascending  sequence  [an\  and  a  descending  sequence  {&„} ,  contained  in 
M,  are  said  to  be  related  to  one  another,  if  an<  bn<,  for  every  n  and  ri ;  and 
further,  provided  there  exists  in  M  no  element,  or  only  one  element  m,  which 
is  such  that  an<m<bn,  for  every  n. 

Two  sequences  contained  in  an  ordered  aggregate,  which  are  both  related 
to  a  third  sequence,  are  related  to  one  another. 

Two  sequences  in  an  ordered  aggregate,  which  are  both  ascending,  or 
both  descending,  and  of  which  one  is  a  part  of  the  other,  are  related  to  one 
another. 

161.  Suppose  that,  in  an  ordered  aggregate  M,  there  is  an  element  m0 
which  satisfies  the  following  conditions,  with  respect  to  an  ascending  sequence 
contained  in  M : 

(1)  for  every  n,  an  <  >n0; 

(2)  for  every  element  m,  of  M,  which  is  <  m0,  there  exists  a  number  n 
such  that  an,  an+1,  an+2,  ...  are  all  >  m ;  then  the  element  m0  is  said  to  be 
the  limiting  element,  or  limit  of  [an]  in  M;  and  m0  is  said  to  be  a  principal 
element  of  M. 

Similarly,  if  we  suppose  that,  in  M,  there  is  an  element  m0,  which  satisfies 
with  reference  to  a  descending  sequence  [an\,  contained  in  M,  the  following 
conditions : 

(1)  for  every  n,  an  >  ?n0; 

(2)  for  every  element  m,  of  M,  which  is  >m0,  there  exists  a  number  n 
such  that  an,  an+l,  an+2,  •••  are  all  <  m ;  then  the  element  m0  is  said  to  be  the 
limiting  element,  or  limit  of  [an\  in  M;  and  m0  is  said   to  be  a  principal 
element  of  M. 

A  sequence  contained  in  M  can  never  have  more  than  one  limiting 
element  in  M. 

If  a  sequence  in  M  has  a  limiting  element  in0,  in  M,  then  m0  is  the 
limiting  element  of  every  sequence  in  M  which  is  related  to  the  first  one. 

Two  sequences  which  have  the  same  limiting  element,  in  M,  must  be 
related  to  one  another. 


160,  i6i]  Simply  ordered  aggregates  207 

It  is  clear  that,  if  M,  M'  are  similar  ordered  aggregates,  an  ascending,  or 
a  descending,  sequence  in  M  corresponds  to  a  sequence  of  the  same  kind  in  M' . 
To  eveiy  principal  element  in  M  there  corresponds  a  principal  element  in  M'. 

An  ordered  aggregate  which  is  such  that  every  element  is  a  principal 
element  is  said  to  be  dense  in  itself. 

If,  in  an  ordered  aggregate,  every  sequence  which  is  contained  therein  has 
a  limiting  element  in  the  aggregate,  then  the  ordered  aggregate  is  said  to  be  a 
closed  aggregate. 

An  ordered  aggregate  which  is  dense  in  itself,  and  also  closed,  is  said  to  be 
perfect. 

An  ordered  aggregate  which  is  such  that,  between  any  two  whatever  of  its 
elements,  there  are  other  elements  of  the  aggregate,  is  said  to  be  everywhere 
dense. 

The  properties  of  an  ordered  aggregate,  thus  defined,  are  also  properties  of 
any  similar  aggregate;  hence  the  terms  may  be  applied  to  the  order-types 
which  are  symbolised  by  replacing  the  elements  of  the  ordered  aggregates 
by  1 ;  there  can  exist  therefore  an  order- type  which  is  dense  in  itself,  or 
closed,  or  perfect,  or  everywhere  dense. 

The  terms  which  have  been  here  employed  for  the  purpose  of  describing 
certain  peculiarities  which  may  exist  in  an  ordered  aggregate,  or  in  the 
corresponding  order-type,  are  identical  with  those  which  we  have  employed 
in  analogous  senses  in  Chapter  n,  in  the  case  of  sets  of  points,  or  numbers. 
There  is  however  a  distinction  which  must  be  noticed  between  the  use  of  the 
terms  in  the  two  cases.  To  illustrate  this  distinction,  let  (Pj,  P2,  P3, . . .  Pn, . . .} 
be  a  sequence  of  points  on  a  straight  line,  which  sequence  has  a  limiting 
point  PM,  on  the  right  of  the  points  Pn;  then  if  Q  be  any  point  of  the 
straight  line  on  the  right  of  Pa,  the  two  ordered  aggregates  (Pj,  P2,  Ps,  ... 
Pn,  -••  PUI).  and  (Pi,  P2,  P3,  ...  Pn,  ...  Q),  are  similar,  and  have  the  same 
order- type  a>  +  1.  In  the  first  of  these  aggregates,  Pw  is  the  limiting  element 
of  the  sequence  (P1?  P2,  ...  Pn,  ...)',  and,  in  the  second  aggregate,  Q  is  the 
limiting  element  of  the  same  sequence ;  and  therefore  both  the  ordered 
aggregates  are  closed,  in  the  sense  explained  above.  The  first  of  these 
aggregates  forms  a  closed  set  of  points,  in  the  sense  of  the  term  defined  in 
Chapter  n ;  but  the  second  does  not,  since  Q  is  not  a  limiting  point  of  the 
set  of  points  {Pn}.  The  distinction  rests  upon  the  different  use  of  the  terms 
limiting  element  and  limiting  point,  in  the  two  cases  of  an  ordered  aggregate 
of  elements  in  general,  and  that  of  a  set  of  points  in  the  continuum.  The 
question  whether  an  element  is  a  limiting  element  of  an  aggregate  to  which 
it  belongs,  or  not,  in  the  sense  defined  above,  is  answered  by  examining  the 
structure  of  the  ordered  aggregate  itself.  In  the  case  of  a  set  of  points  in 
the  continuum,  a  particular  point  may  be  a  limiting  element  of  the  aggregate 
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of  points  considered  merely  as  an  aggregate  of  elements  with  a  particular 
order-type ;  but  the  question  as  to  whether  the  same  point  is  a  limiting  point 
of  the  set  of  points,  considered  as  chosen  out  of  the  continuum,  can  only  be 
answered  after  an  examination  of  the  ordinal  relation  of  the  point  to  other 
points  of  the  continuum  which  do  not  belong  to  the  set ;  in  fact,  the  set  must 
be  regarded,  for  this  purpose,  as  an  aggregate  which  is  only  a  part  of  another 
aggregate,  the  continuum.  It  is  now  clear  that  a  set  of  points,  considered  solely 
as  an  ordered  aggregate  of  elements,  without  reference  to  the  fact  that  it  is 
essentially  a  part  of  the  continuum,  may  be  closed,  or  perfect ;  and  yet  that 
the  same  set  of  points  need  be  neither  closed  nor  perfect,  in  the  sense  of  the 
terms  employed  in  the  theory  of  sets  of  points,  which  has  been  dealt  with  in 
Chapter  n. 

THE  ORDER-TYPES  77,  0,  IT. 

162.  Certain  order-types  which  are  of  special  importance  will  be  now 
examined. 

The  first  of  these  is  the  order-type  77,  of  the  set  R,  of  rational  numbers 
between  0  and  1  (both  exclusive),  in  their  order,  as  defined  in  Chapter  I. 

It  will  be  shewn  that  the  order-type  77  is  exhaustively  characterised  by 
the  following  properties : 

(1)  77  =  N0  =  a. 

(2)  There  is  in  77  no  lowest,  and  no  highest,  element. 

(3)  77  is  everywhere  dense. 

In  fact,  every  simply  ordered  aggregate  M,  which  has  these  three 
characteristics,  is  similar  to  the  aggregate  R. 

To  prove  this,  we  first  observe  that,  on  account  of  the  condition  (1),  the 
order  of  the  elements  in  both  M  and  R  can  be  so  altered  that  each  of  them 
is  reduced  to  the  order- type  w.  Let  this  be  done ;  and  denote  by  M0,  R0  the 
new  ordered  aggregates 

MV  —  (mj,  m2,  ms,  .. .), 

-Ro  =  fa,  r2,  rs,  ...). 

We  have  to  shew  that  M  —  R\  and  to  do  this  we  have  to  shew  how  to  establish 
the  requisite  correspondence  between  the  elements  m,  of  M,  and  r,  of  R.  Let 
m-i  be  made  to  correspond  to  i\ ;  then  there  are  an  indefinitely  great  number 
of  elements  of  M,  which  have  the  same  relation,  as  regards  order,  to  m},  as  r.2 
has,  in  R,  relatively  to  7^ ;  of  all  these  elements  choose  that  one  m(2,  which 
has  the  smallest  index  as  it  appears  in  M0 ;  and  let  mf2  be  made  to  correspond 
to  i'o.  Of  all  the  elements  of  M,  which  are  related  to  ml  and  mf2,  in  the  same 
manner,  as  regards  order  in  M,  as  rs  is  related  to  r,  and  r2,  as  regards  order  in 
R,  choose  that  one  we3,  which  has  the  smallest  index  as  it  appears  in  M0;  and 
make  m(3  correspond  to  r3. 
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Proceeding  in  this  manner,  we  make  the  elements  r,,  r2,  r3  ...  rn,  of  R, 
correspond  to  the  elements  mlt  m,t,  mf|,  ...  w,n,  of  M;  and  so  far  as  these 
elements  are  concerned  the  relations  of  rank  are  preserved  in  the  corre- 
spondence :  we  proceed  then  to  choose,  in  the  same  manner  as  before,  the 
element  mtn+1 ,  which  is  to  be  made  to  correspond  to  rn+, ,  and  thus  we  obtain, 
for  every  rn,  the  corresponding  ra,n.  It  must  however  be  shewn  that  this 
process  exhausts  all  the  elements  m,  of  M,  that  is  to  say,  that  in  the  sequence 
1,  €a,  €3,  ...  €„,  ...  every  integral  number  p  occurs  in  some  definite  place.  This 
can  be  proved  by  the  method  of  induction.  Let  us  assume  that  the  elements 
m,,  w2,  m3, ...  mn  all  occur  in  the  correspondence  that  has  been  set  up  between 
the  whole  of -R  and  at  least  a  part  of  M;  then  we  shall  prove  that  mn+l  also 
occurs.  Upon  this  assumption,  let  X  be  so  great  that,  among  the  elements 
mlt  mtl,  mej,  ...  w,A,  all  the  elements  mlt  mz,  m3,  ...  wn  occur.  Then,  if  mn+l 
is  not  also  among  those  elements,  choose  out  of  rA+1,  rA+2)  rA+3,  ...  that  element 
rA+,,  with  the  smallest  index,  which  has  the  same  relation  to  7*,,  r2,  ...  rA,  as 
regards  order  in  R,  that  mn+1  has  relatively  to  ml}  mtl,  m,2,  ...  m€A,  as  regards 
order  in  M.  Then  the  element  mn+i  has  the  same  relation  to  ml,  mfl,  mti,  ... 
m<A-f«-i'  as  regards  order  in  M,  as  rA+g  has  to  rlt  r2,  ...  rA+g_1;  as  regards  order 
in  R.  It  thus  appears  that  mn+l  is  the  element  with  the  smallest  index  as 
it  appears  in  j¥0,  which  has,  in  M,  the  same  relation  as  regards  order  to 
mi>  w<2,  ...  m(^  g_i,  that  rA+g  has,  relatively  to  rlt  r2,  ...  rA+«_i,  in  R;  hence 
m(  =  7n,,^i ;  that  is,  the  element  mn+l  occurs  in  the  correspondence  which 
has  been  established  between  M  and  R.  It  has  now  been  shewn  that  M  and 
R  are  similarly  ordered  aggregates. 

Examples  of  the  order-type  rj  are  the  following : 

(1)  The  aggregate  of  all  negative  and  positive  rational  numbers,  including 
zero,  in  their  natural  order. 

(2)  The  aggregate  of  all  rational  numbers  which  are  greater  than  a,  and 
less  than  b,  where  a,  b  are  two  real  numbers  such  that  a  <  b. 

(3)  The  aggregate  of  all  real  algebraical  numbers  in  their  natural  order 
in  the  continuum,  or  of  all  such  of  these  numbers  as  lie  between  two  real 
numbers  a,  b. 

(4)  The  aggregate  of  a  set  of  non-abutting  linear  intervals  which  are 
such  that  their  end-points  and  the  limiting  points  of  these  end-points  form  a 
non-dense  perfect  set  of  points  in  a  linear  interval. 

The  rational  numbers  of  the  interval  (0,  1),  including  0  and  1,  form  an 
aggregate  of  the  order-ty pe  1  +  r]  +  1 . 

163.  We  now  proceed  to  the  consideration  of  the  order-type  6,  of  points 
forming  a  linear  closed  continuum. 

It  will  be  shewn  that  any  simply  ordered  aggregate  M  is  similar  to  the 
aggregate  X  of  all  real  numbers  of  the  continuum  (0,  1),  in  their  natural 

14 
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order,  provided  (1)  M  is  perfect,  and  (2)  in  M,  an  aggregate  S,  with  the 
cardinal  number  K0,  is  contained,  which  is  so  related  to  M,  that,  between  any 
two  elements  m0,  m1}  of  M,  there  are  elements  of  S. 

If  S  has  a  lowest  and  a  highest  element,  these  can  be  removed  without 
affecting  its  relation  to  M ;  and  thus  we  may  suppose  8  to  be  of  the  type  rj, 
of  the  aggregate  R  of  rational  numbers  which  lie  between  0  and  1,  both 
exclusive,  in  their  natural  order. 

Since  S  —  R,  we  may  suppose  the  elements  of  S  to  be  made  to  correspond 
in  order  to  the  elements  of  R ;  and  it  will  be  shewn  that  this  correspondence 
enables  us  to  establish  a  correspondence  between  the  elements  of  M  and  X. 

We  suppose  that  each  element  of  M,  which  belongs  to  S,  corresponds  to 
that  element  of  X  which  belongs  to  jR,  just  as  in  the  correspondence  of  S 
with  R  already  established.  Any  element  m,  of  M,  which  does  not  belong  to 
8,  is  the  limiting  element  of  a  sequence  [mn\  of  elements  of  S.  To  this 
sequence  {mn},  there  corresponds  a  sequence  {rn}  in  X,  all  the  elements  of 
which  belong  to  R ;  and  this  sequence  {rn}  has  a  limiting  element  x,  in  X, 
not  belonging  to  R ;  we  take  therefore  m,  in  M,  to  correspond  to  x,  in  X.  If 
we  take  a  different  sequence  {#,/},  which  has  the  same  limiting  element  m  as 
before,  in  M,  then  there  corresponds  to  it  a  sequence  {?•„'}  in  R,  which  has  the 
same  limiting  element  x  as  before,  in  X.  It  will  now  be  shewn  that,  in 
the  correspondence  so  established  between  the  elements  of  M  and  of  X,  the 
relative  order  of  two  elements  of  M  is  the  same  as  that  of  the  corresponding 
elements  of  X.  This  clearly  holds  of  any  two  elements  of  M  which  are  also 
elements  of  S.  Consider  next  two  elements  m  and  s,  of  M,  the  first  of  which 
does  not,  and  the  second  of  which  does,  belong  to  S;  and  let  x1}  r  be  the  corre- 
sponding elements  of  X.  If  r<  x1,  there  exists  an  ascending  sequence  in  R, 
of  which  #j  is  the  limiting  element,  such  that  all  its  elements  are  >  r;  then,  to 
this  sequence  there  corresponds  an  ascending  sequence  in  S,  all  the  elements 
of  which  are  >  s,  and  of  which  m  is  the  limiting  element ;  hence  s  <  m.  If 
r>x1,  it  can  be  shewn,  in  a  similar  manner,  that  s>m.  The  proof  that, 
corresponding  to  any  two  elements  ml,  m2,  of  M,  which  do  not  belong  to  S. 
the  elements  a?1(  jr2,  of  X,  are  such  that  m^  $  m.2,  according  as  xl  <.  x2,  is  of 
a  precisely  similar  character  to  that  just  given.  It  has  thus  been  shewn  that 
M  and  X  are  similar  aggregates,  and  that  the  type  6  is  characterised  by  the 
conditions  (1)  and  (2). 

The  above  characterisation*  of  the  type  6  contains  Cantor's  ordinal  theory 
of  the  constitution  of  the  linear  closed  continuum. 

A  non-dense  perfect  set  of  points  in  a  linear  interval  has  not  the  order- 
type  6,  but  the  set  of  complementary  intervals  together  with  the  limiting 

*  See  Russell,  Principle*  of  Mathematics,  vol.  i,  p.  303,  also  Veblen,  Trans.  A  me  r.  Math.  Soc., 
vol.  vi,  and  Huntington,  Annals  of  Math.,  (2),  vols.  vi  aiid  vn. 
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points  of  their  end-points  does  form  an  aggregate  of  order-type  6,  when  the 
elements,  consisting  partly  of  points,  and  partly  of  intervals,  are  taken  in  the 
order  in  which  they  occur  in  the  continuum. 


164.  The  order-  type  *o>  +  a>  may  be  denoted  by  IT,  and  is  the  order-type 
of  the  negative  and  positive  integers  in  their  natural  order.  This  order-type 
has  properties  distinct  from  those  of  o>.  For  example,  n  +  w  has  been  shewn 
to  be  identical  with  «,  where  n  is  a  finite  integer;  but  n  +  tr  is  not  identical 
with  TT.  From  either  of  the  equations  n  +  TT  =  m  +  TT,  or  TT  +  n  =  IT  4-  m, 
there  follows  m  =  n,  or  more  generally!: 

If  n,  n'  are  finite  integers,  £  and  £'  other  order-  types,  from  the  equation 
»  +  TT  +  f  SB  n'  +  tr  +  £,  there  follows  n  =  n,  f  =  f. 

To  prove  this  theorem,  we  observe  that,  if  the  two  aggregates  be  placed 
into  similar  correspondence,  the  lowest  elements  correspond  to  one  another, 
then  the  second,  and  so  on;  hence  n  =  n'  is  proved  at  once:  and  we  now  have 


When  two  simply  ordered  aggregates  Mn  +  Z,  Nn  +  Z'  ,  of  order-types 
TT  -f  f,  TT  +  £"  ,  are  placed  in  correspondence  in  order,  either  Mn  corresponds 
to  NV)  or  Mn  corresponds  to  a  part  of  Nv,  or  else  Nn  corresponds  to  a  part 
of  M,.  In  the  last  two  cases  the  order-type  TT  must  be  split  up  into 
TT  =  TJ-!  +  7r2,  where  ^  =  77^  and  7r2  is  some  other  order-type;  but  from  the 
definition  TT  =  *&>  +  &>,  it  is  clear  that  every  mode  of  dividing  TT  into  two 
parts,  without  altering  the  relative  order  of  the  elements,  leaves  it  in  the 
form  *w  +  o)  ;  hence  it  is  impossible  that  TT  =  frrl  +  7r2,  and  7r  =  7r1;  and  there- 
fore M,  corresponds  to  Nv.  Hence  also  Z  corresponds  to  Z  \  or  £  =  £". 

NORMALLY  ORDERED   AGGREGATES. 

165.  The  order-type  of  a  simply  ordered  aggregate  is,  as  we  have  already 
seen,  such  that  the  structure  of  the  aggregate,  as  revealed  by  an  examination 
of  the  sequences  contained  in  it,  may  be  of  the  most  varied  character  ;  the 
various  sequences  may  be  ascending  or  descending  ones,  and  may,  or  may  not, 
have  a  limiting  element  within  the  aggregate. 

Of  all  the  possible  order-types,  those  are  of  especial  importance  which 
have  been  defined  by  Cantor  as  the  order-  types  of  normally  ordered  aggregates 
(wohlgeordnete  Mengen). 

A  normally  ordered  aggregate  M  is  one  which  satisfies  the  following 
conditions  : 

(1)  M  has  an  element  m,  of  lower  rank  than  all  the  other  elements. 

(2)  If  M!  is  any  part  of  M,  and  if  M  contains  one  or  more  elements 
which  are  of  -higher  rank  than  all  the  elements  of  Ml}  then  there  ej'ixts  one 

t  Bernstein,  luc.  cii.,  p.  9. 
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element  m',  of  M,  which  immediately  follows  the  part-aggregate  Mlt  so  that 
there  are  no  elements  of  M  which  are  intermediate  in  rank  between  m'  and  all 
the  elements  of  Ml. 

The  special  case  of  (2)  which  arises  when  Ml  consists  of  one  element, 
shews  that  a  normally  ordered  aggregate  is  such  that  each  element  has  one 
which  immediately  follows  it,  unless  the  element  is  the  highest  element  of 
M.  It  is  however  not  necessarily  the  case  that  M  has  a  highest  element. 

If  e]}  e2,  e3, ...  en, ...  is  an  ascending  sequence  of  elements  contained  in  M, 
and  such  that  elements  exist,  in  M,  which  are  of  higher  rank  than  every  en> 
then  there  exists  an  element  e,  of  M,  which  is  higher  than  all  the  en,  and 
such  that  every  element  e",  of  M,  which  is  lower  than  e  is  lower  than  en,  en+i, 
en+2,  ...,  for  some  definite  value  of  n. 

Every  part  of  a  normally  ordered  aggregate  has  a  lowest  element. 

Let  M!  be  a  part  of  M;  if  Ml  contains  ml,  the  lowest  element  of  M,  then 
raj  is  the  lowest  element  of  J/j.  If  Ml  does  not  contain  mlt  consider  that 
part  of  M  which  contains  all  those  elements  every  one  of  which  is  of  lower 
rank  than  all  the  elements  of  M1 ;  this  part  of  M  must  have  an  element 
which  immediately  follows  it;  and  this  element  belongs  to  M^,  and  is  its 
lowest  element. 

//  a  simply  ordered  aggregate  M  itself,  and  also  every  part  of  M,  has  a 
lowest  element,  tlien  M  is  normally  ordered. 

The  condition  (1)  is  fulfilled.  Let  Ml  be  a  part  of  M  such  that  M  con  tains 
elements  which  are  higher  than  all  those  of  M^  let  these  form  the  aggregate 
M2,  and  let  m  be  the  lowest  element  of  M2.  Then  m  is  the  element  which 
immediately  follows  M1;  and  thus  the  condition  (2)  is  satisfied. 

This  property  of  a  normally  ordered  aggregate,  that  every  part  of  it  has  a 
lowest  element,  might  be  adopted  as  the  definition  of  a  normally  ordered 
aggregate. 

A  somewhat  simpler  property,  which  might  be  employed  to  define  a 
normally  ordered  aggregate,  is  the  following : 

An  aggregate  M  is  normally  ordered^  if,  and  only  if,  it  contains  no  part,  of 
which  the  order-type  is  *&>. 

If  M  is  not  normally  ordered,  at  least  one  part  of  it  must  have  no  lowest 
element,  and  this  part  contains  a  sequence  whose  order-type  is  *o>.  This 
follows  from  the  theorem  of  §  1 51 ,  that  every  aggregate  that  is  not  finite  contains 
a  part,  of  cardinal  number  N0.  Such  part  can  be  ordered  according  to  the 
type  w,  when  it  has  no  highest  element,  and  according  to  the  type  *&>,  when 
it  has  no  lowest  element.  An  aggregate  which  has  a  lowest  element,  and  is 

t  See  Jourdain,  Phil.  Mag.,  (6),  vol.  vn,  p.  65. 
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also  such  that  each  element  has  one  that  immediately  succeeds  it,  is  not 
necessarily  normally  ordered,  even  if  each  element  has  one  immediately 
preceding  it.  This  can  be  seen  by  considering  an  aggregate  with  the  order- 
type  o>  +  *w. 

166.  The  following  properties  of  normally  ordered  aggregates  can  be 
proved  in  a  very  simple  manner: 

Every  part-aggregate  of  a  normally  ordered  aggregate  is  itself  normally 
ordered. 

Every  ordered  aggregate  which  is  similar  to  a  normally  ordered  aggregate 
is  itself  normally  ordered. 

If,  in  a  normally  ordered  aggregate  M,  there  be  substituted  for  the 
elements  normally  ordered  aggregates,  in  such  a  manner  that,  if  M,H,  Mm~ 
are  the  aggregates  substituted  for  any  two  elements  m,  m',  then  Mm  $  Mm', 
according  as  in  £  m',  the  resulting  new  aggregate  is  normally  ordered. 

167.  The  part  of  a  normally  ordered  aggregate  M  which  consists  of  all 
those  elements  which  are  of  lower  rank  than  an  element  m,  of  M,  is  called  the 
segment  of  M  determined  by  the  element  m. 

The  aggregate  which  remains  when  the  segment  of  M,  determined  by 
the  element  m,  is  removed  from  M,  is  called  the  remainder  of  M  determined 
by  an  element  m.  The  element  m  is  the  lowest  element  of  the  remainder. 

If  8  is  the  segment  of  M,  determined  by  m,  and  R  is  the  remainder, 
then  Jf=(S,  R). 

The  segment  of  M  determined  by  the  lowest  element  of  M  contains  no 
element,  but  it  may  be  regarded  as  existent,  and  being  the  null-segment. 
The  remainder,  determined  by  the  lowest  element,  is  the  aggregate  M  itself. 

Of  two  segments  8,  S'  determined  by  the  elements  m,  m',  of  which  m  <  m', 
we  say  that  $  is  the  smaller,  and  8'  the  larger  segment,  or  8  <  S'. 

It  can  easily  be  seen  that,  if  M,  Ml  are  two  similar  normally  ordered 
aggregates,  a  segment  of  M  corresponds  to  a  similar  segment  of  Mlt  the 
element  by  which  the  segment  of  M  is  determined  corresponding  to  the 
element  of  MI  by  which  the  segment  of  MI  is  determined. 

A  normally  ordered  aggregate  is  not  similar  to  any  of  its  segments. 

Assume  that,  if  possible,  S—M,  and  suppose  the  elements  of  S,  M  are 
put  into  correspondence.  To  the  segment  S,  of  M,  there  must  correspond  a 
segment  Slt  of  S,  so  that  St  —  M  —  8,  where  Si  <  S.  Since  Si  —  M,  we  find  in 
a  similar  manner  a  segment  S.,<  Si,  which  is  similar  to  M,  and  so  on  ;  and  in 
this  way  we  obtain  an  unending  sequence  S  >  Sl  >  S»  . . .  >  Sn  ...  of  segments 
of  M,  which  are  all  similar  to  M.  Let  m,  mlt  m.2,  ...  m,,,  ...  be  the  elements 
which  determine  the  segments  8,  Si ,  8.,  ,...Sn,...',  then  m  >  nil  >  m.,  ...>  mn .... 
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The  aggregate  ( mn,  ...  m.2)  mlt  w)  would  be  a  part  of  M  which  has  no 

lowest  element,  and   is  of  type  *a>  ;    this  is  impossible  if  M  is  normally 
ordered. 

If  M  is  an  infinite  normally  ordered  aggregate,  it  always  has  parts  which 
are  similar  to  M,  although  such  a  part  cannot  be  a  segment. 

A  normally  ordered  aggregate  cannot  be  similar  to  any  part  of  one  of  its 
segments. 

Let  us  assume  that,  if  possible,  S'  a  part  of  a  segment  S,  of  M,  is  similar 
to  M.  Since  S'  —  M,  we  can  place  the  elements  of  S',  M  in  correspondence, 
then,  to  the  segment  8,  of  M,  there  will  correspond  a  segment  S1}  of  S' ,  where 
Sl  —  S;  let  then  Si  be  determined  by  the  element  el ,  of  S'.  Since  ^  is  also 
an  element  of  M,  it  determines  a  segment  M1}  of  M,  of  which  $1  is  a  part, 
and  which  has  a  part  similar  to  M.  Proceeding  in  the  same  manner,  we 
determine  a  segment  M2,  of  MI,  which  has  a  part  that  is  similar  to  M ; 
and  in  this  way  we  obtain  an  unending  sequence  of  segments  of  M,  all 
similar  to  M,  so  that  M  >  Ml  >  M2 . . .  >  Mn  —  The  elements  which  determine 
these  sequences  form  a  part  of  M  which  is  of  type  *  o>  ;  and  this  is  contrary  to 
the  hypothesis  that  M  is  normally  ordered. 

Two  different  segments  of  a  normally  ordered  aggregate  cannot  be  similar. 
For  one  of  these  segments  is  a  segment  of  the  other. 

The  aggregate  of  which  the  elements  are  the  segments  of  a  normally  ordered 
aggregate  M,  can  be  so  ordered  that  it  is  similar  to  M. 

For  each  element  of  M  corresponds  to  a  single  segment  of  M,  arid  thus 
the  aggregate  of  segments  can  be  so  ordered  as  to  be  similar  to  M. 

There  is  only  one  mode  of  putting  the  elements  of  two  similar  normally 
ordered  aggregates  into  correspondence,  so  that  the  relative  orders  of  the 
elements  are  unaltered  in  the  correspondence. 

For  if,  in  two  modes  of  placing  the  aggregates  in  correspondence,  two 
elements  /,  /',  of  one  aggregate  M,  correspond  to  one  element  e  of  the  other 
M' ,  the  segments  of  M  determined  by/*/'  are  each  similar  to  the  segment  of 
M'  determined  by  e\  but  it  has  been  shewn  to  be  impossible  that  Mean  have 
two  different  segments  which  are  similar  to  one  another. 

A  segment  of  one  of  two  normally  ordered  aggregates  is  similar  to  at 
most  one  segment  of  the  other  aggregate. 

If  8,  S'  are  similar  segments  of  two  normally  ordered  aggregates  M,  M', 
then,  to  every  .smaller  segment  Si<S,  of  M,  there  corresponds  a  similar 
segment  $/  <  S',  of  M'. 

If  Si,  $2  are  two  segments  of  the  normally  ordered  aggregate  M,  and 
Si,  8.2'  are  two  similar  segments  of  a  normally  ordered  aggregate  M',  then  if 
Si  <Si,it  follows  that  #2'  <  S,. 
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If  a  segment  8,  of  M,  is  not  similar  to  any  segment  of  another  normally 
ordered  aggregate  M',  then  no  segment  8'  >  8,  of  M,  is  similar  to  any  segment 
of  M' ,  nor  to  M '  itself;  and  the  same  holds  of  M  itself. 

//  M,  M',  two  normally  ordered  aggregates,  are  so  related  that,  to  any 
segment  of  either,  there  corresponds  a  similar  segment  of  the  other,  then  M  —  M'. 

Any  element  e,  of  M,  determines  a  segment  of  M  which  corresponds  to  a 
similar  segment  of  M'.  Let  this  latter  be  determined  by  an  element  e,  of 
M ' ;  we  then  take  e  to  correspond  to  e.  To  every  element  of  M  we  therefore 
find  a  corresponding  element  of  M',  and  it  is  seen  by  applying  the  foregoing 
theorems  that  the  relative  order  of  the  elements  is  preserved. 

168.  If  two  normally  ordered  aggregates  M,  M'  are  so  related  that,  (1)  to 
every  segment  S,  of  M,  there  corresponds  a  similar  segment  8',  of  M',  and  (2) 
at  least  one  segment  of  M'  exists,  to  which  there  is  no  corresponding  similar 
segment  of  M;  then  there  exists  a  definite  segment  $/,  of  M',  such  that  $,'  —  M. 

Consider  all  those  segments  of  M'  which  do  not  correspond  to  similar 
segments  of  M.  Among  these,  there  must  be  one  $/,  which  is  the  least  of 
all ;  this  follows  from  the  fact  that  the  elements  which  determine  these 
segments  of  M'  form  an  aggregate  which  has  a  lowest  element,  and  this 
lowest  element  determines  the  segment  $/.  Every  segment  of  M'  which  is 
greater  than  $,'  is  such  that  there  exists  no  corresponding  similar  segment 
of  M ;  but  every  segment  of  M'  which  is  less  than  $/  has  a  corresponding 
similar  segment  of  M.  Since,  to  every  segment  of  M,  there  corresponds  a 
similar  segment  of  $/,  and,  to  every  segment  of  $/,  there  corresponds  a  similar 
segment  of  M,  it  follows  that  M—S^. 

If  the  normally  ordered  aggregate  M'  has  at  least  one  segment  to  which 
there  corresponds  no  similar  segment  of  M,  then,  to  every  segment  of  M,  there 
corresponds  a  similar  segment  of  M'. 

Let  81  be  the  smallest  segment  of  M'  to  which  there  corresponds  no 
similar  segment  of  M.  If  there  exist  segments  of  M  to  which  no  corre- 
sponding similar  segments  of  M'  exist,  let  $,  be  the  smallest  of  all  such 
segments  of  M.  To  every  segment  of  Sl  there  corresponds  a  similar  segment 
of  81,  and  conversely ;  hence  Sl  —  $/,  which  is  contrary  to  the  hypothesis 
that  there  exists  no  segment  of  M  which  is  similar  to  $,'. 

If  M,  M'  are  any  two  normally  ordered  aggregates,  then  either  (1)  M  and 
M'  are  similar,  or,  (2)  there  exists  a  segment  S',  of  M' ,  which  is  similar  to  M, 
or,  (3)  there  exists  a  segment  S,  of  M,  which  is  similar  to  M' ',  and  these 
possibilities  are  mutually  exclusive. 

The  following  four  possibilities  may  be  contemplated,  as  regards  the 
relation  of  M  to  M' : 

(1)  To  every  segment  of  either  M  or  M'  there  corresponds  a  similar 
segment  of  the  other  aggregate. 
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(2)  To  every  segment  of  M  there  exists  a  corresponding  similar  segment 
of  M'\  but  there  is  at  least  one  segment  of  M'  to  which  no  similar  segment  of 
M  corresponds. 

(3)  To  every  segment  of  M '  there  corresponds  a  similar  segment  of  M ; 
but  there  is  at  least  one  segment  of  M  to  which  no  similar  segment  of  M' 
corresponds. 

(4)  There  is  at  least  one  segment  of  M  to  which  no  similar  segment  of 
M'  corresponds,  and  also  at  least  one  segment  of  M'  to  which  no  similar 
segment  of  M  corresponds. 

It  has  been  shewn  that  (4)  is  impossible.  In  the  case  (1),  it  has  been 
proved  that  M  —  M'.  In  the  case  (2),  it  has  been  shewn  that  a  definite 
segment  Si,  of  M',  exists,  such  that  Si'  —  M;  and  in  the  case  (3),  that  there 
is  a  definite  segment  Si,  of  M,  such  that  Si  —  M' 

It  is  impossible  that,  at  the 'same  time  M  —  M',  and  also  M—S^;  for,  in 
that  case,  M'  —  Si';  and  it  has  been  shewn  to  be  impossible  that  M'  should  be 
similar  to  one  of  its  own  segments. 

It  is  also  impossible  that  M  —  Si,  and  also  M'  —  Si',  for  there  must  then 
exist  a  segment  of  Si  which-  is  similar  to  Si,  and  therefore  to  M' ';  but  this  is 
contrary  to  the  theorem  that  a  normally  ordered  aggregate  cannot  be  similar 
to  one  of  its  segments. 

If  any  part  of  M  is  such  that  that  part  is  not  similar  to  any  segment  of  M, 
then  that  part  is  similar  to  M  itself. 

Any  part  MI,  of  M,  is  normally  ordered  ;  if  then  M±  be  similar  neither  to  M 
nor  to  any  segment  of  M,  there  must  exist  a  segment  <&  MI,  of  M : ,  which  is 
similar  to  M ;  and  MI  is  a  part  of  that  segment  of  M  which  is  determined 
by  the  same  element  that  determines  the  segment  MI,  of  MI.  Therefore  3/i 
would  be  similar  to  "a  part  of  one  of  its  segments,  which  has  been  shewn  to  be 
impossible. 

THE  THEORY  OF  ORDINAL  NUMBERS. 

169.  The  order-type  M,  of  a  normally  ordered  aggregate  M,  is  said  to  be 
the  ordinal  number  which  belongs  to  M ;  all  similar  normally  ordered  aggregates 
have  consequently  the  same  ordinal  number. 

If  M,  M'  are  two  normally  ordered  aggregates  such  that  M  has  a  segment 
which  is  similar  to  M',  whilst  M'  has  no  segment  which  is  similar  to  M,  then 
the  ordinal  number  a  =  M  is  said  to  be  greater  than  the  ordinal  number 
ft  =  M' ;  and  this  relation  is  denoted  by  a  >  ft.  If  M  has  no  segment  similar 
to  M',  but  M'  has  a  segment  similar  to  M,  the  ordinal  number  a.  is  said  to  be 
less  than  ft,  and  the  relation  is  denoted  by  a  <  ft. 
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It  follows  from  these  definitions,  in  conjunction  with  the  theorem  of  §  168, 
that,  if  a,  £  are  any  two  ordinal  numbers  whatever,  they  satisfy  one,  and  one 
only,  of  the  relations  a  =  #,  a>/8,a</9;  and  that  if  a  >  ft,  then  £  <  a. 

Further  it  is  seen  that,  if  a  <  ft  and  ft  <  7,  then  a  <  7 ;  hence  the  aggregate 
of  all  ordinal  numbers  is  a  simply  ordered  aggregate,  when  arranged  in  such 
a  manner  that  any  one  or,  which  has  been  defined  as  less  than  another  one  ft, 
precedes  it. 

The  sum  a.  +  ft  of  two  ordinal  numbers  is,  in  accordance  with  the  general 
definition  of  the  sum  of  two  order-types,  the  order-type  of  the  normally  ordered 
aggregate  (M,  N),  where  M,  N  are  two  normally  ordered  aggregates  such  that 
a  =  M,ft  =  ft. 

Since  M,  N  both  contain  no  part  of  type  *<u,  the  same  is  true  of  (M,  N). 
Hence  the  aggregate  (M,  N)  is  normally  ordered ;  and  thus  a  +  ft  is  an  ordinal 
number. 

Since  M  is  a  segment  of  (M,  N),  we  see  that  a  <  a  +  ft. 

N  is  &  remainder  of  (M,  N)  determined  by  the  lowest  element  of  N, 
hence  N  may  be  similar  to  (M,  N) ;  or,  if  not,  it  is  similar  to  a  segment  of 
(M,N):  thus  either /3  =  a +  /3,  or  ft<a+ft. 

The  addition  of  ordinal  numbers  obeys  the  associative  law,  but  not  in 
general  the  commutative  law  ;  thus  (a  4-  ft)  -f  7  =  a  +  (ft  4-  7),  but  a  +  /3  is  in 
general  not  the  same  as  ft  +  a. 

170.  The  product  a .  ft,  of  two  ordinal  numbers,  is,  in  accordance  with  the 
definition  of  §  159,  the  order- type  of  the  aggregate  obtained  by  substituting 
for  each  element  of  an  aggregate  of  order-type  ft,  an  aggregate  of  order-type  a. 
In  accordance  with  the  theorem  of  §  166,  the  aggregate  thus  obtained  is 
normally  ordered,  and  of  type  dependent  only  on  a  and  ft. 

In  general  a .  ft  is  not  equal  to  0 .  a. 

It  is  easily  seen  that  a .  ft  >  a,  provided  /3  >  1 ;  and  that,  if  a/3  =  ay,  then 
ft  =  y. 

If  a,  ft  are  two  ordinal  numbers  such  that  a<  ft,  there  exists  an  ordinal 
number  7  such  that  a  +  7  =  ft ;  and  this  number  7  is  defined  to  be  ft  —  a. 

For,  if  M  =  ft,  there  is  a  segment  of  M  which  may  be  denoted  by  MI,  such 
that  Jif,  =  a ;  let  then  M  =  (MltS),  therefore  M=M1  +  S,  and  ft-a  =  S. 

171.  Let  &,  ft.2,  ...  ftn,  ...  denote  a  simple  sequence  of  ordinal  numbers, 
and  suppose  Mlt  Mz,  ...  Mn,  ...  are  aggregates  of  which  the  order-types  are 
respectively  the  numbers  of  the  sequence.    The  aggregate  (Mi,  M.>,  ...  Mn,  ...), 
which  is  obtained  by  replacing  each  element  of  the  normally  ordered  aggregate 
(1,  1,  1,  ...),  of  type  a),  by  a  normally  ordered  aggregate,  is,  in  accordance  with 
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the  theorem  of  §  166,  itself  normally  ordered ;  and  its  type  defines  the  sum 
Pi  +  ftz  +  •  •  •  +  fin  +  •  -  -  =  ft.  If  «-rt  denotes  the  sum  &  +  ft%  +  . . .  +  @n,  we  see 
that  nn  —  (Ml}  J/2,  ...  Mn)  ;  and  it  is  clear  that  an+l  >an:  hence 

/3j  =  «],  $2  =  «2  —  «i,  •••»  ftn  —  <xn  —  «n-i- 

It  will  now  be  shewn  (1)  that  ft  >  an,  for  every  value  of  n  ;  and  (2)  that,  if  ft' 
is  any  ordinal  number  <  ft,  there  is  some  definite  value  of  n  such  that  an, 
ttn+u  ...  are  all  >  ft'. 

(1)  follows  from  the  fact  that  each  an  is  the  ordinal  number  of  a  segment 
of  (Mi,  Mo,  ...  Mn,  ...)  of  which  ft  is  the  ordinal  number. 

To  prove  (2),  we  observe  that  a  segment  of  (M1}  M2,  ...  Mn,  ...)  exists,  of 
which  ft'  is  the  ordinal  number,  and  therefore  the  element  which  determines 
this  segment  must  belong  to  one  of  the  aggregates  M1}  M2,  ...  Mn,  •••,  say  J/ni. 
It  follows  that  the  segment  is  also  a  segment  of  (M1}  M2,  ...  Mni) ;  and  there- 
fore, ft'  <  «ni,  or  an  >  ft',  n  being  ^  n^ 

It  has  thus  been  proved  that  ft  is  the  ordinal  number  which  immediately 
follows  all  the  ordinal  numbers  «i,  or2,  ...  an,  ...  ;  and  it  may  be  spoken  of  as 
the  limit  of  the  sequence  a1}  a.2)  ...  «„,  .... 

Thus  every  ascending  sequence  a1}  a2, . . .  «n, . . .  of  ordinal  numbers  determines 
a  limiting  number  ft=  lira  an,  which  immediately  follows  all  the  numbers  of  the 
sequence. 

THE  ORDINAL  NUMBERS  OF  THE  SECOND  CLASS. 

172.  Every  finite  ordered  aggregate  is  normally  ordered,  and  its  order-type 
is  the  ordinal  number  of  the  aggregate.  The  finite  ordinal  numbers  may  be 
spoken  of  as  the  ordinal  numbers  of  the  first  class ;  to  each  such  ordinal  number 
there  corresponds  a  single  cardinal  number,  and  the  properties  of  the  finite 
ordinal  numbers  are  identical  with  those  of  the  finite  cardinal  numbers,  the 
terms  ordinal  and  cardinal  simply  defining  the  two  uses  of  the  same  number. 
In  the  case  of  transfinite  aggregates  there  is  no  such  identity  between  ordinal 
and  cardinal  numbers ;  in  fact  the  arithmetic  of  the  one  kind  of  numbers  is 
essentially  different  from  that  of  the  other  kind. 

Corresponding  to  a  single  transfinite  cardinal  number  there  is  an  infinity 
of  transfinite  ordinal  numbers ;  all  those  transfinite  ordinal  numbers  which 
correspond  to  aggregates  that  have  one  and  the  same  cardinal  number  a  are 
said  to  form  a  class  Z(a),  the  class  of  normal  order- types  which  have  the 
cardinal  number  a. 

The  ordinal  numbers  of  all  those  order-types  which  have  the  same  cardinal 
number  K0,  as  the  aggregate  of  finite  numbers,  are  said  to  be  of  the  second 
class  £(N0). 

The  ordinal  number  w  •=•  lim  n,  and  is  the  smallest  number  of  the  second  class 
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If  M  denotes  the  aggregate  (w,,  w,,  ...  mn,  ...),  then  M=a),  and  &>  =  N0. 
Any  number  ft,  which  is  <  o>,  must  be  the  order-type  of  a  segment  of  M,  and 
M  has  only  segments  (wi,,  ™2,  ...  mn)  with  finite  ordinal  numbers  n ;  thus  ft 
must  be  a  finite  number ;  and  therefore  the  only  ordinal  numbers  <  o>  are 
finite  ones. 

Every  number  a,  of  the  second  class,  has  a  number  o+l  immediately 
following  it. 

For  if  a=M,  a  =  N0,  we  have  a+l=(M,  e),  where  e  is  a  new  element; 
and  since  M  is  a  segment  of  ( M,  e),  we  have  a  -h  1  >  a.  Also 


It  has  thus  been  shewn  that  a  + 1  is  a  number  of  the  second  class.  Every 
number  <  a+  1,  is  the  order- type  of  a  segment  of  (M,  e)',  and  such  segment 
can  only  be  M,  or  a  segment'  of  M ;  hence  no  number  <  a.  +  1  is  >  a  :  therefore 
*a  +  1  is  the  next  number  greater  than  a. 

If  di,  «2>  •••  an,  •••  is  any  sequence  of  numbers  of  the  second  class,  there  is 
a  number  lim  an,  also  of  the  second  class,  which  is  the  smallest  number  that  is 

»~00 

greater  than  every  number  an  of  the  sequence. 

If,  as  in  §  171,  we  write  $,  =  alt  /32  =  «2  —  ffi ,  •••  $n  =  crn  —  an_i,...,  then  if 
$n  =  ftn,  we  have  lim  an  =  (Glt  G2,  ...  Gn,  . . .) ;  and  this  number  lirn  an  has  been 

M~OO  n~oo 

shewn  to  be  the  smallest  number  which  is  >  an  for  every  value  of  n.  To  shew 
that  this  number  lim  an  is  of  the  second  class,  we  have,  since  ftn  £  K0,  for  every 

w~x 

value  of  n, 


lim  «„  ^  NO  .  K0  =  KO  ;  and,  since  lim  .  «n  is  not  finite,  it  must  therefore  = 


sequences  [an],  {«'„},  o/  numbers  of  the  second  class,  have  the  same 
limiting  number,  when,  and  only  when,  the  sequences  are  related  to  one  another, 
in  accordance  with  the  definition  o/§  160. 

Let  ft,  7  be  the  two  limiting  numbers,  and  first  assume  that  the  sequences 
are  related  to  one  another.  If  /3  <  7,  then  for  some  value  of  n,  a'n  >  ft, 
a'M+1  >  ft,  ...',  and  hence,  for  some  value  of  n',  we  must  have  an-  >  ft,  «,v-+1  >  ft,  .  .  .  , 
which  is  inconsistent  with  ft  being  the  limit  of  the  sequence  {an}. 

If  we  assume  @  =  <y,  then,  since  an<y,  for  some  fixed  number  r  we  must 
have  a'r  >  atl,  a',.+1  >  «„,  ...;  and  similarly,  since  «'„<  ft,  for  some  fixed  number 
s  we  must  have  as  >«'„,  a,+J  >  a'n,  •••;  hence  the  two  sequences  are  related  to 
one  another. 

If  n  is  a  finite  ordinal  number,  and  a  a  number  of  the  second  class,  then 
n  +  a  =  a,  and  hence  a  —  n  =  a.  If  a  be  a  non-limiting  number,  it  is  however 
convenient  to  denote  the  number  immediately  preceding  a  by  [a  —  1]. 
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For  n  +  (a  =  a>, 


since  n  +  w  =  (el,eti  ...  <?„;/!, /2,  .../«,  ...) 


where  ^i  =  e, ,  #2  =  e2,  ...  gn  =  en,  gn+i  =/i,  #n+2  =/2, 

Further,  n  +  a.  =  ??  +  <u  +  (a  —  &>)  =  &>  +  (a  -  &>)  =  a. 

If  ft  is  a  finite  number,  then  no>  =  to.  This  is  seen  by  taking  an  aggregate 
of  the  type  w,  and  replacing  each  element  by  n  new  elements ;  then  it  is  clear 
that  the  new  aggregate  is  also  of  type  co. 

It  can  easily  be  proved  that  (a  +  n)  co  =  ctco,  where  a  is  of  the  second  class, 
and  n  of  the  first  class. 

173.  If  a.  is  any  number  of  the  second  class,  then  those  numbers  of  the  first 
and  second  classes  which  are  less  than  a  form  a  normally  ordered  aggregate 
of  type  a,  when  they  are  arranged  in  order  as  defined  above. 

If  M  is  an  aggregate  such  that  M=a,  and  if  a'  is  an  ordinal  number  <  a, 
then  there  is  a  segment  M' ,  of  M,  such  that  M'  =  a' ;  and,  conversely,  every 
segment  of  M  determines  a  number  of  the  first,  or  the  second,  class  which  is  <  a. 
For,  since  M  =  K0,  any  segment  M'  must  have  either  a  finite  cardinal  number, 
or  else  must  have  N0  for  its  cardinal  number.  If  el  is  the  lowest  element  of  M, 
a  segment  M'  is  determined  by  an  element  e  >  el ;  and  every  element  e,  of  M, 
determines  a  segment  M'.  If  e',  e"  are  two  elements  of  M,  both  >  elt  and  M', 
M"  the  segments  of  M  determined  by  these  elements,  and  a,  a"  their  order- 
types,  then  if  e  <  e",  it  follows,  by  §  167,  that  M'  <  M";  and  hence  a  <  a". 

If  then  M  =  (e1 ,  M'),  and  to  the  element  e,  of  M',  we  make  the  element  a, 
of  {«'},  correspond,  the  two  aggregates  M'  and  {«'}  are  placed  in  the  relation 
of  similarity.  It  has  thus  been  shewn  that  {a'|  =  M' ';  now  M  =  a  —  1  =  a, 
hence  {a'}  =  a. 

Since  a  =  K0.  we  have  (oc'}  =  K0;  and  therefore  the  following  theorem  is 
established : 

The  aggregate  [a'}  of  all  those  numbers  a.',  of  the  first  and  second  classes, 
which  are  ordinally  smaller  than  a  number  a,  of  the  second  class,  has  the 
cardinal  number  N0. 

174.  Every  number  a,  of  the  second  class,  is  either  (1)  such  that  it  is  obtained 
from  a  number  of  the  same  class  immediately  preceding  it,  by  the  addition  of 
unity,  or  else,  (2)  such  that  there  exists  a  sequence  {«„},  of  numbers  of  the  first 
or  second  class,  having  a  for  its  limit. 

Let  a  =  M;  then,  if  M  has  a  highest  element  e,  M  =  (M',  e),  where  M'  is 
the  segment  of  M  determined  by  e ;  thus  M=  M'  +  1 ,  or  a  =  [a  -  1]  +  1. 
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If  M  has  no  highest  element,  then  the  aggregate  jo'},  of  all  numbers  <  a, 
which  is  similar  to  M,  has  no  greatest  number ;  and  this,  aggregate  {«'},  being 
of  cardinal  number  N0,  can  be  re-arranged  as  an  aggregate  {o'n}  of  type  <o.  In 
this  aggregate  {o'n|,  some  of  the  numbers  a/,  or/,  ...  will  in  general  be  less 
than  «/,  but  others  must  be  greater  than  a/;  for  a,'  cannot  be  greater  than 
all  the  other  numbers  of  the  aggregate,  there  being  in  jo'}  no  greatest  number. 
Let  o'pj  be  that  number  of  {o'n},  with  the  smallest  index,  such  that  a'Pt  >  a/; 
similarly  let  a'Pt  be  that  number,  with  the  smallest  index,  such  that  a'Pa  >  a'p2, 
and  so  on.  We  have  now  an  infinite  sequence 

«/>  a'j>2>  «Vs>  ••• 

of  numbers,  such  that  they  are  in  ascending  order,  and  such  that  their  indices 
are  also  in  ascending  order.  Since  n  £  pn,  we  have  an  =  <*'Pn ',  hence,  for  every 
number  a'  which  is  less  than  a,  there  exists  a  number  a'Pn  which  is  >  a. 
Since  o  is  the  number  which  follows  next  after  all  the  numbers  a,  it  is  also 
the  number  which  follows  next  after  all  the  numbers  «/,  aV2,  a'p3,  ...,  which 
we  may  write  as  a,,  a2>  <*3>  •••  «n>  •••;  thus  a  =  lim  an. 

M~-ac 

It  has  thus  been  shewn  that  there  are  two  kinds  of  numbers  of  the  second 
class,  (1)  those  which  have  an  immediate  predecessor  in  the  aggregate  of  all 
such  numbers  arranged  in  ascending  order,  and  (2)  those  which  have  no  such 
immediate  predecessor,  and  are  called  limiting  numbers. 

A  number  of  the  first  kind  is  obtained  by  means  of  the  first  principle  of 
generation,  (see  §  65),  from  the  immediately  preceding  number. 

A  number  of  the  second  kind  is  obtained  by  the  second  principle  of 
generation,  as  the  number  a  which  next  follows  all  the  numbers  on,  of  some 
sequence  [on}  of  numbers  of  the  second  class. 

THE  CARDINAL  NUMBER  OF  THE  SECOND  CLASS  OF  ORDINALS. 

175.  The  totality  of  the  numbers  of  the  second  class,  arranged  in  ascending 
order,  forms  a  normally  ordered  aggregate. 

If  Aa  denotes  the  ordered  aggregate  of  all  those  numbers  of  the  second 
class  which  are  less  than  the  given  number  «,  then  Aa  is  normally  ordered, 
and  of  type  a  —  w.  For  the  aggregate  {a'}  of  numbers  of  the  first  and  second 
classes,  which  consists  of  {»]  and  Aa,  has  been  shewn,  in  §  173,  to  be  normally 
ordered,  and  thus 

Ja'}  =  ({n],  A.), 

hence  [a'}  =  \n]  +  Aa,  or  Aa  =  a  —  w. 

Let  M  denote  any  part  of  the  aggregate  {a},  of  all  the  numbers  of  the  second 
class,  such  that,  in  [a],  there  are  numbers  which  are  greater  than  all  the  numbers 
in  M;  and  let  o0  be  one  such  number :  then  M  is  a  part  of  Aao+l,  which  is  such 
that  all  the  numbers  of  M  are  less  than  at  least  one  number  o0,  of  Aa  +l.  Since 
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Aao+l  is  normally  ordered,  there  must  be  a  number  a,  of  Aao+1,  being  itself 
consequently  a  number  of  jo},  which  is  the  next  greater  number  than  all  the 
numbers  of  M.  Thus,  since  {«}  has  a  lowest  number  CD,  the  conditions  are  satisfied 
that  (a}  is  a  normally  ordered  aggregate. 

It  follows,  by  applying  the  results  of  §  165,  that  : 

Every  part  of  the  aggregate  {«},  of  all  numbers  of  the  second  class,  has  a 
least  number. 

Every  such  part,  in  order,  is  normally  ordered. 

It  will  now  be  shewn  that  the  aggregate  [a],  of  all  the  numbers  of  the  second 
class,  has  a  cardinal  number  greater  than  N0. 

Consider  the  two  aggregates  {«}  and  {n},  where  n  denotes  the  finite  integers. 
The  aggregate  fo|  has  a  part,  viz.  that  consisting  of  w  +  1,  w  +  2,  ...  w  +  n,  ...  , 
that  is  equivalent  to  {n}:  It  has  been  shewn  in  §  151  that  every  part  of  {n}  is 
either  finite,  or  that  it  is  equivalent  to  {n}  itself.  In  order  to  shew  that  the 
cardinal  number  of  {a}  is  greater  than  K0,  it  is  therefore  only  necessary  to  shew 
that  it  is  not  equal  to  N0. 

If  {aj  =  N0)  the  numbers  of  [a]  could  be  arranged  in  the  form 

7i>  7a.   •••  7n,   •••> 

of  type  CD,  in  which  of  course  the  order  would  not  be  that  of  generation.  Starting 
from  7!  ,  let  yPi  be  the  7,  with  the  smallest  index,  which  is  such  that  yp2  >  ^  ; 
then  let  yPa  be  that  7,  with  the  smallest  index,  such  that  yPi  >  <yPt;  and  so  on. 
We  obtain  in  this  manner  a  sequence 

7i>  7i*  >  7ft  »  ••• 

in  ascending  order,  the  indices  1,  p2,  p3,  ...  being  also  in  ascending  order.  In 
accordance  with  §  171,  there  must  be  a  definite  number  8  of  the  second  class, 
namely  8  =  lim  <ypn,  such  that  S  >  yPn,  for  every  pn,  and  consequently  such  that 

tt~x 

S  is  greater  than  every  jn  ;  but  this  is  impossible,  since  {yn}  contains  every 
number  of  the  second  class;  hence  {a}  cannot  equal  X0,  and  is  therefore  >  N0. 

Every  part  of  the  aggregate  {a},  of  all  numbers  of  the  second  class,  has  either 
the  cardinal  number  of  {a},  or  else  the  cardinal  number  K0,  unless  it  is  a  finite 
part. 

Every  such  part,  when  the  elements  of  it  are  in  order  of  generation,  being 
part  of  the  normally  ordered  aggregate  {a.},  is  either  similar  to  [a],  or  else  to 
some  segment  Aao,  of  {a};  hence  the  cardinal  number  is  either  that  of  {a},  or  is 


Aa  =  a0  —  «D,  and  this  last  is  either  N0,  or  is  finite. 

The  cardinal  number  of  [a]  is  the  cardinal  number  next  greater  than  K0. 

If  there  exist  a  cardinal  number  less  than  -or},  and  greater  than  N0,  it 
must  be  the  cardinal  number  of  some  part  of  {a}  :  but  it  has  been  shewn  that 
every  such  part  of  {otj  has  either  the  cardinal  number  of  (a},  or  is  N0,  or  is  finite. 

The  cardinal  number  of  {a],  or  of  Z  {K0},  is  denoted  by  X,. 
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THE  GENERAL  THEORY  OF  ALEPH-NUMBER8. 

176.  It  has  now  been  shewn  that  the  ordinal  numbers  of  the  second  class, 
in  their  order  of  generation,  form  a  normally  ordered  aggregate,  of  which  the 
cardinal  number  is  N1(  the  next  cardinal  number  to  N0.  The  ordinal  type  of 
the  normally  ordered  aggregate  {a}  of  all  numbers  of  the  second  class,  is  a 
number  H,  which  is  the  smallest  number  of  the  third-class.  In  analogy  with 
the  definition  of  the  second  class,  and  in  accordance  with  what  Cantor  has 
denominated  the  principle  of  limitation  (Hemmungsprinzip),  the  third  class  is 
taken  to  include  all  the  ordinal  types  of  normally  ordered  aggregates,  of  which 
the  cardinal  number  is  Kl(  and  this  class  is  consequently  denoted  by  Z(^). 
The  number  H,  which  is  the  order-type  of  all  the  numbers  of  the  first  and 
second  classes,  in  the  order  of  generation,  and  which  comes  after  all  those 
numbers,  is  not  the  limiting  element  of  any  sequence  a^a^,  ...an, ...  of  numbers 
of  the  second  class ;  for,  as  we  have  seen,  every  such  sequence  has  a  limiting 
number  within  the  second  class.  From  the  point  of  view  adopted  by  Cantor 
in  his  earlier  writings,  and  explained  in  §  65,  in  which  the  successive  ordinal 
numbers  are  regarded  as  successively  generated,  in  accordance  with  postulated 
principles  of  generation,  the  number  fl  must  be  regarded  as  generated  by  a 
third  principle  of  generation,  different  from  the  two  principles  of  generation 
employed  in  the  case  of  the  numbers  of  the  first  and  second  classes.  This  third 
principle  of  generation  affirms  that  every  set  of  ordinal  numbers  similar  to  the 
aggregate  of  all  the  numbers  of  the  first  and  second  classes,  in  their  order  of 
generation,  is  immediately  succeeded  by  a  new  number,  ordinally  greater  than 
all  the  numbers  of  the  set,  so  that  every  number  which  is  less  than  this  new 
number  is  also  less  than  some  of  the  numbers  of  the  set.  When,  proceeding 
from  fl,  the  numbers  fi  +  1 ,  fl  +  2,  ...ft-ffl,  ...  are  formed,  all  three  principles 
of  generation  will  be  required,  in  forming  the  numbers  of  the  third  class. 

From  the  point  of  view  adopted  later  by  Cantor,  and  explained  in  the 
present  chapter,  H  is  simply  defined  to  be  the  order-type  of  the  totality  of  the 
numbers  of  the  first  and  second  classes,  in  their  normal  order.  The  numbers 
higher  than  H  are  then  defined  in  the  same  manner,  each  one  as  the  order-type 
of  the  totality  of  the  preceding  numbers  in  normal  order. 

The  existence  of  a  whole  series  of  classes  of  order- types  of  normally  ordered 
aggregates,  i.e.  of  ordinal  numbers,  has  been  speculatively  asserted  by  Cantor*, 
who  has  however,  in  his  published  works,  confined  his  detailed  investigations 
to  numbers  of  the  first  and  second  classes. 

To  each  of  the  successive  classes  of  numbers,  there  corresponds  a  single 
cardinal  number,  that  of  the  totality  of  the  ordinal  numbers  up  to,  and  including 
all  the  ordinal  numbers  of  that  class.  The  first  ordinal  number  of  each  class 
is  the  order-type  of  all  the  numbers  of  the  preceding  classes,  in  their  order  of 

*  See  Math.  Annalen,  vol.  xxi,  pp.  587,  588,  also  vol.  xi.vi,  p.  4^5. 
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generation.  A  new  principle  of  generation  is  required  for  the  first  number 'of 
each  new  class,  since  that  number  cannot  be  regarded  as  the  limiting  number 
of  any  sequence  of  which  the  ordinal  number  is  less  than  that  of  the  number 
in  question.  All  the  successive  principles  of  generation  are  however  included 
in  the  one  principle,  that  an  aggregate  of  normally  ordered  ordinal  numbers 
has  itself  an  order- type  which  is  a  new  number ;  and  thus,  from  this  point  of 
view,  all  the  principles  of  generation,  from  the  second,  onwards,  are  replaced 
by  this  one  principle. 

177.    In  accordance  with  this  theory,  there  exists  an  ordered  aggregate 

1,  2,  3,  ...  11,  ...  <w,  (o  +  1,  ...  ft>2,  ...  to'0,  ...  O,  H  +  l,  ...  7,  ... 
which  contains  every  ordinal  number  of  every  class ;  and  there  also  exists  a 
similar  aggregate 

1,  2,  3,  ...  n,  ...  No,  K,,  N2,  ...  **„,  ...  «„,  ...  Nn,  ...  Ny,  ... 
of  cardinal  numbers,  each  element  of  which  is  the  cardinal  number  of  a  single 
class  of  numbers  of  the  first  aggregate. 

That  the  first  of  these  aggregates  is  normally  ordered  may  be  seen  by 
remarking  that,  if  it  contained  any  part,  of  the  type  *&>,  then  such  part  would 
also  be  part  of  the  normally  ordered  aggregate  formed  by  the  numbers 
1,  2,  3,  ...  CD,  ...  a;  where  a  is  the  highest  number  in  the  hypothetical  part,  of 
type  *&).  This  is  impossible,  and  hence  the  first  aggregate  is  normally  ordered. 

Cantor  has  given  a  proof  that  tf0  is  less  than,  or  equal  to,  the  cardinal 
number  of  any  transfinite  aggregate,  and  that  tfj  is  the  cardinal  number  next 
greater  than  N0.  In  his  proof  the  possibility  of  two  cardinal  numbers  being 
incomparable  was  disregarded.  Accordingly,  the  first  theorem  in  §  151  has 
been  made  to  precede  Cantor's  theorem,  which  has  however  also  been  given. 
A  proof  has  been  given  by  Jourdainf,  that  N2  is  the  next  greater  cardinal 
number  than  Xj ,  who  has  also  considered,  in  some  detail,  the  ordinal  numbers  of 
the  third  class,  and  has  given  indications  of  extension  to  the  higher  classes. 

The  question  whether  every  transfinite  cardinal  number  is  necessarily  an 
Aleph-number,  which  is  equivalent  to  the  question  whether  every  aggregate  is 
capable  of  being  normally  ordered,  has  engaged  a  considerable  amount  of 
attention.  That  the  answer  should  be  an  affirmative  one,  was  regarded 
by  Cantor  as  probable.  Some  discussion  of  attempts  which  have  been  made 
to  settle  this  matter,  will  be  considered  in  §  203.  A  case  of  great  importance 
is  that  of  the  continuum,  which  is  defined  as  a  simply  ordered,  but  not  as  a 
normally  ordered,  aggregate.  No  proof  has  yet  been  discovered  of  the  cor- 
rectness of  Cantor's  view  that  c  =  tfj.  In  case  c  occurs  at  all  in  the  aggregate 
of  Aleph-n umbers,  the  continuum  is  capable  of  being  normally  ordered.  The 
possibility  has  also  been  contemplated  that  c  may  be  greater  than  all  the 
Aleph-numbers. 

|  See  Phil.  Mag.,  (6),  vol.  vn,  p.  294,  "On  the  transfinite  cardinal  numbers  of  number-classes  in 
general." 
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THE  ARITHMETIC  OF  ORDINAL  NUMBERS  OF  THE  SECOND  CLASS. 

178.  The  ordinal  numbers  of  the  second  class  have  been  defined  as  the 
order-types  of  normally  ordered,  enumerably  infinite,  aggregates;  and  the 
operations  of  addition  and  multiplication  have  been  defined  for  these  numbers, 
in  §§169  and  170.  It  now  remains  for  us  to  define  exponentials,  for  numbers 
of  this  class;  and  the  definition  is  founded  upon  the  following  theorem: 

If  £  is  a  variable  of  which  the  domain  consists  of  the  numbers  of  the  first 
and  second  classes,  including  zero,  and  if  7,  8  denote  two  constants  belonging 
to  the  same  domain,  such  that  B  >  0,  7  >  1,  then  there  exists  a  single-  valued 
determinate  function  /(£)>  which  satisfies  the  conditions 

(i)  /(o)=a. 

(2)  If  f  ,  f  '  are  any  two  values  of  £  such  that  f  <  f  ,  then/(f  )  <  /(£"). 

(3)  For  every  value  of  £,/(£+  1)  =  /(£)  .  7. 

(4)  If  {£„}  is  a  sequence,  of  which  £is  the  limiting  number,  then 

is  a  sequence,  of  which  /(£)  is  the  limiting  number. 

In  the  case   8  =  1,  the  function  /(£)  is  denoted  by  7*;  and  then 
satisfying  the  above  conditions,  defines  the  exponential  function  7*,  for  all 
numbers  7,  £  of  the  first  and  second  classes. 

To  prove  the  theorem,  in  the  first  place  we  have 


thus  /(I)  </(2)  </(3),  ...  ;  and  the  function  is  determined  for  every  £  <  &>. 
Next  assume  that  the  function  is  determined  for  every  £<  a,  a  number  of  the 
second  class.  If  a  is  not  a  limiting  number,  /(«)=/[«  —  1]  7  >/[a  —  1]; 
and  thus  /(a)  is  determined.  If  a  is  a  limiting  number,  and  is  preceded  by 
the  sequence  {«„},  then  {/(£n)}  is  a  sequence,  and/(a)=  lim/(an).  If  {«„'}  is 

n~<» 

another  sequence  such  that  a  =  lim  a,/,  then  the  two  sequences  {/(an){>  {/(«?/)! 

M~  » 

are  related  to  one  another,  and  therefore  have  the  same  limit;  and  thus  /(a) 
is  uniquely  determined.  /(£)  is  now  determined  for  every  £:  for  if  there  were 
values  of  a.  for  which  it  were  not  determined,  there  must  be  a  smallest  of 
such  values;  the  theorem  would  then  hold  for  £<  a,  but  not  for  £  =  a;  which 
is  contrary  to  what  has  been  proved  above. 

179.     If  a,  /3  are  numbers  of  the  first  or  second  class,  7a+^  =  7"  .  7^. 
The  function  </>  (£),  =  7°+^,  satisfies  the  conditions 
(1) 


(3)  *(£+!)  =  $(£).  7. 

(4)  If  {£n}  is  a  sequence  such  that  lim  i;n  =  g,  then  <f>  (£)  =  lim  0  (£„). 

n  ~  x  n  ~  *. 

H.  15 
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It  follows  by  §  178  that,  if  we  take  8  =  7",  then  <£(£)  =  7*7*;  hence  if 
£  =  /3,  we  have 


Again,  if  a,  /3  are  two  numbers  of  the  first  or  second  class, 


If  we  put  /(£)  =  7af,  we  find  by  applying  the  theorem  of  the  last  section, 
that/(f)  =  (7B)f,  where  7*  replaces  7. 

7  being  >  2,  it  can  be  proved  that,  for  every  |,  7^  ^  £.  The  theorem  holds 
for  f  =•  0,  £  =  1 ;  and  if  it  be  assumed  to  hold  for  all  values  of  If  which  are 
less  than  a  given  number  a,  then  it  holds  also  for  £  =  a. 

For,  first  let  a  be  not  a  limiting  number:  then,  if  [a  —  1]  ^  7[a~1],  we  have 

hence  7"  ~  [a  -  1]  +  [a  —  1]  [7  —  1]:  therefore  since  [a  —  1]  and  [7  —  1]  are  >  1, 
and  [a  —  1]  +  1  =  a,  we  have  7"  =  a.  If  a  is  a  limiting  number  =  lim  «„,  then 

since  an  £  yan,  we  have  lim  an  ^  lim  7a»,  or  a  ^  7*.     If  there  were  values  of  £ 

n  ~  ao  n  ~  oo 

such  that  ^>7f,  there  would  be  one  of  such  values  which  is  the  least  of  all; 
and  if  this  were  a,  then  £  ^  7^,  if  £<  a,  but  a  >  7°;  which  is  contrary  to  what 
has  been  proved  above. 

180.  Of  all  the  numbers  of  the  second  class,  the  smallest  ones  are  those 
which  are  algebraical  functions  of  &>,  of  the  form 

wn  .  pn  +  c0H~l .  pn_!  4-  . . .  +  <u  .  PI  +  Po, 
where  p0,  p},  ...  pn  are  finite  numbers.    If  we  write 

then  we  obtain  the  number  e0  =  lim  &>n.     This  number  e0  is  the  smallest  of  a 

W-oo 

species  of  numbers  of  the  second  class  which  are  characterised  by  the  property 
e  =  &)e,  and  which  Cantor  has  designated  e-numbers.  Cantor  has  shewn  that 
the  e-numbers  form  a  normally  ordered  aggregate,  of  type  fl,  and  therefore 
similar  to  the  whole  second  class  of  numbers.  He  has  further  shewn  that 
every  number  a,  of  the  second  class,  is  uniquely  re  presentable  in  the  form 

«  =  0)a»/C0  +  O)0'*!  +  ...   +  0)arKr, 

where  or0,  otj,  cr2, ...  a,,  are  numbers  of  the  first  or  second  class  which  satisfy  the 
conditions  «„  >  o^  >  «2  ...  >  ar  ^  0,  and  KO,  KI}  K»,  ...  K,.,  r  +  1,  are  numbers  of 
the  first  class  which  are  different  from  zero.  For  the  detailed  investigation 
of  the  normal  form,  and  for  that  of  the  special  class  of  e-numbers,  we  must 
refer  to  Cantor's  original  discussion*. 

*  Math.  Annalen,  vol.  XLIX,  pp.  235 — 246. 
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THE  THEORY  OF  OHDER-FUNCTIONS. 

181.  A  method  of  representation  of  any  mode  of  ordering  a  given 
aggregate  M  has  been  given  by  Bernstein*.  When  the  elements  of  the 
aggregate  are  numbers,  this  method  lends  itself  to  a  diagrammatic  repre- 
sentation of  the  aggregate,  as  ordered  in  any  particular  order-type. 

An  aggregate  M  is  ordered,  in  the  most  general  sense  of  the  term,  when 
it  is  known  as  regards  every  pair  of  elements  a,  6,  whether  a  $  6;  but  a 
particular  mode  of  ordering  the  aggregate  can  be  represented  by  means  of 
a  function  /(a,  6)  of  the  pairs  of  elements,  which  is  defined  by 

/(a,  6)  =  1,  if  a  <  6;  /(a,  b)  =  -  1,  if  a  >  b  ;  and  /(a,  a)  =  0. 
This  function /(a,  b)  may  be  denominated  an  order-function  of  the  aggregate 
M ;  and  there  is  one  order-function  for  each  possible  mode,  of  ordering  the 
aggregate.     The  function  must  satisfy  the  condition  that,  if  /(a,  b)  =f(b,  c), 
then  each  is  equal  to  f(a,  c). 

Two  order-functions  /i  (a,  b),  /2  (a,  b),  of  a  given  aggregate  M,  represent 
two  methods  of  ordering  the  aggregate  in  one  and  the  same  order-type, 
provided  there  exist  a  reversible  transformation  <£,  of  the  aggregate  M  into 
itself,  such  that/j  {<f>(a),  <j>(b)}  =f2(a,  b). 

All  those  order- functions  of  a  given  aggregate  M,  which  correspond  to  an 
arrangement  of  M  in  one  and  the  same  order-type,  constitute  a  family  of 
order- functions;  and  there  is  one  such  family  of  order-functions  corresponding 
to  each  order-type  in  which  the  given  aggregate  M  can  be  arranged. 

It  is  clear  that  the  order-functions  of  a  family  corresponding  to  M  form 
an  aggregate  with  the  same  cardinal  number  as  the  group  of  transformations 
of  M  into  itself. 

If,  in  particular,  the  aggregate  M  is  that  of  the  positive  integers,  then  a 
pair  of  elements  (a,i)  is  represented  by  a  cross-point  of  the  rectangular  trellis 
formed  in  the  positive  quadrant  by  drawing  all  the  straight  lines,  x  =  i,  y  =  i, 
for  positive  integral  values  of  i,  referred  to  rectangular  Cartesian  coordinates 
x,y. 

The  natural  order  of  the  numbers  1, 2,  3, ...  will  be  represented  by  f(x,  y}, 
defined  for  all  the  cross-points,  so  that/(#,  y)  =  l,  when  x  <  y,  and/(#,  y)  =  —  l, 
when  x  >  y,  and  such  that  f(x,  x)  =  0. 

Any  particular  modef  of  ordering  the  numbers  1,  2,  3, ...  will  be  represented 
by  marking  one  set  of  cross-points  -I- 1,  and  another  set  —  1,  those  on  the 
diagonal  x  =  y  being  marked  zero. 

*  See  his  Dissertation,  also  W.  H.  Youug,  on  "  Closed  sets  of  points  and  Cantor's  numbers," 
Pi-oc.  Lond.  Math.  Soc.  (2),  vol.  i. 

t  Some  examples  of  order-types  represented  in  this  manner  are  given  by  W.  H.  Young.  Proc. 
Lond.  Math.  Soc.  (2),  vol.  i,  p.  244. 

15—2 
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It  is,  however,  not  every  mode  of  so  marking  the  cross-points  that  repre- 
sents a  possible  ordering  of  the  aggregate.  That  a  mode  of  marking  may 
represent  a  possible  order,  two  conditions  must  be  satisfied.  First,  we  must 
have/(#,  y}  =  —f(y,  #);  and  thus  points  which  are  optical  images,  relatively  to 
the  diagonal  y  =  x,  must  be  marked  with  unities  of  opposite  sign.  Secondly, 
the  condition  that,  if  /(a,  6,)=/(6j,c),  then  each  =/(a,c),  must  be  satisfied. 
This  condition  may  be  expressed  as  follows: — Join  every  cross-point  which  is 
marked  + 1,  or  0,  with  every  other  such  cross-point,  then  the  resulting  figure 
may  be  called  the  positive  frame-work;  join  similarly  all  the  pairs  of  points 
marked  —  1,  0;  in  this  way  we  obtain  the  negative  frame-work.  Let  lines 
joining  the  two  pairs  of  points  (ac1}  y^,  (x2,  y2)  and  (scl}  y2),  (%z,  y\}  be  called 
conjugate  lines.  The  condition  which  must  be  satisfied  is  that  no  side  of  the 
positive  frame-work  can  be  conjugate  to  a  side  of  the  negative  frame-work. 
It  can  easily  be  seen  that  the  condition  so  stated  is  necessary  and  sufficient. 

THE  CARDINAL  NUMBER  OF  THE  CONTINUUM. 

182.  The  arithmetic  continuum  has  been  defined  as  an  aggregate  of  the 
order-type  6  (see  §  163),  and  it  is  thus  not  normally  ordered.  It  has  been  held 
by  Cantor*  that  this  aggregate,  and  perhaps  every  aggregate,  is  capable  of 
being  arranged  as  a  normally  ordered  aggregate;  but  no  universally  accepted 
proof  of  the  correctness  of  this  view  has  been  obtained.  If  the  continuum  be 
capable  of  arrangement  as  a  normally  ordered  aggregate,  its  cardinal  number 
c  must  be  identical  with  one  of  the  Aleph -numbers;  and  in  fact  Cantor  be- 
lieved that  c  =  {"?i,  the  cardinal  number  of  the  aggregate  of  all  the  order- types 
of  normally  ordered  enumerable  aggregates.  As  evidence  of  the  probable  truth 
of  this  view,  the  facts  may  be  cited  that  all  the  sets  of  points  which  have 
actually  been  defined  in  connection  with  the  theory  of  sets  of  points  have  one 
or  other  of  the  two  cardinal  numbers  N0  and  c,  and  that  no  such  set  of  points 
has  been  defined  of  which  it  is  known  that  the  cardinal  number  is  >  K0  and 
<  c.  This  negative  evidence  is  however  clearly  insufficient  to  settle  the  question 
whether  every  part  of  the  continuum  has  one  of  the  powers  N0  or  c,  a  question 
which  has  hitherto  defied  all  attempts  to  obtain  a  conclusive  answer.  It  has 
been  shewn  by  Konig  that  the  cardinal  number  of  the  continuum  cannot  be  an 
Aleph-number  of  which  the  index  is  a  limiting  number.  As  has  already  been 
pointed  out,  it  cannot  be  assumed  that  every  two  cardinal  numbers  are  such 
as  to  be  comparable  with  one  another;  but  a  proof  has  been  propounded  by 
G.  H.  Hardy f  that  c,  and  presumably  any  cardinal  number  whatever,  must 
either  be  an  Aleph-number,  or  else  be  greater  than  all  the  Aleph-numbers. 
The  mode  of  reasoning  is  a  generalization  of  that  employed  by  Cantor  in  his 
proof  (see  §  151)  that  N0  is  less  than  any  other  transfinite  cardinal  number. 

*  Math.  Annalen,  vol.  xxi,  p.  550. 
t  Quarterly  J.  of  Math.  1903,  p.  87. 
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Because  c  —  N1(  it  is  possible  to  take  elements  from  the  number-continuum 
corresponding  to  all  the  numbers  of  the  first  and  second  classes  of  ordinals. 
For,  if  this  process  came  to  an  end,  we  should  have  c  =  N0,  which  has  been 
proved  by  Cantor  not  to  be  the  case.  It  follows  that  a  set  can  be  selected 
from  the  continuum  equivalent  to  the  aggregate  of  ordinal  numbers  of  the 
first  and  second  classes.  Now  if  a  set  could  be  selected  from  this  aggregate 
equivalent  to  the  continuum,  it  would  follow  from  the  equivalence  theorem, 
proved  in  §  153,  that  0  =  ^;  and  if  no  such  set  could  be  selected  it  would 
follow  from  the  definition  of  inequality  in  §  146,  that  c  >^;  thus  it  follows 
that  c  =  Kj.  If  now  c  >  N1(  a  similar  proof  would  shew  that  c  =  K2,  and  so  on. 
If  c>Nn  for  every  finite  n,  then  c  =  Kw,  and  this  process  may  be  continued 
indefinitely  through  the  Aleph  series.  The  validity  of  this  proof  depends 
upon  considerations  which  will  be  discussed  in  §  198. 

It  is  known  that  every  infinite  closed  linear  set  of  points  has  one  or  other 
of  the  two  cardinal  numbers  N0,  c;  and  if  a  set  of  points  can  exist  of  which 
the  cardinal  number  has  neither  of  these  two  values,  it  must  be  unclosed,  and 
may  without  loss  of  generality  be  taken  as  dense  in  itself.  The  difficulties  of 
dealing  with  unclosed  sets,  dense  in  themselves,  are  so  great  that  attempts  to 
find  a  contradiction  involved  in  the  assumption  of  the  existence  of  such  a  set, 
possessing  a  cardinal  number  different  from  both  K0  and  c,  have  hitherto  been 
a  complete  failure. 

183.  A  very  remarkable  relation  has  been  given  by  Cantor  between  the 
cardinal  number  of  the  continuum  and  that  of  the  integral  numbers.  This 
relation  is  expressed  by  c  =  2^o,  or  more  generally  c  =  n^v,  where  n  is  a  finite 
integer. 

This  theorem  was  applied  by  its  discoverer  to  obtain  a  simple  arithmetical 
proof  that  the  K0-dimensional  continuum  has  the  same  power  as  the  one- 
dimensional  continuum. 

In  accordance  with  the  definition  of  an  exponential,  given  in  §  150,  2^u  is 
the  cardinal  number  of  the  numbers  in  the  dyad  scale, 

&i      ^      k 
2      22     2s 

where  every  b  is  either  0  or  1.     In  this  aggregate,  each  number  of  the  form 
~^  —  <  1,  where  p  and  q  are  integers,  occurs  twice;  hence 


2«u  =  (8,  X), 

where  X  is  the  aggregate  of  real  numbers  between  0  and  1,  and  s  is  an 
enumerable  aggregate. 

It  follows  from  the  above,  that  '2^>  =  c  +  N0.     Now  c  +  N0  =  c  4-  -Nu.  since 
K0  =  2N0;  therefore  c  =  c  +  Ny;  whence  we-  have  2xo  =  c. 
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From  this  theorem,  we  deduce  c .  c  =  2xo .  2xo  =  22Ro  =  2xo  =  c;  and  hence 
by  repeated  multiplication  by  c,  we  find  cn  =  c,  where  n  is  any  finite  integer. 

Again  c^o  =  (2^o)^o  =  2No-^o  =  2xo  =  c, 

and  therefore    the   continuum   of  finite,  or   of  enumerable,   dimensions  is 
equivalent  to  the  one-dimensional  continuum. 

The  aggregate  defined  by  all  possible  modes  of  distributing  the  numbers 
of  the  continuum  upon  themselves,  has  the  power  c°=f',  and  this  number  f  is 
greater  than  c.  This  has  been  proved  in  §  154  ;  for  a  part  of  the  new  aggregate 
is  equivalent  to  that  obtained  by  replacing  the  numbers  of  the  continuum 
either  by  A,  or  by  B,  and  taking  all  possible  aggregates  which  arise  in  this 
way.  Since  this  part  has  been  shewn  to  have  a  cardinal  number  greater  than 
that  of  the  original  aggregate,  it  follows  then  that/>  c. 

More  generally,  if  a  is  any  cardinal  number,  we  have  aa  >  a. 

If  the  continuum  be  divided  into  any  finite  number  n,  of  parts,  such  that 
all  the  parts  have  the  same  cardinal  number,  then  that  cardinal  number  is  the 
same  as  that  of  the  continuum*. 

The  parts  may  consist  of  sets  of  points  of  any  kind. 

The  theorem  may  also  be  stated  thus: — if  not  =  c,  then  a.  —  c. 

To  prove  it,  we  have  wo  =  c  =  nc;  and  therefore,  by  applying  the  theorem 
of  §  156,  it  follows  that  a  =  c. 

184.  The  continuum  is  equivalent  to  the  aggregate  of  all  possible  order- 
types  of  simply  ordered  aggregates  of  cardinal  number  X0. 

This  theorem  points  the  contrast  between  the  aggregate  of  all  order-types 
of  simply  ordered  enumerable  aggregates,  which  is  of  power  2^o,  and  the 
aggregate  of  all  the  order-types  of  normally  ordered  enumerable  aggregates, 
which  is  tfj.  The  latter  aggregate  is,  of  course,  a  part  of  the  former  one,  and 
thus  the  theorem  Kj  ^  2xo  can  be  deduced. 

The  theorem  may  be  also  stated  in  the  form:  The  total  number  of  ivays 
of  ordering  the  integral  numbers  1,  2,  3,  ...  is  c. 

If  jji  is  the  order-type  of  an  enumerable  aggregate,  arranged  as  a  simply 
ordered  aggregate,  then  a  part  of  an  aggregate  of  the  type  77,  considered  in 
§  162,  can  always  be  determined,  which  is  of  the  type  p.  To  establish  this,  it 
can  be  shewn  that  an  aggregate  of  type  p  can  always  be  changed  into  one  of 
type  i),  by  insertion  of  new  elements.  If,  between  every  pair  of  elements  m^, 
mz,  of  IJL,  there  are  other  elements,  then  /j,  is  of  one  of  the  types  rj,  1  +77,  77  +  1, 
1  +  ?;  +  1 ;  so  that  p  is  reduced  to  77  by  the  removal  of  the  lowest  and  the  highest 
elements,  when  such  elements  exist.  In  any  such  case,  if  we  add  to  p,  aggre- 

*  Bernstein,  loc.  cit.,  p.  31. 
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gates  of  type  rj,  at  the  beginning  and  at  the  end,  we  obtain  an  aggregate  of 
type  77  +  fi  +  rj,  which  is  of  type  77,  whichever  of  the  types  77,  1+77,  77+1,1+17  +  ! 
may  be  identical  with  /*.  If  pairs  of  elements  exist  in  the  aggregate  of  type  /A, 
such  that  there  are  no  elements  between  them,  an  aggregate  of  type  77  can  be 
inserted  between  every  such  pair,  until  a  new  aggregate  of  one  of  the  types  77, 
1  +77,  77  +  1,  1+77+1,  is  obtained;  then,  as  before,  by  adding  aggregates  of 
type  77,  at  the  beginning  and  end,  we  obtain  an  aggregate  of  type  77.  It  has  thus 
been  shewn  that  any  aggregate  of  type  p,  is  a  part  of  another  aggregate  of  type 
77.  Since  the  rational  numbers  in  their  totality  naturally  exist  in  the  order-type 
77,  it  follows  that  an  aggregate  of  any  type  //,  can  be  made  by  taking  a  part  of 
the  aggregate  of  rational  numbers,  of  type  77.  It  follows  that  the  aggregate 
of  all  types  /*  has  a  cardinal  number  less  than,  or  equal  to,  that  of  the  aggre- 
gate of  all  part-aggregates  of  the  set  of  rational  numbers,  arranged  in  type  77. 

Now  every  aggregate  (r^?*.,,  ...),  of  which  all  the  elements  are  rational 
numbers,  corresponds  to  a  single  point  of  a  continuum  of  an  enumerable  number 
of  dimensions,  of  which  the  coordinates  are  xl  =  rl,  x.2  =  r. ,,....  Hence  the 
cardinal  number  of  the  aggregate  of  all  part- aggregates  of  the  set  of  rational 
numbers  is  less  than,  or  equal  to,  the  cardinal  number  of  the  N0-dirnensional 
continuum,  that  is,  ^  c;  and  therefore  the  cardinal  number  of  the  aggregate 
of  all  types  /j,  is  ^  c. 

It  will  now  be  proved  that  c  ^  the  cardinal  number  of  the  aggregate  of 
all  types  /*.  To  every  real  number  between  0  and  1,  there  corresponds  an 
infinite  sequence  b^bg...,  where  every  b  is  either  0  or  1,  expressing  the  number 
in  the  dyad  scale.  After  each  6,  insert  an  aggregate  of  type  TT,  and  we  then 
have  an  aggregate  ^Trb^b^ ...,  of  type  v  =  bl  +  TT  +  bz  +  TT  +  b3+  TT  +  — 
Here,  some  of  the  6's  may  be  zero,  and  these  may  be  simply  omitted;  thus 
7r  +  0  +  7r  =  7r  +  7r.  Hence,  to  any  real  number  x,  between  0  and  1,  there 
corresponds  the  type 

v  =  6,  +  TT  +  62  +  . . . . 

It  is  now  necessary  to  shew  that  the  two  order-types  v,  v',  which  correspond 
to  two  different  numbers,  x,  x,  are  necessarily  distinct  from  one  another.  If 
v  =  i/(  we  can  write  the  equality  Cl  +  TT  +  £  =  CV  +  TT  +  £V,  where  C\,  CV  are 
each  either  0  or  1;  and  from  this  we  obtain,  by  means  of  the  theorem  of 
§  164,  C'i  =  CY,  and  f,  =  £/.  The  last  equation  can  be  written 

Co  +  TT  +  £,  =  a:  +  TT  +  £/, 

and  from  this  we  conclude  that  C»  =  CV,  £>  =  £>';  and  we  can  proceed  onwards 
in  the  same  manner.  From  6j  =  6/,  b.2=b.2',  ...,  we  conclude  that  ,i-  =  .c'.  It 
has  thus  been  shewn  that  {x}  —  {v};  and  from  this  we  conclude  that  c  ^  the 
cardinal  number  of  all  order-types  p. 

This  part  of  the  theorem  is  due  to  Cantor*,  and  the  first  part  to  Bernstein. 
By  combining  the  two  results,  the  complete  theorem  is  established. 

*  See  Bernstein's  Dissertation,  p.  7. 


232  Transfinite  numbers  and  order-types  [CH.  iv 

This  important  result  may  also  be  expressed  by  saying  that  the  totality 
of  oil  permutations  of  the  sequence  of  positive  integers  has  the  power  of  the 
continuum. 

It  may  also  be  shewn  that  the  totality  of  all  parts  of  the  sequence  1,  2,  3, ... 
has  the  power  of  the  continuum. 

For,  if  we  form  a  sequence,  by  writing  0  for  each  of  the  numbers  1,  2,  3,  ... 
which  does  not  occur  in  a  given  part  of  (1,  2,  3,  ...);  and  1  for  each  number 
which  does  occur  in  the  given  part,  then  the  sequence  of  O's  and  1's,  thus 
obtained,  corresponds  to  a  real  number  expressed  in  the  dyad  scale,  and 
therefore  the  numbers  of  the  continuum  are  put  into  correspondence  with  the 
parts  of  the  sequence  (1,  2,  3,  ...). 

185.  It  can  be  shewn  that  the  aggregate  of  all  sets  of  points  in  the 
n-dimensional  continuum  has  a  cardinal  number  greater  than  c. 

For,  in  the  aggregate  [P]  of  all  points  in  an  n -dimensional  continuum, 
we  can  substitute  0  for  each  point  P  which  does  not  occur  in  a  given  set 
of  points  of  the  continuum,  and  1  for  each  point  P  which  does  occur;  we 
then  obtain  an  aggregate  consisting  of  O's  and  1's :  but  it  is  known  that 
the  totality  of  all  such  aggregates  has  the  power  2C,  which  is  >c. 

On  the  other  hand,  the  totality  of  all  closed  sets  of  points  in  the  n-dimen- 
sional continuum  has  the  same  power  c  as  the  continuum. 

Every  closed  set  is  the  derivative  of  an  enumerable  set  of  points ;  and,  to 
every  enumerable  set  of  points,  there  corresponds  a  single  closed  set. 

It  follows  that  the  cardinal  number  of  the  totality  of  closed  sets  is  ^  the 
cardinal  number  of  the  totality  of  enumerable  sets  of  points  chosen  out  of  the 
continuum.  To  shew  that  the  latter  is  c,  we  observe  that  it  is  ^  the  aggre- 
gate of  all  combinations  of  points  of  the  continuum  in  sets  of  K0  elements, 
that  is,  ^  c**o,  or  £  c. 

.  Again,  every  single  point  of  the  N0-dimensional  continuum  corresponds 
to  a  single  point  of  the  one-dimensional  continuum,  and  this  point  is  an 
enumerable  part  of  the  continuum ;  hence  the  totality  of  enumerable  sets 
of  points  of  the  n  -dimensional  continuum  is  ^  c.  On  combining  this  with 
what  has  been  proved  above,  we  see  that  the  totality  of  all  enumerable  sets 
of  points  in  the  n  -dimensional  continuum  is  c;  hence  the  totality  of  all 
closed  sets  of  points  in  the  n-dimensional  continuum  is  ^  c. 

Again,  the  totality  of  all  closed  sets  of  points  in  the  n-dimensional 
continuum  is  ^  c.  For  one  such  closed  set  can  be  taken  in  each  of  an 
infinity  of  the  domains  xl  =  «,  where  x1  is  one  of  the  n  coordinates  which 
determine  the  position  of  a  point  in  the  n-dimensional  continuum ;  and  the 
aggregate  of  all  possible  values  of  a  has  the  power  c.  We  thus  obtain 
an  aggregate  of  closed  sets  which  has  the  power  c ;  and  it  follows  that  the 
aggregate  of  all  closed  sets  in  the  n-dimensional  continuum  is  ^  c. 
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Since  the  totality  of  all  closed  sets  of  points  in  the  n -dimensional  con- 
tinuum is  =  c,  and  at  the  same  time  is  £  c,  it  must  have  the  cardinal  number  c. 

Since*  every  curve,  or  surface,  in  a  continuum  is  formed  by  a  closed  set  of 
points,  we  see  that  every  possible  curve,  or  surface,  corresponds  uniquely  to 
a  single  definite  real  number. 

186.  A  method  of  constructing  a  set  of  points  of  which  the  cardinal 
number  is  N,,  has  been  given f  by  G.  H.  Hardy. 

If  we  start  from  the  sequence 

1,  2,  3,  4,  5, (1) 

of  integral  numbers,  a  new  sequence 

2,  3,  4,  5, (2) 

is  formed  by  omitting  the  first  term. 
Continuing  this  process,  we  form 

3,  4,  5,  6, (3) 

4,  5,  6,  7 (4) 

5,  6,  7,  8,  (5) 


We  now  form  a  new  sequence 

1,     3,     5,     7,     9,  («) 

by  traversing  the  above  infinite  array  of  sequences  diagonally.   Then  we  form 

3,     5,     7,     9,  11, («  +  !) 

5,     7,     9,  11,  13,  (ft)  +  2) 

7,     9,  11,  13,  15, (&)  +  3) 

9,  11,  13,  15,  17,  (w +  4) 


1,     5,     9,  13,  17,  (w. 2) 

5,     9,  13,  17,  21,  (w. 2  +  1) 

9,  13,  17,  21,  25,  (w.  2 +  3) 


1,     9,  17,  25,  33, (w. 3) 


Thus  sequences  corresponding  to  all  the  numbers  CD  .  /j,  +  v  can  be  formed. 
To  form  the  sequences  corresponding  to  w'2,  we  take  the  array  of  sequences 

1,     3,     5,     7,       9, (w) 

1,     5,     9,  13,     17,  (w. 2) 

1,     9,  17,  25,     33,  (w. 3) 

1,  17,  33,  49,     65,  (to  A) 

1,  33,  65,  97,  129,  (to .  5) 


*  Bernstein,  loc.  cit.,  p.  43. 

f  Quarterly  Journal  of  Matk.,  vol.  xxxv,  1903,  "A  theorem  concerning  the  infinite  cardinal 
numbers."    See  also  Hausdorff,  Leip.  Ber.,  vol.  LIX,  p.  155. 
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and  traverse  it  diagonally ;  we  thus  obtain 

1,     5,  17,  49,  129,  (w2). 

Generally,  if  blt  b2,  b3,  b4,  ...  is  the  sequence  corresponding  to  ft,  the 
sequence  b2,  b3,  64,  bs,  ...  corresponds  to  ft  +  l.  To  obtain  a  sequence  cor- 
responding to  a  number  7  which  is  a  limiting  number  of  the  second  class, 
we  take  the  array  of  sequences  corresponding  to  any  ascending  set  of 
numbers  ftly  /32,  ...  of  which  the  limit  is  7,  and  traverse  it  diagonally.  It  is 
clear  that,  in  this  manner,  a  sequence  can  be  found  for  any  given  number  of 
the  second  class ;  but  that  the  set  of  sequences  so  obtained  is  not  unique.  For 
example,  w  might  have  been  taken  as  the  limit  of  1,  3,  5,  7,  ...  ,  or  w2  might 
have  been  taken  as  the  limit  of  a>  +  1,  w  .  2  +  2,  a> .  3  +  3, . . . . 

It  will  be  shewn  that  the  sequences  615  b.2,  bs,  ...  can  be  so  chosen  that 
in  every  case  b1<b2<b3,  ...;  and  that,  if  the  sequences  b1}  62,  •  •  •  and  &/,  b2' . . . 
correspond  to  any  two  numbers  ft,  ft',  where  ft  <  ft',  then  there  exists  a 
number  N  such  that  bn'>bn,  for  n^N;  and  thus  that  the  sequences  are 
distinct  from  one  another. 

Let  us  assume  that  sequences,  corresponding  to  all  numbers  <  7,  have 
been  constructed  in  such  a  manner  that  this  condition  is  satisfied.  First, 
let  7  be  a  non-limiting  number,  so  that  7  =  7'+  1.  Then  if  ft<y,  there  is 
a  number  N  such  that  an'  >bn,  for  n  ^  N,  where  a/,  a-2',  a/,  ...  is  the  sequence 
which  corresponds  to  7'.  But  if  «!,  «2,  a3,  ...  is  the  sequence  which  cor- 
responds to  7,  we  have  an  =  an+l>  an' >bn,  for  n^N. 

Hence,  if  the  construction  is  possible  for  all  numbers  <y,  it  is  possible 
for  all  numbers  ^  7,  where  7  is  a  non-limiting  number. 

Next,  let  us  suppose  that  7  has  no  immediate  predecessor,  and  that 
7  =  lim./9m;  then  also  7  =  lim  (ftm  +  vm),  where  the  vm  are  finite  numbers. 

m~  oo  in  ~  x 

Now  there  is  a  number  JV",,  such  that  62,n>&i)»,  for  n  =  ^Vi,  where  bm>n 
denotes  the  wth  number  in  the  sequence  corresponding  to  ftm.  A  fortiori,  if 
7»,t  =  ftm  +  I'm,  we  have  c2)7i  =  62|M+I.a  >  &2,»  >  &i,n,  for  n  ^  Nlt  where  cm,n  is  the 
nth  number  in  the  sequence  corresponding  to  jm.  But  if  we  take  vz>bljyl-1, 
we  have  C3tn  =  b2tn+vag!n  +  va>bit2fl-l>bljn,  for  n<Nl;  and  hence  we  have 
ci,n  >  bltn,  for  all  values  of  n.  Similarly,  v3  can  be  so  chosen  that  73  >72,  and 
C'A,n>Cz,n,  f^  all  values  of  n ;  and  so  on  generally.  If  we  write  7,  for  ftly 
and  Cj.n  for  6]i?l)  we  have  a  doubly  infinite  array 


and  we  define  the  sequence  corresponding  to  7,  by  traversing  it  diagonally, 
so  that  cn=cn>n.  If  then  ft  <  7,  we  can  find  m  so  that  ft<rym',  then  there 
is  a  number  K,  such  that  cm<n>bn,  for  n^K.  But  if  n  >  m,  we  have 
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cn  •=  c»,»  >  cm>n ;  and  thus,  if  n  is  greater  than  the  greater  of  the  two  numbers 
m,  K,  we  have  cn>bn.  It  has  thus  been  shewn  that,  if  the  construction  is 
possible  for  all  numbers  <  7,  it  is  possible  for  all  numbers  £  y,  whether  y  is 
a  limiting  number  or  not. 

In  this  manner,  a  sequence  is  obtained  which  corresponds  to  any  assigned 
number  y  of  the  second  class,  and  this  sequence  is  distinct  from  those  which 
correspond  to  the  numbers  <  y,  such  sequences  being  also  distinct  from  one 
another. 

The  sequences  may  be  correlated  with  points  in  the  linear  continuum 
(0,  1).  To  correlate  a  sequence  61}  62,  63,  ... ,  we  may  take  the  binary  radix 
fraction  in  which  the  fcjth,  62th,  63th,  ...  figures  are  all  1,  and  the  remaining 
figures  all  0.  In  this  manner  a  set  of  points  is  shewn  to  exist,  such  that 
one  point  of  the  set  corresponds  to  each  number  of  the  first,  or  of  the  second, 
class.  This  amounts  to  the  construction  of  a  set  of  points  of  cardinal 
number  Nj.  Just  as  an  enumerable  set  of  points  is  determinate,  when  the 
point  which  corresponds  to  any  assigned  number  n,  of  the  first  class,  is 
determinate,  so  the  set  of  cardinal  number  Ka  is  determinate,  in  the  sense 
that  a  definite  point  is  determined,  corresponding  to  any  assigned  number 
ft,  of  the  first,  or  of  the  second,  class. 

It  may  be  remarked  that  a  set  of  points  of  cardinal  number  Xj,  or  of  any 
cardinal  number  >K0,  when  arranged  in  normal  order,  cannot  possibly  be  in 
the  order  in  which  they  occur  in  the  continuum. 

For,  if  a  set  of  points,  in  the  order  in  which  they  occur  in  the  continuum, 
forms  a  normally  ordered  aggregate,  each  point  and  the  next  succeeding  one 
define  a  linear  interval  of  which  they  are  the  end-points.  We  have  thus  a 
set  of  intervals  which  must  have  the  same  cardinal  number  as  the  given 
set  of  points.  Each  interval  of  the  set  abuts  on  the  next  one,  and  thus  the 
end-points  together  with  their  limiting  points  define  an  enumerable  closed 
set.  Hence  the  given  set  must  be  enumerable. 

GENERAL  DISCUSSION  OF  THE  THEORY. 

187.  Cantor's  definition  of  an  aggregate,  given  in  §  145,  implies  that  the 
elements  of  an  aggregate  are  logically  prior  to  the  aggregate  itself.  The 
definite  and  distinct  objects  are,  by  an  act  of  synthesis,  regarded  as  a  single 
collection,  or  aggregate.  Such  an  act  of  synthesis  involves  a  postulation, 
which  is  subject  to  the  law  of  contradiction.  It  appears  to  have  been  fully 
recognized  by  Cantor  himself,  in  connection  with  a  certain  so-called  "  incon- 
sistent "  aggregate,  to  be  discussed  below,  that  this  is  the  case.  The  postula- 
tions  of  the  existence,  as  single  objects,  of  the  aggregate  of  integral  numbers, 
and  of  that  of  real  numbers,  are  instances  of  such  an  act  of  synthesis.  A 
somewhat  different  view  of  the  nature  of  an  aggregate  is  however  widely 
spread.  In  accordance  with  this  view,  the  elements  of  an  aggregate  consist 
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of  all  objects  that  satisfy  some  prescribed  condition  or  conditions;  and  thus 
the  aggregate  consists  of  a  certain  class  of  objects;  and  there  has  been  a 
tendency  to  identify  the  conception  of  aggregate  with  that  of  "class,"  employed 
in  Formal  Logic.  The  aggregate  itself  is,  from  this  point  of  view,  logically 
prior  to  the  elements  it  contains ;  in  fact,  in  any  given  case,  it  may  be  doubt- 
ful whether  any  such  elements  exist;  or  it  may  even  be  known  that  no 
element  exists,  in  which  case  the  aggregate,  or  class,  is  called  a  null-aggregate, 
or  null-class.  Thus  the  relation  of  the  aggregate  to  its  elements  is  inverse 
to  that  which  is  in  accordance  with  Cantor's  synthetic  definition.  The  fol- 
lowing form  of  definition  expresses  this  view  of  the  nature  of  an  aggregate: 

All  objects  which  are  such  as  to  satisfy  a  prescribed  norm  are  said  to 
belong  to  an  aggregate  defined  by  that  norm.  The  norm  consists  of  a  set  of 
specified  conditions,  or  of  a  set  of  alternative  specified  conditions;  and  this 
norm  must  be  sufficient  to  render  it  logically  determinate,  as  regards  any 
particular  object  whatever,  whether  that  object  belongs  to  the  aggregate  or  not. 

In  the  case  of  a  finite  aggregate  the  norm  may  take  the  form  of  indi- 
vidual specification  of  the  objects  which  form  the  aggregate,  but  such  ex- 
haustive individual  specification  of  the  elements  of  a  non-finite  aggregate  is 
not  possible.  The  condition  that  it  must  be  logically  determinate  whether 
a  particular  object  is  an  element  of  a  particular  aggregate,  or  not,  covers 
cases  in  which  effective  determination  is  not  possible,  either  because  such 
determination  is  impracticable,  on  account  of  the  length  of  the  process  which 
would  be  requisite  actually  to  make  the  decision,  or  because  we  are  not  in 
possession  of  the  requisite  means  for  making  the  decision.  For  example, 
the  determination  of  the  millionth  digit  in  the  value  of  the  number  TT  is 
merely  impracticable,  although  it  can  be  carried  out  by  a  finite  process. 
Again,  the  question  whether  a  particular  number,  such  as  Mascheroni's 
constant,  belongs  to  the  aggregate  of  transcendental  numbers,  has  a  logically 
determinate  answer,  although  we  may  not  be  in  possession  of  a  method  by 
which  that  answer  can  be  obtained.  The  definition  admits  the  logical  deter- 
mination as  sufficient  when  effective  determination  is  impossible  or  difficult. 
There  may  even  be  cases  in  which  the  effective  determination  of  the  question 
whether  a  defined  aggregate  is  a  null-aggregate,  or  not,  is  impossible,  by  the 
employment  of  existing  means,  or  is  so  difficult  as  to  be  impracticable. 

It  is  clear  that  the  elements  of  an  aggregate,  being  subject  to  a  common 
norm,  must  have  a  certain  community  of  nature,  their  class  mark,  which 
constitutes  the  ground  of  the  aggregation. 

188.  The  discussion  of  the  theory  of  aggregates  which  has  taken  place 
amongst  mathematicians,  since  the  publication  of  Cantor's  theories,  has  led  to 
the  examination  of  certain  aggregates  which  appear  to  satisfy,  prima  facie, 
the  definition  of  an  aggregate,  and  yet  which  are  such  that  their  existence,  at  all 
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events  if  they  are  regarded  as  possessing  cardinal  numbers  or  order-types, 
involves  certain  contradictions.  In  this  manner,  what  are  known  as  the  anti- 
nomies of  the  theory  of  aggregates  have  arisen.  Both  Cantor's  definition  of  an 
aggregate,  and  the  definition, givenabove,  of  an  aggregate  asa  class,  lead  to  anti- 
nomies of  this  kind.the  most  important  of  which  will  be  discussed  below.  It  was, 
however,  recognized  by  Cantor*  that  the  collection  of  elements  into  an  aggre- 
gate is  only  valid  provided  a  conception  free  from  contradiction  is  the  result 
of  such  synthesis.  Suggestions  have  been  made  in  various  quarters,  in  the 
direction  of  limiting  the  concepts  of  the  theory  by  means  of  postulates,  in  such  a 
manner  that  the  antinomous  aggregates  will  not  arise.  At  the  present  time 
however,  no  such  generally  accepted  scheme  of  postulations  and  axioms  is 
in  existence.  The  final  goal  of  such  investigations  would  be  attained  if  we 
were  in  possession  of  a  definition  of  an  aggregate,  of  such  a  character  that 
it  could  be  shewn  a  priori  to  be  free  from  contradiction,  without  waiting  for 
the  discovery  of  antinomies.  Such  a  definition  should  also  be  sufficiently 
wide;  so  that  all  aggregates,  which  are  fitted  for  employment  in  a  mathe- 
matical theory,  would  fall  under  it. 

On  systems  of  axioms  which  are  suited  for  the  building  up  of  a  theory 
of  aggregates  of  such  a  character  that  no  contradictions  will  arise,  the  writings 
of  Hilbertf,  Zermelo^;,  Russell§,  Schoenflies||,  and  HuntingtonH  may  be 
consulted.  Developments  of  the  theory  which  go  beyond  those  of  which 
an  account  is  given  in  the  present  chapter  will  be  found  in  the  writings 
of  Hausdorff**,  Hessenbergff,  SchoenfliesJJ,  Bernstein  §§,  Jourdain||||  and 
WhiteheadlHI.  A  treatise  has-  been  published  by  Konig***,  in  which  the 
subject  is  treated  from  his  own  point  of  view. 

189.  It  is  not  clear  that  an  aggregate,  defined  as  a  class  of  objects,  is 
necessarily  capable  of  being  ordered  at  all.  For  example,  it  is  difficult  to 

*  See  Jourdain,  Phil.  Mag.  (6),  vol.  vn,  p.  55. 

•}•  Jahresber.  d.  Math.  Vereiniauna,  vol.  ix,  p.  55;  Verhand.  d.  Math.  Kongress  z.  Heidelberg, 
p.  176;  d.  Math.  Vereinigung,  vol.  vni,  1,  p.  180.  See  also  Mollerup,  Math.  Anna-ten,  vol.  LXIV, 
p.  231. 

£  Math.  Annalen,  vol.  LXV,  p.  261;  Acta  Math.,  vol.  xxxn,  p.  186. 

§  Proc.  Lond.  Math.  Soc.  (2),  vol.  iv,  p.  29. 

ii  Die  Entwickelungen  der  Lehre  von  den  Punktmannigfaltigkeiten. 

If  Ann.  of  Math.  (2),  vol.  vi,  p.  151,  and  (2),  vol.  vn,  p.  15. 

**  Grundziiye  der  Mengenlehre,  Leipzig,  1914;  also  Math.  Annalen,  vol.  LXV,  p.  435,  and 
vol.  LXXV,  p.  469;  Leipz.  Ber.,  vol.  LIII,  p.  460;  vol.  LVIII,  p.  106,  and  vol.  LIX,  p.  84.  Jahresber. 
d.  Math.  Vereinigung,  vol.  xm,  p.  570. 

•ff  Grundbegriffe  der  Mengenlehre,  Gottingen,  1906. 

JJ  Math.  Annalen,  vol.  LIX,  p.  129;  vol.  LX,  p.  181,  and  p.  431. 

§§  Math.  Annalen,  vol.  LXI,  p.  117. 

Jill  Phil.  Mag.  (6),  vol.  Yin,  p.  49,  (7),  vol.  vi,  p.  73,  and  p.  302. 

HIT  Amer.  J.  of  Math.  (2),  vol.  vi,  p.  151,  and  (2),  vol.  vn,  p.  15. 

**  Neue  Grundlagen  der  Logik,  Arithmetik  und  Mengenlehre,  also  Math.  Annalen,  vol.  LX, 
p.  177,  and  p.  462.  Also  C.  R.  Paris,  vol.  CXLIII,  p.  110,  and  Verhand.  d.  Math.  Kongress  z. 
Heidelberg,  p.  144. 


238  Transfinite  numbers  and  order-types  [OH.  iv 

see  that  such  an  aggregate  as  that  of  "all  propositions"  could  conceivably  be 
ordered;  where  it  is  assumed  that  the  meaning  of  the  word  "proposition"  is 
taken  as  so  definite,  that  this  aggregate  has  a  norm  in  accordance  with  the 
definition  above.  Again,  to  take  an  example  among  aggregates  of  the  kind 
usually  considered  in  Mathematical  theory,  we  may  consider  the  aggregate 
obtained  by  distributing  the  aggregate  of  real  numbers  upon  itself.  This  aggre- 
gate which  has  the  cardinal  number  f=  c0,  is  equivalent  to  the  aggregate  of  all 
the  functions  of  a  real  variable;  it  is  difficult,  if  not  impossible,  to  see  how  order 
could  be  effectively  imposed  upon  this  aggregate.  If  then,  a  transfinite  aggre- 
gate is  to  be  given  as  an  ordered  aggregate,  or  is  to  have  an  order  imposed  upon 
it,  or  rather  discovered  in  it,  it  would  appear  to  be  necessary  that  the  norm, 
which  constitutes  the  definition  of  the  aggregate,  should  be  of  such  a  character, 
that  a  principle  of  order  is  contained  therein,  or  can  at  all  events  be  adjoined 
thereto;  so  that,  when  any  two  particular  elements  are  considered,  the  con- 
ditions which  they  satisfy  in  virtue  of  their  belonging  to  the  aggregate,  when 
individualized  for  the  particular  elements,  may  be  sufficient  also  to  allow  of 
relative  rank  being  assigned  to  those  elements,  in  accordance  with  a  principle 
of  order.  This  is  in  fact  the  case  in  such  aggregates  as  those  of  the  integral 
numbers,  the  rational  numbers,  or  the  real  numbers.  In  the  case,  for  example, 
of  the  positive  rational  numbers,  the  relative  rank  of  any  two  particular  elements 
(P>  <?)>  (P>  •?')  i§  assigned  by  the  system  of  postulations,  contained  in  §  12, 
which  defines  the  aggregate.  It  may,  of  course,  also  be  possible  in  other 
cases,  as  in  this  one,  to  re-order  the  aggregate,  in  accordance  with  some  other 
law,  extrinsically  imposed  upon  the  aggregate;  but  the  nature  of  the  elements 
must  be  such  that  this  is  possible. 

We  can  now  state  that: 

In  order  that  a  transfinite  aggregate,  defined  as  in  §  187,  may  be  capable  of 
being  ordered,  a  principle  of  order  must  be  explicitly  or  implicitly  contained  in 
the  norm  by  which  the  aggregate  is  defined. 

The  relative  order  of  any  two  elements  chosen  from  an  ordered  aggregate 
depends  upon  the  individual  characteristics  of  those  elements,  in  accordance 
ivith  the  principle  of  order. 

In  the  definition  of  order- type  given  by  Cantor  (see  §  157),  according  to 
which  the  order-type  of  an  aggregate  is  obtained  by  making  abstraction  of 
the  particular  nature  of  the  elements  of  the  aggregate,  it  is  assumed  that 
the  aggregate  is  given  as  an  ordered  aggregate.  Again,  in  his  definition 
of  cardinal  number  (see  §  145),  Cantor  has  assumed  that  the  aggregate  is 
given  as  an  ordered  one;  the  cardinal  number  there  appears  as  the  result  of 
a  double  abstraction,  viz.  of  the  particular  nature  of  the  elements,  and  of  the 
order  in  which  they  are  given.  The  question  however  arises,  whether  the 
definition  of  cardinal  number  should  not  be  such  as  to  be  also  applicable  in 


189,  i9o]  General  discussion  of  the  theory  239 

the  case  of  aggregates  which  are  not  given  as  ordered  aggregates.  Cantor  has 
himself,  in  fact,  in  his  theory  of  exponentials  involving  transfinite  cardinal 
numbers,  contemplated  certain  aggregates  as  having  cardinal  numbers,  whilst 
such  aggregates  were  not  given  as  ordered  aggregates,  and  primd  facie,  at  all 
events,  are  not  capable  of  being  ordered. 

190.  Taking  the  case  of  an  aggregate  defined  as  an  ordered  aggregate, 
we  now  approach  the  consideration  of  the  fundamental  question,  whether, 
and  under  what  conditions,  if  any,  such  an  aggregate  can  be  regarded  as 
having  a  definite  order-type,  and  a  definite  cardinal  number.  This  is  equiva- 
lent to  asking  whether,  or  when,  meanings  can  be  given  to  those  terms,  in 
accordance  with  general  definitions,  of  such  a  character  that  they  can  be 
treated  as  permanent  objects  for  thought,  or  as  mathematical  entities  which 
may  themselves  be  elements  in  aggregates. 

With  reference  to  Cantor's  definition  (see  §  145),  of  the  cardinal  number 
of  a  transfinite  aggregate,  by  abstraction,  in  accordance  with  which  the 
cardinal  number  is  represented  by  replacing  each  element  by  an  abstract 
unity,  it  must  be  observed  that  such  a  substitution  would  replace  the  given 
aggregate  by  another  one  which  had  no  longer  any  intelligible  relation  with 
the  norm  by  which  the  original  aggregate  is  defined.  The  abstract  unities 
would  be  indistinguishable  from  one  another,  and  the  new  aggregate  would 
be  indistinguishable  from  any  other  non-finite  aggregate  of  such  unities.  It 
would  be  impossible  to  decide,  as  regards  any  particular  abstract  unity, 
whether  it  belonged  to  the  aggregate  or  not;  in  fact,  to  make  complete 
abstraction  of  the  individual  nature  of  the  elements  of  an  aggregate  is  to 
destroy  the  aggregate.  A  definition  by  abstraction  could  be  justified  only 
by  the  interpretation  that  abstraction  is  made  of  those  characteristics  only, 
in  which  the  elements  of  the  aggregate  differ  from  the  corresponding  ele- 
ments of  all  possible  equivalent  aggregates.  Thus  the  existence  of  aggregates 
equivalent  to  the  given  aggregate  would  appear  to  be  essential,  if  the  latter 
is  to  be  regarded  as  having  a  cardinal  number  to  which  any  definite  meaning 
can  be  attached.  On  the  grounds  stated,  the  definition  of  a  cardinal  number, 
as  the  characteristic  of  a  class  of  equivalent  aggregates,  is  to  be  preferred  to 
the  definition  given  by  Cantor.  Accordingly,  an  aggregate  has  a  cardinal 
number,  only  when  it  is  one  of  a  plurality  of  equivalent  aggregates,  distinct 
from  one  another.  In  all  cases  the  correspondence  between  equivalent  aggre- 
gates must  be  definable  by  some  norm. 

We  are  thus  led  to  the  following  statement,  containing  a  definition  of 
cardinal  number : 

The  members  of  any  particular  class  of  equivalent  aggregates  have  a 
quality  in  common  in  virtue  of  their  equivalence.  This  quality  is  the  cardinal 
number,  and  may  be  regarded  as  characteristic  of  each  aggregate  of  the 
particular  class. 


240  Transfinite  numbers  and  order-types  [CH.  iv 

A  cardinal  number  has  been  defined*  by  B.  Russell  to  be  a  class  of 
equivalent  aggregates.  It  may  then  be  urged  that  such  a  class  may  contain 
only  one  member,  and  that  this  is  sufficient  for  the  existence  of  the  cardinal 
number  of  that  member.  In  fact,  Russell  infers  f  the  existence  of  the  number 
n  + 1  from  that  of  the  numbers  0,  1,  2,  3,  ...  n.  Russell  objects^  to  the  con- 
ception of  a  number  as  the  common  characteristic  of  a  family  of  equivalent 
aggregates,  on  the  ground  that  there  is  no  reason  to  think  that  such  a  single 
entity  exists,  with  which  the  aggregates  have  a  special  relation;  but  that 
there  may  be  many  such  entities,  and  that  there  are  in  fact  an  infinite  number 
of  them.  The  mind  does,  however,  in  point  of  fact,  in  the  case  of  finite 
aggregates  at  least,  recognize  the  existence  of  such  a  single  entity,  viz.  the 
number,  or  degree  of  plurality,  of  the  aggregate;  and  this  creation,  by 
abstraction,  is  a  valid  process,  subject  to  the  law  of  contradiction. 

191.  In  Cantor's  definition  of  the  order-type  of  a  simply  ordered  trans- 
finite  aggregate  (see  §  157),  abstraction  is  made  of  the  nature  of  the  elements, 
their  order  in  the  aggregate  being  alone  retained.  The  order-type  is  then 
regarded§  as  represented  by  an  aggregate  of  abstract  unities,  in  the  order 
of  the  elements  of  the  given  aggregate.  In  any  ordered  aggregate,  it  is 
however  the  individual  characteristics  of  any  two  elements  which  determine 
their  relative  order  in  the  aggregate,  in  accordance  with  some  principle  of 
order,  valid  for  the  whole  aggregate.  If  complete  abstraction  be  made  of 
the  characteristics  of  the  various  elements,  order  has  then  disappeared  from 
the  aggregate.  It  must  be  supposed  that,  in  Cantor's  representation  of  the 
order-type,  there  are  attached  to  the  abstract  unities  marks  of  some  kind, 
which  may  in  particular  cases  be  marks  indicating  position  in  space  or  time, 
by  which  the  order  of  the  various  abstract  unities  is  denoted ;  the  given 
aggregate  is  then  really  replaced  by  an  aggregate  of  these  marks,  and  the 
abstract  unities  are  superfluous.  These  marks,  by  which  order  is  determined, 
must  also  have  been  associated  with  the  elements  of  the  original  aggregate. 
It  thus  appears,  that  in  a  definition  by  abstraction,  it  can  be  only  those 
characteristics  (if  any)  of  the  various  elements  which  are  irrelevant  in 
determining  the  order,  of  which  abstraction  is  made :  thus  the  aggregate  is 
really  replaced  by  a  similar  one.  On  these  grounds,  that  definition  of  an 
order- type  is  to  be  preferred,  in  which  the  order- type  is  defined  as  the 
characteristic,  or  class-name,  of  a  class  of  similar  aggregates.  Accordingly, 
in  order  that  a  given  aggregate  may  have  an  order-type,  to  which  a  definite 
meaning  can  be  attached,  it  is  necessary  that  the  aggregate  be  one  of  a 
plurality  of  similar  aggregates. 

We  may  accordingly  state  that : 

The  members  of  any  particular  class  of  similar  aggregates  have  a  quality 

*  Principles  of  Mathematics,  vol.  i,  pp.  Ill — 116.  f  Ibid.,  p.  497. 

J  Ibid.,  p.  114.  §  See  Math.Annalen,  vol.  XLVI,  p.  497. 
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t'n  common,  in  virtue  of  their  relation  of  similarity.  This  quality  of  mutual 
similarity  possessed  by  the  aggregates  is  their  order-type,  and  may  be  repre- 
sented by  a  name  or  symbol,  regarded  as  characteristic  of  each  aggregate  of 
the  particular  class. 

The  considerations  above  adduced  may  be  applied  in  the  case  of  an 
aggregate  which  is  a  segment  of  the  hypothetical  aggregate  of  all  ordinal 
numbers.  In  this  case  it  is  impossible  to  make  abstraction  of  the  nature 
of  the  individual  elements  of  the  aggregate,  without  destroying  the  order, 
because  the  elements  are  themselves  nothing  more  than  marks  indicating  order. 

192.  The  question  has  frequently  been  discussed  whether  the  conception 
of  an  ordinal  number  is  necessarily  prior  to  that  of  a  cardinal  number.  The 
view  is  held  that  the  very  earliest  conception  of  a  number  was  developed  as 
the  result  of  immediate  intuition  of  the  degree  of  plurality  in  a  very  small 
group  of  objects,  for  example  of  a  pair,  or  of  a  triplet,  or  of  the  unity  in  a 
single  object,  without  recourse  to  the  process  of  counting.  In  this  manner 
the  conception  of  the  first  few  integral  numbers  may  have  arisen,  and  the 
conception  would  be  that  of  cardinal  number;  the  objects  in  a  group  not 
being  regarded  as  in  any  particular  order.  This  mode  of  apprehension  of  the 
number  of  objects  in  a  group  must  certainly  have  been  extremely  limited  in 
its  scope,  and  the  conception  of  numbers,  after  the  first  few,  must  have  arisen, 
as  explained  in  §  1,  in  connection  with  the  process  of  counting,  in  which  the 
objects  are  taken  in  some  definite  order ;  thus  leading  in  the  first  instance 
to  the  ordinal  number  of  the  group.  It  appears  certain,  then,  that  the  actual 
order  of  development  of  Arithmetic,  except  for  the  case  of  the  first  few 
numbers,  must  have  been  one  in  which  the  ordinal  number  has  a  certain 
priority  to  the  cardinal  number. 

In  the  systematic  development  of  the  theory  of  numbers,  finite  and 
transfinite,  the  notion  of  an  ordered  aggregate  has  certainly  been  fundamental, 
and  not  derivative.  A  reversal  of  the  historical  procedure,  in  which  ordered 
aggregates,  having  definite  ordinal  numbers  or  order-types,  were  first  con- 
sidered, and  afterwards  the  notion  of  the  cardinal  number  has  arisen,  would 
lead  to  probably  insuperable  difficulties.  It  is  no  doubt  possible  to  define 
the  cardinal  number  of  a  transfinite  aggregate,  as  has  in  fact  been  done  above, 
in  such  a  way  that  the  notion  of  order  does  not  explicitly  appear,  but  the 
question  of  the  comparability  of  cardinal  numbers  so  defined  at  once  arises, 
and  the  consideration  of  ordered  aggregates  would  appear  to  be  indispensable 
for  the  development  of  any  systematic  arithmetic  of  cardinal  numbers. 
Cantor's  aleph-numbers,  as  forming  an  ordered  family  of  cardinal  numbers, 
are  essentially  dependent  upon  the  theory  of  the  order-types  of  normally 
ordered  aggregates.  It  is  difficult  to  see,  for  example,  how  it  could  be  known, 
in  respect  of  a  particular  aggregate,  that  it  has  the  cardinal  number  K0,  unless 
a  correspondence  of  the  elements  of  the  aggregate  with  the  ordered  set  of 
H.  1G 
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finite  numbers  could  be  shewn  to  exist.  It  may  be  asserted  that,  since  any 
non-finite  part  of  an  enumerable  aggregate  is  itself  enumerable,  there  may 
exist  a  part  of  the  integer  sequence  1,  2,  3,  ...  n, ...  which  cannot  be  defined 
by  means  of  a  finite  set  of  rules,  and  yet  that  such  an  aggregate  must  have 
the  cardinal  number  K0.  A  distinction  is  accordingly  drawn,  by  some  writers*, 
between  those  enumerable  aggregates  for  which  a  correspondence  with  the 
integer  sequence  1,  2,  3,  ...  can  be  effectively  defined,  and  those  for  which  this 
is  not  the  case.  It  can,  however,  be  shewn  that  there  exists  no  part  of  the 
integer  sequence  [n]  which  is  inherently  incapable  of  finite  definition.  It 
has  been  shewn,  in  |  63,  in  the  case  of  the  numbers  of  the  continuum,  that 
those  numbers,  in  any  interval,  that  are  capable  of  finite  definition,  form  an 
aggregate  which  has  all  the  properties  of  the  continuum  itself;  and  in  view 
of  the  theory  of  the  order-type  of  the  continuum  given  in  §  163,  that  aggregate 
is  identical  with  the  continuum. 

Any  non-finite  part  of  the  integer  sequence  {??}  can,  however,  be  made  to 
correspond  with  a  particular  number  of  the  continuum  in  the  interval  (0,  1); 
we  take  such  a  number  expressed  as  a  radix  fraction  in  the  binary  scale, 
defined  by  the  rule  that  the  rath  digit  is  0  or  1,  according  as  the  integer  m 
occurs,  or  does  not  occur,  in  the  part  of  the  integer  sequence  which  is  under 
consideration.  Since  there  is  a  (1,  1)  correspondence  between  the  parts  of 
the  integer  sequence  and  the  numbers  of  the  continuum  within  the  interval 
(0,  1),  it  follows  that,  since,  as  has  been  shewn  in  §  63,  no  number  of  the 
continuum  is  inherently  incapable  of  finite  definition,  the  same  is  true  of  the 
parts  of  the  integer  sequence.  It  thus  appears  that  no  enumerable  aggregate 
is  inherently  incapable  of  being  placed  into  correspondence  with  the  numbers 
of  the  integer  sequence,  by  a  finite  set  of  rules,  although  a  particular  fixed 
apparatus  of  definition  may  be  insufficient  for  this  purpose. 

193.  We  proceed  to  consider  those  aggregates  which  consist  of  ordinal 
numbers  in  their  order  of  generation. 

There  are  two  distinct  methods  of  establishing  the  existence  of  a  class 
of  mathematical  entities. 

(1)  Their  existence,  as  definite  objects  for  thought,  may  be  shewn  to 
follow  as  a  logical  consequence  of  the  existence   of  other  entities  already 
recognized  as  existent,  or  of  principles  already  recognized  as  valid ;  so  that 
the  existence  of  the   new   entities  in   question  cannot   be   denied   without 
coming  into  contradiction  with  truths  already  known,  or  at  all  events  with 
postulations  already  made.     This  method  may  be  termed  the  genetic  method. 

(2)  The  existence  of  the  entities  may  be  directly  postulated ;  and  their 
mutual  relations,  and  their  relations  with  other  entities  already  assumed  to 
exist,  may  be  defined  by  means  of  a  complete  system  of  definitions  and  pos- 

*  See,  for  example,  Borel,  Annales  de  Vecole  normale,  (3),  vol.  xxv,  p.  447. 
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tulations.  Accordingly,  the  objects  in  question  are  a  relatively  free  creation 
of  our  mental  activity.  The  validity  of  the  scheme  thus  set  up  is  established 
when  it  is  shewn  to  be  free  from  internal  contradiction.  Its  utility  is  to  be 
judged  by  its  applicability  to  the  general  purposes  of  the  science,  and  by  the 
light  it  may  throw  upon  the  fundamental  principles  of  that  science,  in  virtue 
of  the  scheme  containing  a  generalization  of  what  was  previously  known. 
This  method  may  be  termed  the  method  of  postulation.  It  may,  however,  be 
urged  that  the  failure  to  discover  contradictions  within  a  scheme  of  postula- 
tions  is  no  proof  that  such  contradictions  do  not  exist,  and  that  such  proof 
can  only  be  supplied  by  the  exhibition  of  a  system  of  entities  already  assumed 
to  exist,  such  that  the  relations  between  them  are  in  accordance  with  those 
postulated  in  the  scheme  in  question. 

Both  these  methods  have  been  employed  by  Cantor  in  his  theory  of  trans- 
finite  numbers  and  order- types.  In  his  earlier  treatment  of  the  subject,  he 
employed  the  second  of  the  above  methods.  The  existence  of  the  new 
number  &>,  and  of  the  limiting  numbers  of  the  second  class,  was  postulated, 
in  accordance  with  the  second  principle  of  generation.  Freedom  from 
contradiction,  and  utility  in  connection  with  the  theory  of  sets  of  points, 
which  suggested  the  postulations,  were  relied  upon  as  the  grounds  upon 
which  the  system  of  new  numbers  was  to  be  justified.  The  first  number  H, 
of  the  third  class,  was  introduced  by  a  new  postulation. 

In  his  later  and  more  abstract  treatment  of  the  subject,  an  account  of 
which  has  been  given  in  the  present  chapter,  Cantor  applied  the  genetic 
method.  The  existence  of  the  number  &>  is  not  directly  postulated,  but  is 
taken  to  follow  from  the  existence  of  the  aggregate  {n},  of  integral  numbers; 
eu  is  defined  to  be  the  order-type  of  this  aggregate,  and  it  is  assumed  that 
such  order-type  is  a  definite  object  which  can  itself  be  an  element  of  an 
aggregate.  The  successive  ordinal  numbers  of  the  successive  classes  are 
obtained  by  assuming  as  a  general  principle,  that  an  ordered  aggregate 
necessarily  possesses  a  definite  order-type  which  can  be  regarded  as  itself  an 
object,  the  ordinal  number  coming  immediately  after  all  those  that  are  the 
elements  of  the  aggregate  of  which  it  is  the  order-type. 

It  will  be  seen  later,  that  the  assumptions  that  an  ordered  aggregate 
necessarily  possesses  a  definite  order-type,  and  that  it  also  possesses  a 
definite  cardinal  number,  both  of  which  can  be  regarded  as  objects,  lead  to 
the  contradiction  pointed  out  by  Burali-Forti.  It  appears,  therefore,  that 
the  class  of  entities,  which  is  constituted  by  the  ordinal  numbers  of  all  classes, 
and  the  similar  aggregate  of  aleph-numbers,  do  not  satisfy  the  condition  of 
being  subject  to  a  scheme  of  relations  which  is  free  from  contradiction. 
In  fact,  the  principle,  in  accordance  with  which  their  existence  is  inferred, 
conflicts  with  the  definition  of  the  aggregates  as  containing  respectively 
every  ordinal  number,  and  every  aleph-number.  It  would  then  appear, 
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that  the  genetic  process  which  led  to  the  definition  of  the  aggregates  of  all 
ordinal  numbers,  and  of  all  aleph-numbers,  cannot  be  a  valid  one  without 
some  restriction.  Thus  the  principle  that  every  ordered  aggregate  has  a 
definite  order-  type,  which  may  be  regarded  as  a  permanent  object  of  thought, 
cannot  be  accepted  as  a  universal  principle  to  be  used  in  a  genetic  mode  of 
establishment  of  the  existence  of  a  class  of  entities.  A  denial  of-  the  validity 
of  this  principle  does  not  however  preclude  the  less  ambitious  procedure  of 
postulating  the  existence  of  definite  ordinal  numbers  of  a  limited  number  of 
classes,  in  accordance  with  Cantor's  earlier  method.  So  long  as  the  postulation 
of  the  existence  of  ordinal  numbers  does  not  go  beyond  some  definite  point, 
no  contradiction  will  arise,  and  the  validity  of  the  scheme,  for  purposes  of 
representation,  will  suffice  to  justify  the  postulations  which  are  made.  An 
attempt  to  examine  the  structure  of  such  a  class  of  ordinal  numbers,  as  that 
of  the  <uth  class,  with  cardinal  number  Nw,  or  that  of  the  Dth  class,  with  cardinal 
number  Kn,  will  lead  to  the  conviction  that  such  conceptions  are  unlikely  to 
prove  capable  of  useful  application  in  any  branch  of  Analysis  or  of  Geometry. 

THE    PARADOXES    OF    BURALI-FORTI    AND    RUSSELL. 

194.    In  accordance  with  Cantor's  general  theory  of  ordinal  numbers,  and 
of  aleph-numbers,  there  exist  two  aggregates 

1,2,3,...     n,  ...    w,  w  +  1,  ...    fl,  n  +  1,  ...    /?,... 


the  first,  the  aggregate  of  all  ordinal  numbers,  which  may  be  denoted  by  W  ; 
and  the  second,  that  of  all  N  cardinal  numbers.  These  aggregates  are  similar 
to  one  another,  and  they  contain,  respectively,  every  ordinal  number,  and  every 
cardinal  number  which  belongs  to  a  normally  ordered  aggregate. 

The  aggregate  W  is  normally  ordered.  For,  if  W^  be  any  part  of  W,  such 
that  W  contains  one  or  more  elements  of  higher  order  than  all  the  elements 
of  TFj,  it  follows  that  TFj  is  a  part  of  some  segment  of  W  that  also  contains  one 
or  more  such  elements.  Since  such  segment  is  normally  ordered,  there  exists 
one  element  of  that  segment,  and  therefore  of  W,  which  immediately  follows 
the  part-aggregate  W-^.  It  is  thus  seen  that,  since  the  conditions  of  §  165  are 
satisfied,  W  is  normally  ordered.  In  accordance  with  the  principle  which  is 
fundamental  in  the  whole  theory,  that  every  normally  ordered  aggregate  has 
a  definite  order-  type,  which  is  its  ordinal  number,  and  has  also  a  definite 
cardinal  number,  the  aggregate  W  has  an  ordinal  number  7,  and  a  cardinal 
number  Kr  The  ordinal  number  7  must  itself  occur  in  the  aggregate  W, 
and  must  therefore  be  the  greatest  ordinal  number,  i.e.  the  last  element  of 
W.  There  can,  however,  be  no  last  ordinal  number;  for,  on  the  assumption  of 
the  existence  of  7,  an  aggregate,  of  ordinal  number  7+  1,  can  be  formed,  by 
adding  to  W  a  new  element  e,  of  higher  rank  than  all  the  elements  of  W. 
Therefore  a  contradiction  has  been  arrived  at. 
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This  contradiction  in  the  conception  of  the  aggregate  W,  of  all  order-types 
of  normally  ordered  aggregates,  was  pointed  out  by  Burali-Forti*,  but  as  is 
stated  by  Jourdainf,  it  was  discovered  by  Cantor  in  1895,  and  communicated 
by  him  to  Hilbert  in  1896,  and  to  Dedekind  in  1899.  The  contradiction  has 
been  discussed  by  various  writers,  and  the  conclusions  they  draw  from  its 
existence  are  not  in  full  agreement  with  one  another.  Burali-Forti  inferred 
from  it  that  ordinal  numbers  are  not  comparable  with  one  another,  but  this 
is  refuted  by  Cantor's  theorem  in  §  168.  Jourdain  shews  that  W  is  normally 
ordered,  but  concludes  that  it  possesses  neither  order-type  nor  cardinal  number. 
He  describes  W  as  an  "  inconsistent "  aggregate,  a  term  which  had  been 
previously  employed  by  Cantor  in  the  same  connection.  It  would  thus  appear 
that  there  are  aggregates  which  have  no  order-type,  and  no  cardinal  number, 
and  that  the  above  aggregates  belong  to  such  class.  Such  aggregates  are 
called  inconsistent  aggregates,  in  virtue  of  the  fact  that  the  act  of  synthesis, 
by  which  they  are  formed,  cannot  be  carried  out  without  leading  to  a  con- 
ception which  contains  an  element  of  contradiction. 

It  was  pointed  out  by  Schoenflies^:  and  by  Bernstein  §,  that  the  hypothetical 
aggregate  TFlacks  one  property  that  belongs  to  all  its  segments,  and  to  all  normally 
ordered  aggregates,  viz.  that  a  new  normally  ordered  aggregate  can  be  formed 
by  the  addition  of  a  new  element,  of  higher  rank  than  all  those  of  the  original 
aggregate.  In  fact  the  first  principle  of  generation  of  the  successive  ordinal 
numbers  in  Cantor's  scheme  is  that  to  each  ordinal  number  there  follows  a 
next  one,  and  it  is  clearly  impossible  that  the  aggregate  of  all  ordinal  numbers 
can  have  an  ordinal  number  that  is  subject  to  this  principle.  It  thus  appears 
that  W  can  have  no  ordinal  number,  or  order-type.  This  being  so,  the  question, 
whether  W  can  be  properly  said  to  be  an  aggregate,  or  not,  depends  on  the 
precise  meaning  assigned  to  the  term  aggregate.  It  would  appear  desirable 
that  such  a  limitation  should  be  given  to  the  definition  of  the  term  that  such 
hypothetical  aggregates  as  W  would  not  fall  under  the  definition.  All  ordinal 
numbers  form  a  class,  and  thus  doubt  is  thrown  upon  the  advisability  of 
making  the  term  aggregate  synonymous  with  that  of  the  "  class  "  of  Logic. 
Moreover,  it  would  appear  desirable  that  the  definition  of  an  aggregate  should 
be  so  restricted  that  every  aggregate  has  a  cardinal  number  and,  if  normally 
ordered,  an  order-type.  It  has  been  pointed |i  out  by  Mirimanoff  that  the 
antinomy  of  Burali-Forti  would  arise,  even  if  the  existence  of  all  the  ordinal 
numbers  in  the  scheme  of  Cantor  be  not  admitted.  Instead  of  W,  the  hypo- 
thetical aggregate  of  all  existing  ordinal  numbers  could  be  contemplated,  and 
the  same  reasoning  would  shew  that  it  would  be  contradictory  to  ascribe  an 

*  Rend,  del  c.ircolo  mat.  di  Palermo,  vol.  xi,  p.  164. 

t  Phil.  Mag.,  1904;  see  also  Hilbert,  Jahresber.  der  Math.  Vereiniguny,  vol.  vm,  p.  184. 

I  Bericht.  2ler  Thtil,  p.  27. 

§  Bernstein,  Math.  Annalen,  vol.  LX,  p.  187. 

|j  See  L'enaeignemeni  mat.,  Jan.  to  March,  1917. 
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ordinal  number,  or  order-type,  to  this  aggregate.  Let  us  assume,  for  example, 
that  the  finite  ordinal  numbers  were  the  only  ordinal  numbers  whose  existence 
was  admitted;  it  would  then  be  contradictory  to  regard  the  totality  of  all 
these  ordinal  numbers  as  having  an  ordinal  number,  or  order-type;  in  fact 
this  totality  would  be  an  "  inconsistent "  aggregate,  in  the  sense  employed  by 
Cantor  and  Jourdain. 

The  mode  of  generalizing  the  ordinal  numbers  in  Cantor's  scheme  requires 
a  fresh  postulation  of  existence,  in  the  case  of  the  lowest  numbers  of  each  of 
the  successive  classes.  Although  the  contradiction  of  Burali-Forti  cannot  be 
invoked  to  shew  that  such  a  postulation  leads  to  contradiction,  as  long  as  we 
refrain  from  considering  such  an  aggregate  as  that  of  all  ordinal  numbers,  or 
of  all  existing  ordinal  numbers,  it  cannot  be  directly  shewn  that  at  no  stage 
of  the  successive  postulations  does  some  contradiction  arise.  It  would  appear 
then  that  the  validity  of  the  whole  scheme,  being  subject  to  the  possibility  of 
the  discovery  of  contradictions  at  some  stage  or  other,  the  existence  of  the 
numbers  in  Cantor's  scheme  must  be  regarded,  from  the  point  of  view  of  the 
Mathematician,  as  a  working  hypothesis.  The  utility  of  such  hypothesis  must 
be  judged  by  its  consequences,  and  its  applications  in  Analysis. 

195.  The  antinomy*  of  Russell  depends  upon  the  consideration  that  two 
species  of  aggregates  may  be  contemplated.  An  aggregate  M,  of  the  first 
species,  does  not  contain  M  itself  as  an  element ;  but  an  aggregate  M  of  the 
second  species  does  contain  M  itself,  as  an  element.  An  example  of  an  aggre- 
gate of  the  second  species  is  the  aggregate  of  all  aggregates.  It  is  clear  that 
an  aggregate  of  the  second  species  contains  an  element  which  is  itself  an 
aggregate  of  the  second  species.  It  follows  that  an  aggregate,  all  of  whose 
elements  are  aggregates  of  the  first  species,  is  itself  of  the  first  species.  The 
aggregate  which  contains,  as  elements,  all  aggregates  of  the  first  species  is 
a  notion  essentially  affected  with  contradiction.  For  it  must  be  itself  of 
the  first  species,  since  all  its  elements  are  of  that  species ;  on  the  other  hand, 
it  must  be  of  the  second  species,  since  it  must  contain  itself  as  element ; 
for  the  aggregate  itself  is  one  of  the  aggregates  of  the  first  species. 

It  thus  appears  that  the  aggregate  of  all  aggregates  of  the  first  species 
cannot  exist  as  a  notion  free  from  contradiction. 

This  contradiction  maybe  considered  in  relation  both  to  Cantor's  definition 
of  an  aggregate,  and  to  the  definition  of  an  aggregate  as  a  class.  In  accordance 
with  Cantor's  definition,  the  elements  of  an  aggregate,  being  logically  prior 
to  the  aggregate,  must  be  definable  in  a  manner  which  does  not  make  use  of 
the  aggregate  itself  in  the  definitions  ;  and  thus  it  would  appear  that  no 
aggregate  which  contained  itself  as  an  element  would  fall  under  Cantor's 

*  See  Principles  of  Mathematics,  vol.  i,  Chap.  x. 
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definition  of  an  aggregate.  On  the  other  hand,  the  definition  of  an  aggregate 
as  a  class  would  not  prima  facie  preclude  the  contemplation  of  such  a  con- 
ception as  the  class  of  all  classes. 

It  is  clear  that  the  proper  definition  of  an  aggregate  should  be  such  as  to 
exclude  all  hypothetical  aggregates  which  contain  elements  that  cannot  be 
defined  without  employing  the  aggregate  itself;  in  the  language  of  Poincare, 
non-predicative  definitions  should  be  excluded. 


196.  With  a  view  to  the  elucidation  and  generalization  of  the  antinomies 
of  Burali-Forti  and  Russell,  a  classification  of  aggregates  has  been  given*  by 
Mirimanoff,  according  to  the  modes  in  which  their  elements  are  composed  of 
other  elements.  The  elements  m,  of  an  aggregate  M,  may  be  simple  elements 
which  are  not  decomposable  into  other  elements.  On  the  other  hand,  an 
element  m  may  itself  be  an  aggregate  composed  of  elements  e ;  thus  m  =  {e}. 
Again,  the  elements  e,  of  which  m  is  composed,  may  themselves  be  aggregates 
composed  of  other  elements  e' ;  thus  e  =  [e'\. 

It  is  clear  that  this  process  of  decomposition  of  the  elements  of  an  aggre- 
gate into  aggregates  of  other  elements,  and  of  these  elements  into  aggre- 
gates of  other  aggregates,  may  proceed  until  only  simple  elements,  no  longer 
decomposable,  are  obtained ;  or  on  the  other  hand  the  process  may  go  on 
indefinitely. 

An  aggregate  is  said,  by  Mirimanoff,  to  be  an  ordinary  aggregate,  when  each 
element  m  is  such  that,  after  a  finite  number  of  decompositions,  simple  un- 
decomposable  elements  are  obtained.  An  aggregate  which  does  not  possess 
this  property  is  said  to  be  an  extraordinary  aggregate. 

Two  aggregates  are  said  to  be  isomorphic  when  (1)  they  are  equivalent, 
i.e.  when  there  is  a  (1,  1)  correspondence  between  their  elements,  and  (2)  when, 
m,  m  being  two  corresponding  elements,  which  are  decomposable,  there  is  also 
equivalence  between  the  two  aggregates  of  which  they  consist,  and  when  such 
equivalence  also  holds  in  case  further  decomposition  can  take  place,  and  so  on, 
so  long  as  any  continued  decomposition  of  elements  can  be  made. 

An  aggregate  is  said  to  be  of  the  first  species  if  it  is  not  isomorphic  with 
any  of  its  elements,  and  it  is  said  to  be  of  the  second  species  if  it  is  isomor- 
phic with  one  at  least  of  its  elements.  The  hypothetical  aggregate,  which 
consists  of  all  aggregates  of  the  first  species,  or  also  that  of  all  ordinary 
aggregates,  involves  the  same  contradiction  as  in  the  case  of  Russell's  anti- 
nomy. 

An  aggregate  of  the  second  species  is  an  extraordinary  aggregate,  but 
an  aggregate  of  the  first  species  may  also  be  extraordinary. 

*  L'enreignement  mat.,  Jan.  to  March,  1917,  p.  42. 
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THE   MULTIPLICATIVE   AXIOM. 

197.  An  axiom,  known  variously  as  the  "multiplicative  axiom,"  the  "prin- 
ciple of  Zermelo,"  and  the  "  general  principle  of  selection,"  which  can  be  ex- 
pressed in  several  equivalent  forms,  was  first  stated  explicitly  in  connection 
with  a  proof  given  by  Zermelo*,  in  1904,  that  every  aggregate  can  be  ordered 
in  normal  order;  although  it  had  been  implied  in  the  reasoning  of  various 
writers  in  connection  with  special  theorems  in  the  theory  of  sets  of  points,  and 
in  the  theory  of  functions.    The  axiom  has  been  stated  f  by  Zermelo  in  the 
following  form  : 

An  aggregate  8,  which  falls  into  an  aggregate  of  separate  parts  A,  B,  C,  ..., 
each  of  which  contains  at  least  one  element,  possesses  at  least  one  part  S,  which 
has  in  common  with  each  of  the  parts  A,  B,  C,  ...,  exactly  one  'element. 

In  accordance  with  the  definition,  given  in  §  149,  of  the  product  of  a 
finite,  or  infinite,  set  of  aggregates,  as  an  aggregate  of  which  each  element 
consists  of  an  association  of  elements,  one  from  each  of  the  aggregates  of  the 
given  set,  the  above  principle  is  equivalent  to  the  assertion  that  the  product  of 
the  aggregates  of  the  given  set  contains  at  least  one  element ;  and  thus  the  prin- 
ciple may  be  termed  the  "multiplicative  axiom,"  in  that  it  asserts  the  existence 
of  the  product. 

The  principle  can  be  stated  in  the  less  objective  form  that  a  choice  can  be 
made  of  a  single  element,  from  each  one  of  a  set  [M]  of  aggregates  M ;  an  aggre- 
gate thus  being  formed  by  an  infinite  number  of  such  acts  of  choice.  When 
viewed  in  this  manner  the  principle  is  'spoken  of  as  the  "  general  principle  of 
selection  "  (das  Prinzip  der  Auswahl). 

In  the  discussions  to  which  this  principle,  or  axiom,  has  given  rise,  much 
divergence  of  opinion  on  the  part  of  Mathematicians  has  emerged.  Borel,  who 
pointed  out  thatZermelo's  proof  that  an  aggregate  can  be  normally  ordered  simply 
establishes  that  this  theorem  is  equivalent  to  the  principle  of  selection,  distin- 
guished |  between  an  enumerable  and  an  unenumerable  set  of  acts  of  choice,  and 
rejected,  as  outside  the  domain  of  Mathematics,  all  reasoning  founded  upon 
the  supposition  of  an  unenumerable  set  of  acts  of  choice.  The  impossibility 
of  proving  the  principle  was  expressly  stated  by  Zermelo,  and  emphasized  by 
Borel,  and  by  Peano§.  Poincare,  in  the  course  of  his  discussion  ||  of  the 
"  Logistic "  of  Peano  and  Russell,  expressed  the  view  that  the  principle, 
although  incapable  of  proof,  is  an  indispensable  axiom. 

198.  By  some  Mathematicians,  the  employment  of  an  infinite  set  of  acts 
of  choice,  even  if  that  set  be  enumerable,  is  rejected  as  outside  the  domain  of 
Mathematics,  on  the  ground  that  the  hypothetical  entity,  the  existence  of 
which  is  asserted  in  such  a  case  by  the  principle,  is  not  properly  defined. 

*  Math.  Amialen,  vol.  LIX,  p.  514.  f  Math.  Annnlen,  vol.  LXV,  p.  110. 

I  See  Math.  Annalen,  vol.  LX,  p.  195.  §  Rivista  di  Mat.,  vol.  vnr,  No.  5,  p.  145. 

||  See  the  Revue  de  Metaphysique  et  de  Morale,  vols.  xm  and  xiv. 
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In  the  course  of  a  discussion*  by  Hadamard,  Borel,  Baire,  and  Lebesgue, 
the  distinction  was  taken  into  account  between  independent  acts  of  choice  and 
such  as  are  dependent  upon  acts  of  choice  that  have  been  previously  made. 

In  those  special  cases  in  which  the  aggregates  are  of  such  a  nature  that  it 
is  possible  effectively  to  define  an  aggregate  which  consists  of  one  element 
belonging  to  each  of  the  given  aggregates,  the  principle  is  not  required  as  an 
axiom,  but  such  special  cases  may  be  regarded  as  instances  in  which  the  truth 
of  the  principle  is  verified.  In  a  case  in  which  we  are  unable  to  give  such 
effective  definition,  the  principle  amounts  to  an  assertion  of  the  existence  of 
the  aggregate  in  question,  apart  from  any  question  of  effective  definition,  and 
at  least  asserts  that  we  are  warranted  to  make  the  same  deductions  as  if  we 
were  in  possession  of  such  effective  definition.  In  this  connection  it  may  be 
pointed  out  that  there  is  a  certain  relativity  in  the  notion  of  effective  defini- 
tion, as  has  already  been  shewn  (§  63)  in  the  case  of  the  definition  of  numbers 
of  the  arithmetic  continuum ;  in  fact,  the  possibility  of  effective  definition  of 
an  object  is  relative  to  an  apparatus  of  definition  which  cannot  be  considered 
as  once  for  all,  and  finally,  fixed. 

Even  if  the  number  of  aggregates  from  which  the  selection  is  to  be  made 
be  finite,  there  remains  a  question  as  to  the  scope  of  the  axiom.  It  is  suf- 
ficient to  consider  whether,  and  in  what  sense,  it  is  always  possible  to  select 
a  single  element  from  a  single  given  aggregate.  Whether  this  can  be  done 
effectively  will  depend  .upon  the  mode  in  which  the  aggregate  is  defined. 
For  the  possibility  of  such  choice,  it  must  in  the  first  place  be  known  that 
the  aggregate  is  not  a  null-aggregate,  i.e.  that  it  contains  at  least  one  ele- 
ment. The  assumption  of  this  knowledge  being  made,  the  effective  determi- 
nation of  an  element  may  be  impracticable,  or  difficult;  but  this  will  not 
affect  its  logical  possibility.  It  would  appear,  however,  that  there  may  be  cases 
in  which  even  the  logical  possibility  of  determination  cannot  be  demonstrated; 
and  in  such  cases  recourse  must  be  had  to  the  axiom  of  existence,  if  an 
element  of  the  aggregate  in  question  is  required  for  the  purpose  of  any  pro- 
cess of  reasoning  in  which  such  element  is  employed. 

Some  of  the  objections  raised  to  the  axiom  amount  to  the  assertion  that 
the  axiom  is  meaningless,  if  it  is  interpreted  as  asserting  the  existence  of  an 
aggregate  of  which  the  elements  cannot,  even  in  theory,  be  effectively  defined. 
The  difference  of  opinion  upon  the  point,  as  to  whether,  or  not,  an  aggregate 
can  be  said  to  exist,  when  none  of  its  elements  can  be  effectively  defined, 
would  appear  to  reveal  a  fundamental  difference  of  view  on  a  matter  of 
Ontology,  which  cannot  be  resolved  within  the  domain  of  Mathematics,  as 
it  involves  an  irreconcilable  divergence  of  attitude  in  general  Philosophy. 
It  was  pointed  out  by  Hadamard  (loc.  cit.)  that  the  real  question  at  issue  is 

*  "  Cinque  lettres  sur  la  theorie  des  ensembles,"  Bull,  de  la  Soc.  Mat.  de  France,  vol.  xxxm, 
p.  261. 
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whether  it  is  possible  to  demonstrate  the  existence  of  mathematical  entities 
which  cannot  be  precisely  defined;  a  question  which  Hadamard  himself  answers 
in  the  affirmative,  though  with  this  answer  the  other  writers  do  not  appear  to 
be  in  agreement.  By  some  Mathematicians,  the  principle  is  accepted  in  a 
pragmatic  sense,  as  a  useful  instrument  of  Mathematical  research,  and  as  a 
guide  which  suggests  results  that  may  be  \7erifiable  by  other  means,  although 
the  principle  cannot  be  regarded  as  standing  upon  an  absolutely  firm  basis. 

The  case  for  the  acceptance  of  the  axiom  rests  partly  upon  the  fact  that 
the  postulation  of  its  validity  had  been  implicitly  made,  as  self-evident,  by 
many  writers,  in  various  investigations  in  the  theory  of  sets  of  points,  and  in 
the  theory  of  functions,  before  it  was  stated  explicitly;  and  that  it  has  since 
not  only  proved  fruitful  in  many  further  developments  of  those  theories,  but 
that  results  obtained  by  its  use  have  been  verified  in  many  important  cases. 
A  detailed  analysis  has  been  given*  by  Sierpinski  of  general  results  in  the 
theories  of  sets  of  points,  and  of  functions,  which  cannot  be  established 
without  the  aid  of  the  axiom,  and  of  other  cases  in  which  the  employment  of 
the  axiom  is  not  necessary.  The  method  of  G.  H.  Hardy  (§  186),  for  the 
construction  of  a  set  of  points  of  cardinal  number  K,  is  a  case  in  which  the 
principle  of  selection  is  required,  since  there  exists  an  indefinite  number  of 
sequences  of  the  ordinal  numbers,  preceding  a  limiting  ordinal  number  7,  of 
each  of  which  7  is  the  limiting  number.  It  is  not  possible  to  give  a  norm 
by  which  these  sequences  can  be  determined. 

199.  A  limiting  point  P,  of  a  set  of  points  G,  has,  in  §  52,  been  defined 
as  a  point  such  that  in  every  neighbourhood  of  P  there  are  points  of  G.  It 
has  been  pointed  out  by  Sierpinski  that,  in  the  general  case,  the  inference, 
from  this  definition,  that  there  exists  a  sequence  P1}  P2,  ...  Pn, ...  of  points 
of  0,  which  converges  to  P  as  sole  limiting  point,  so  that  the  distance  PPn 
converges  to  zero,  as  n  is  indefinitely  increased,  cannot  be  made  without 
having  recourse  to  that  case  of  Zermelo's  axiom  in  which  the  selections  are 
made  from  an  enumerable  sequence  of  sets  of  points. 

For  simplicity,  let  us  consider  the  case  in  which  G  is  a  linear  set;  this 
differs  in  no  essential  respect  from  the  case  in  which  G  is  a  set  in  p  dimen- 
sions, where  p  >  1.  Let  us  suppose  that  the  point  x  =  0  is  a  limiting  point 
of  G,  in  accordance  with  the  definition  of  §  52.  Let  Gn  be  that  part  of  G 

which  consists  of  points  x  that  satisfy  the  conditions  -     -=  ^    x  <  -  .     Thus 

n  +  l  n 

Gn  is  in  the  two  half-closed  intervals  (  -     ^  ,  -  J ,  ( , 7  )  •   Consider  now 

Vn  + 1    n)    \    n       n+l/ 

the  sequence  of  sets  Gl,G2,  ...  Gn>...  of  which  the  sum  is  the  relevant  part  of  G. 
The  axiom  of  Zermelo  asserts  the  existence  of  a  set  of  points  Pr,  Pr+l,  ...PH,...', 

*  Bull,  de  I'acad.  des  Sciences  de  Cracovie,  April — May,  1918. 


i98-20oj  Multiplicative  axiom  251 

such  that  Pn  belongs  to  Gn,  for  all  the  values  r,  r+  1,  .  ..  of  n.  Unless  it  is 
possible  to  define  by  a  norm  such  a  set  of  points,  its  existence  can  only  be 
assumed  in  virtue  of  the  axiom.  This  set  of  points  is  a  sequence  such  as  is 
required,  which  converges  to  the  point  x  =  0. 

There  is  one  important  case  in  which  Zermelo's  axiom  is  not  required, 
viz.  when  G  contains  a  known  enumerable  set  H,  for  example  the  set  of 
rational  points,  that  is  everywhere  dense  in  G.  The  point  Pn  may  then  be 
defined  as  that  point  of  H,  of  lowest  rank  when  H  is  arranged  in  enumerable 
order  of  type  to,  which  belongs  to  Gn. 

It  has  been  shewn  conversely  by  Sierpinski  that  the  following  theorem 
follows  as  a  consequence  of  the  assumption  that,  if  P  be  a  limiting  point  of 
a  set  G,  in  accordance  with  the  definition  in  §  52,  there  exists  a  sequence  {Pn}  , 
of  points  of  G,  which  converges  to  P  as  its  sole  limiting  point. 

If  Gj,  6r2,  ...  Gn,  ...  be  sets  of  points,  no  two  of  which  have  an  element  in 
common,  there  exists  a  sequence  Pl}  P2,  ...  Pn,  ...  of  points,  such  that  each  one 
belongs  to  one  of  the  sets  of  {Gn}  and  that  no  two  of  them  belong  to  the  same 
set  of  {Gn}. 

By  means,  for  example,  of  the  transformation 


the  set  Gn  can  be  put  into  correspondence  with  a  set  Qn,  all  the  points 

which  are  interior  to  the  interval  [  —  -  ]  .    Let  0  denote  the  set  of  points 

\n  '  n  +  ij 

£,  given  as  the  sum  of  all  the  sets  Q1}  Q2>  ...  Qn,  ...  ;  then  the  point  £  =  0  is 
a  limiting  point  of  the  set  Q. 

Now  let  it  be  assumed  that  there  exists  a  sequence  {qm},  of  points  of  Q, 
which  converges  to  £  =  0  as  its  sole  limiting  point.  We  may  suppose  that 
there  is  only  one  of  these  points  in  any  one  set  Qn,  for,  if  gm  belongs  to 
a  set  of  {Qn}  to  which  any  one  of  the  points  q1}  q2,  ...  qtn_1  belongs,  we  may 
remove  qm  from  the  sequence  of  points.  Let  us  suppose  that  qn  belongs  to 
Qn',  then  by  applying  the  inverse  transformation  to  that  of  x  into  £,  we 
have  a  sequence  of  points  P1}  P2,  ...  Pn,  ...  which  belong  to  the  sets 

(TJ,  (TO,  ...  Gii,  ••• 

respectively. 

200.  The  question  of  the  justification  of  the  assumption  that  every 
infinite  aggregate  contains  an  enumerable  aggregate  as  a  part  is  related  to 
the  principle  of  selection.  It  was  assumed  by  Cantor  that  it  is  always  possible 
to  pick  out  of  a  given  aggregate,  successively,  elements  that  correspond  to 
the  numbers  of  the  integer  sequence.  In  a  large  class  of  cases  this  can  be 
effected  by  means  of  a  suitable  norm,  but  in  the  general  case  an  infinite 
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number  of  acts  of  choice,  each  of  which  is  affected  by  the  choices  already 
made,  are  required.  In  case,  however,  a  given  aggregate  M  can  be  divided 
into  an  enumerable  set  of  parts  Ml,  M2,  M3,  ... ,  no  two  of  which  have  an 
element  in  common,  one  element  may  .be  selected  from  each  of  these  parts, 
in  accordance  with  the  principle  of  selection;  and  thus  the  existence  of  the 
enumerable  part  of  M  follows  as  a  consequence  of  the  principle.  In  case 
M  is  an  aggregate  of  points,  in  one  or  more  dimensions,  an  enumerable  set  of 
non-overlapping  cells,  or  intervals,  may  be  defined,  each  of  which  contains  a 
part  of  M;  and  we  may  take  the  parts  Mlt  M2,  M3,  . . . ,  of  M,  to  be  those  parts 
that  are  contained  in  the  cells,  or  intervals,  of  the  set. 

MULTIPLE  CORRESPONDENCE. 

201.  A  propos  of  a  criticism*,  by  Levi,  of  Bernstein's  proof  (see  §  185), 
that  the  aggregate  of  all  closed  sets  in  the  ^-dimensional  continuum  has  the 
power  c  of  the  continuum,  Bernstein  j-  has  endeavoured  to  avoid  the  difficulty 
involved  in  the  use  of  a  correspondence  which  cannot  be  effectively  defined, 
by  introducing  the  conception  of  "multiple  equivalence."   Thus,  if  there  are 
two  aggregates  M  and  N,  for  which  an  aggregate  $  =  {<£}  of  reversible  (1,1) 
correspondences  </>  exists,  in  which  no  element  is  special  (ausgezeich.net),  then 
the  two  aggregates  are  said  to  be  multiply  equivalent.    The  cardinal  number 
</>  =/is  then  termed  the  multiplicity  of  the  correspondence.    In  case  the  multi- 
plicity is  unity,  the  aggregates  are  said  to  have  a  one-valued  correspondence. 
Tffe  difficulty  of  this  conception  is  the  same  as  that  in  the  multiplicative  axiom 
itself,  viz.,  that  the  aggregate  </>  employed  is  such  that  no  single  element  of  it 
is  capable  of  definition,  and  that  the  elements  are  consequently  indistinguish- 
able from  one  another. 

THE  NORMAL  ORDERING  OF  AN  AGGREGATE. 

202.  It  was  regarded  by  Cantor  as  highly  probable  that  every  aggregate 
is  capable  of  being  normally  ordered;  and  this  is  equivalent  to  the  theorem 
that  every  transfinite  cardinal  number  is  an  aleph-number.    If  this  principle 
could  be  accepted  as  valid,  the  particular  theorem  would  follow,  that  the  arith- 
metic continuum  is  capable  of  being  normally  ordered;  and  the  only  question 
which  would  remain  open,  as  regards  this  aggregate,  would  be  as  to  which 
particular  aleph-number  is  the  cardinal  number  c  of  the  continuum. 

No  proof  of  the  correctness  of  his  surmise  was  published  by  Cantor  himself, 
but  a  proof  was  given  by  Jourdain  J,  which  depends  upon  an  argument  in  which 
the  "inconsistent"  aggregate  W  (§  194?)  was  employed  to  shew  that  no  cardinal 
number  can  be  greater  than  every  aleph-number.  Apart  from  the  employment 
of  the  concept  of  the  aggregate  W,  which  has  been  shewn  to  be  affected  with 
contradiction,  this  proof  involved  implicitly  the  principle  of  selection. 

*  Lomb.  1st.  Rend.,  (2),  1902,  p.  863.  t  Gottinger  Xachrichten,  1904,  p.  557. 

J  Phil.  Mag.,  (6),  vol.  vn,  p.  67;  see  also  Math.  Annalen,  vol.  LX,  p.  465. 
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A  second,  and  more  cogent,  proof  was  given*  by  Zermelo,  in  connection 
with  which  the  axiom  associated  with  his  name  was  explicitly  stated  as 
necessary  for  the  purpose  of  his  proof.  He  has  later  published  f  a  new  form 
of  the  proof,  together  with  a  reply  to  criticisms  of  his  earlier  proof. 

It  is  assumed  that,  in  each  part  M',  of  a  given  aggregate  M,  one  element 
m',  called  the  special  (ausgezeichnetes)  element  of  M',  can  be  chosen.  The 
possibility  of  doing  this  for  all  the  parts  of  M  follows  from  the  axiom.  Each 
element  M',  of  {M1}, corresponds  to  a  special  element  m'  which  belongs  to  M; 
and  this  particular  mode  of  distributing  the  elements  of  M  upon  the  elements 
of  [M'\  is  called  a  "covering"  7;  the  employment  of  a  particular  "covering"  7  is 
essential  to  the  proof.  A  7-aggregate  is  then  defined  as  follows: — Let  My  be  a 
normally  ordered  aggregate  consisting  of  different  elements  of  M,  such  that,  if 
a  be  any  arbitrarily  chosen  element  of  My,  and  if  A  be  the  segment  of  My 
defined  by  a,  which  segment  consists  of  all  the  elements  of  My  that  precede  a, 
then  a  is  always  the  special  element  of  M  —  A.  Every  such  aggregate  Aly  is  a 
7-aggregate.  If  every  element  of  M  which  occurs  in  a  7-aggregate  be  called  a 
7-element  of  M,  it  is  shewn  that  the  aggregate  Ly,  of  all  7-elements,  can  be 
so  ordered  that  it  is  itself  a  7-aggregate,  and  contains  all  the  elements  of 
the  original  aggregate  M.  It  follows  that  M  can  be  normally  ordered. 

It  will  be  observed  that  the  theorem  that  any  aggregate  can  be  normally 
ordered  does  not  assert  the  possibility  of  effectively  carrying  out,  in  the  case 
of  any  given  aggregate,  the  process  of  arranging  the  elements  of  the  aggregate 
in  normal  order.  The  theorem  must  rather  be  regarded  as  an  existence  theorem, 
deducible  from  certain  postulates  which  include  the  multiplicative  axiom. 

In  the  later  form  of  his  proof,  Zermelo  states  the  theorem  in  the  following 
manner: 

If  there  corresponds  to  each  part  of  the  aggregate  M,  one  element  of  that 
part,  its  special  element,  then  the  aggregate  U  (M),  which  contains  as  its 
elements  all  parts  of  M,  has  one  and  only  one  element  M,  such  that  there 
corresponds  to  each  part  P,  of  M,  one  and  only  one  element  P0,  of  M,  such  that 
P0  has  P  as  a  part,  and  an  element  of  P  as  its  special  element.  The  aggregate 
M  is  normally  ordered  by  means  of  M. 

The  existence  of  the  aggregate  U  (M)  as  a  valid  conception  is  assumed. 

THE  COMPARABILITY  OF  AGGREGATES. 

203.  From  the  theorem  that  every  aggregate  can  be  normally  ordered,  it 
would  follow  that  the  cardinal  number  of  any  infinite  aggregate  must  be  an 
aleph-number,  or  the  equivalent  theorem  of  the  "comparability  of  aggregates," 
that  of  any  two  aggregates  one  at  least  is  equivalent  to  a  part  of  the  other.  It 

*  Math.  Annalen,  vol.  XLIX,  p.  514.  t  Math.  Annalen,  vol.  LXV,  p.  107. 
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has  been  proved*  by  Hartog,  conversely,  that  if  the  comparability  of  every  pair 
of  aggregates  be  assumed,  the  theorem  that  any  aggregate  can  be  normally 
ordered  can  be  deduced.  This  theorem  is  established  on  the  basis  of  a  system 
of  axioms  given f  by  Zermelo. 

It  follows  from  Hartog's  result,  combined  with  Zermelo's  theorem,  that  the 
three  principles  of  selection,  of  comparability  of  two  aggregates,  and  of  the 
normal  ordering  of  an  aggregate,  are  completely  equivalent  to  one  another, 
in  that  any  two  of  them  can  be  deduced  from  the  third.  In  case  none  of  the 
three  principles  is  assumed  to  be  valid,  Hartog's  investigation  proves  that  there 
exists  no  aggregate  whose  cardinal  number  is  greater  than  all  the  aleph- 
numbers.  The  proof  depends  upon  a  classification  of  all  the  normally  ordered 
aggregates  of  which  the  elements  are  also  elements  of  a  given  aggregate  M \ 
the  validity  of  the  conception  of  the  aggregate  of  all  such  normally  ordered 
aggregates,  corresponding  to  a  given  aggregate  M,  being  assumed, 

204.  The  discussion  of  the  fundamentals  of  the  theory  of  aggregates,  given 
in  the  latter  part  of  the  present  chapter,  in  which  the  divergence  of  view 
amongst  Mathematicians,  as  regards  some  parts  of  the  theory,  has  been  in- 
dicated, makes  it  clear  that,  for  some  time  to  come,  a  critical  attitude  is  likely 
to  be  maintained  in  some  quarters  as  regards  the  theory  of  the  transfinite,  at 
least  in  some  of  its  developments.  The  extreme  sceptical  attitude  has  been 
expressed  by  Poincare,  in  the}  words  "II  n'y  a  pas  d'infini  actuel;  les  Cantoriens 
Font  oublie,  et  ils  sont  tombes  dans  la  contradiction."  Even  if  the  general 
theory  of  classes  of  order-types  and  of  aleph-numbers  be  regarded  by  many 
Mathematicians  as  still  in  the  region  of  speculation,  nevertheless  the  debt 
which  Mathematical  Science  owes  to  the  genius  of  G.  Cantor  will  not  be 
materially  diminished.  The  fundamental  distinction  between  enumerable  and 
unenumerable  aggregates,  the  interpretation  of  the  arithmetic  doctrine  of 
limits,  the  ordinal  theory  of  the  arithmetic  continuum,  and  the  conception 
of  the  ordinal  numbers  of  the  second  class,  with  their  application  to  the  theory 
of  sets  of  points,  remain  as  permanent  acquisitions,  independently  of  the  accept- 
ance of  the  whole  of  the  higher  developments  of  the  abstract  theory  of  aggre- 
gates. This  order  of  ideas  has  become  indispensable,  for  purposes  of  exact 
formulation,  in  Analysis  and  in  Geometry;  it  is  constantly  receiving  new  appli- 
cations, owing  to  its  admirable  power  of  providing  the  language  requisite  for 
expressing  results  in  the  theory  of  functions  with  the  highest  degree  of  rigour 
and  generality.  Cantor's  creations  have  rendered  inestimable  service  in  formu- 
lating the  limitations  to  which  many  results  in  Analysis,  formerly  supposed 
to  be  universally  valid,  are  subject.  The  outlying  parts  of  the  theory,  although 
they  open  up  a  most  interesting  field  of  investigation,  from  which  much  may 
be  hoped  for  in  the  future,  do  not  appear  as  yet  to  be  comparable  in  importance, 

*  Math.  Annalen,  vol.  LXXVI,  p.  438.  t  Math.  Ann  ale  n,  vol.  i,xv,  p.  261. 

£  Revue  de  Metaph.  et  de  Morale,  vol.  xrv,  p.  316. 
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for  the  general  purposes  of  Analysis,  with  those  parts  which  are  accepted  as 
fully  established  by  all  except  the  most  extreme  finitists.  The  fact  that  the 
general  theory  of  aleph-numbers,  as  an  abstract  development  of  the  theory  of 
order,  has  received  but  few  applications  in  the  theory  of  functions,  differentiates 
it  from  the  theory  of  normally  ordered  enumerable  aggregates,  which  has  now 
become  a  most  useful  instrument  of  discovery  in  the  theory  of  functions  of  one 
or  more  variables.  All  aggregates  of  points  of  a  continuum,  which  are  defined 
by  the  methods  in  ordinary  use,  have  either  the  power  of  the  aggregate  of 
rational  numbers,  or  else  that  of  the  arithmetic  continuum  itself.  The  theories 
of  these  two  kinds  of  aggregates,  including  as  they  do  a  complete  arithmetic 
theory  of  limits,  would  thus  appear  to  afford  a  sufficient  basis  for  all  the  ordinary 
parts  of  Analysis. 


CHAPTER  V 
FUNCTIONS  OF  A  REAL  VARIABLE 

205.  IF  we  suppose  that  an  aggregate  of  real  numbers  is  defined,  the 
aggregate  being  either  enumerable,  or  of  the  power  of  the  continuum  ;  such  an 
aggregate  is  said  to  be  the  domain,  or  field,  of  a  real  variable.  It  is  necessary 
for  the  purposes  of  Analysis  to  be  able  to  make  statements  applicable  to  each 
and  every  real  number  of  the  aggregate,  and  which  shall  be  valid  for  any 
particular  number  that  may,  at  will,  be  selected.  This  is  done  by  employing  the 
real  variable,  denoted  by  some  symbol  other  than  those  used  to  denote  real 
numbers ;  and  the  essential  nature  of  the  variable  consists  in  its  being 
identifiable  with  any  particular  number  of  its  domain.  The  symbols  used  for 
denoting  variables  differ  from  those'employed  in  the  case  of  numbers  in  being 
non-systematic.  Operations  involving  real  variables  #(1),  x(z],  x(A],  ...,  with,  or 
without,  particular  numbers,  are  carried  out  in  conformity  with  the  same  formal 
laws  as  hold  in  the  arithmetic  of  real  numbers.  The  result  of  any  such  operation 
is  itself  a  variable  with  a  domain  of  its  own,  which  may,  or  may  not,  be  identical 
with  that  of  any  of  the  constituent  variables. 

The  numbers  being  used  to  designate  in  the  usual  manner  the  points  of 
a  set  on  a  straight  line,  the  variable  may  then  be  taken  to  refer  to  the  points 
of  the  set. 

If  the  given  set  of  points  be  bounded,  in  the  sense  explained  in  §  47,  then 
the  domain  of  the  variable  is  said  to  be  limited,  or  bounded.  When  the  domain 
of  the  variable  is  not  limited,  it  is  said  to  be  unlimited,  or  unbounded,  in  one  or 
in  both  directions. 

The  variable  is  said  to  be  continuous,  in  a  given  interval  (a,  b),  when  all  the 
points  of  the  interval,  including  a  and  b,  belong  to  the  domain  of  the  variable. 
If  the  points  a,  b  do  not  belong  to  the  domain,  but  every  internal  point  of  the 
interval  does  so  belong,  the  variable  is  said  to  be  continuous  in  the  open  interval 
(a,  b),  or  within  the  interval  (a,  b).  The  corresponding  definitions  apply  to  the 
case  of  an  aggregate  of  any  number  p  of  dimensions,  which  is  regarded  as  the 
domain  of  n  independent  variables  xw,  x('2},  . . .  x(v}.  In  this  case  a  closed,  or  an 
open,  connex  domain  of  p  dimensions  takes  the  place  of  a  closed,  or  an  open, 
interval.  A  variable  point  (xw,x(~],  ...#w),  of  such  a  domain,  is  often  con- 
veniently represented  symbolically  by  a  single  variable  x. 
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The  term  "variable"  has  been  commonly  associated  with  the  conception  of 
a  point  moving  in  a  straight  line  or  in  a  curve.  It  has  however  been  pointed 
out  in  the  course  of  the  discussions  of  the  continuum,  contained  in  the  earlier 
Chapters,  that  the  continuum  cannot  legitimately  be  regarded  as  a  synthetic 
construction  fonned  by  a  set  of  points  determined  successively.  Successive 
determination  is  applicable  only  in  the  case  of  any  enumerable  sequence  which 
may  be  defined  within  the  continuum,  and  such  a  sequence  may  represent  a 
succession  of  positions  of  a  point  moving  in  a  straight  line.  It  is  however  un- 
necessary to  proceed  to  a  detailed  analysis  of  the  conception  of  motion,  because 
the  Theory  of  Functions  has  no  need  of  the  conception  of  temporal  succession. 
The  theory  makes  continual  use  of  simply  infinite  sequences  determined  in  the 
continuum;  and  any  such  sequence  may  be  regarded  as  a  series  of  distinct 
determinations  of  the  variable  in  which  the  elements  are  in  logical  succession, 
each  element  after  the  first  being  preceded  and  succeeded  by  definite  elements. 

THE  FUNCTIONAL  RELATION. 

206.  If,  to  each  point  of  the  domain  of  the  independent  variable  x,  there 
be  made  to  correspond  in  any  manner  a  definite  number,  so  that  all  such  numbers 
form  a  new  aggregate  which  can  be  regarded  as  the  domain,  or  field,  of  a  new 
variable  y,  this  variable  y  is  said  to  be  a  (single-valued)  function  of  x.  The 
variables  «,j/are  called  the  independent  and  the  dependent  variable  respectively; 
and  the  functional  relation  between  these  variables  maybe  denoted  symbolically 
by  the  equation  y=f(x).  In  this  definition  no  restriction  is  made  a  priori  as 
regards  the  mode  in  which,  corresponding  to  each  value  of  x,  the  value  of  y  is 
assigned ;  and  the  conception  of  the  functional  relation  contains  nothing  more 
than  the  notion  of  determinate  correspondence  in  its  abstract  form,  free  from 
any  implication  as  to  the  mode  of  specification  of  such  correspondence.  In  any 
particular  case,  however,  the  special  functional  relation  must  be  assigned  by 
means  of  a  set  of  prescribed  rules  or  specifications,  which  may  be  of  any  kind 
that  shall  suffice  for  the  determination  of  the  value  of  y  corresponding  to  each 
value  of  x.  Such  rules  may,  in  any  particular  case,  be  embodied  in  a  single 
arithmetic  formula,  from  which  the  value  of  y  corresponding  to  each  value  of  x 
is  arithmetically  determinate ;  or  the  rules  may  be  expressed  by  a  set  of 
arithmetic  formulae  each  one  of  which  applies  to  a  part  of  the  domain  of  the 
independent  variable.  In  case  these  formulae  be  reducible  to  a  set  of  mutually 
independent  formulae,  that  set  must  be  a  finite  one.  In  case  the  function  be 
defined  by  an  enumerably  infinite  set  of  formulae,  each  applicable  to  a  part 
of  the  domain,  these  formulae  cannot  be  mutually  independent,  but  must  be 
subject  to  some  norm. 

It  should  be  observed  that,  when  for  any  particular  value  of  x  the  corre- 
sponding value  of  y  is  given  by  means  of  any  arithmetic  formula,  the  numerical 
value  of  y  is  in  general  only  formally  determinate ;  for  only  a  finite  number 
IT.  17 
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of  elements  of  a  convergent  aggregate  which  defines  the  value  of  y  can  in 
general  be  actually  found,  and  thus  the  value  of  y  can  be  specified  only  to 
any  required  degree  of  approximation,  but  it  is  still  regarded  as  perfectly  de- 
terminate. 

The  domain  of  x  consisting  of  a  set  (P)  of  points,  the  values  of  y,  in  the 
case  of  a  given  functional  relation  y=f(x),  may  be  represented  by  points  Q 
on  a  linear  interval;  all  such  points  forming  a  linear  set  (Q).  The  set  (Q)  is 
said  to  be  the  functional  image  of  the  set  (P),  determined  by  the  function  f(x) ; 
to  each  point  of  (P)  there  corresponds  a  single  point  of  (Q),  if/(#)  be  a  single- 
valued  function ;  but  to  each  point  of  (Q)  there  may  correspond  a  finite,  or  an 
infinite,  number  of  points  of  (P). 

If  cc  be  employed  to  denote  a  variable  point  P  (x(l},  x®,  ...  x{&}  in  a  p- 
dimensional  field,  all  that  has  been  said  will  apply  to  the  case  of  a  function  y,  of 
p  independent  variables.  The  linear  set  (Q)  will  be  the  functional  image  of  the 
^-dimensional  set  (P). 

The  perfectly  general  definition  of  a  function  which  has  been  given  above 
is  the  culmination  of  a  process  of  evolution  which  has  proceeded  largely  in 
connection  with  the  study  of  the  representation  of  functions  by  means  of 
trigonometrical  series.  By  the  older  mathematicians,  a  function  was  under- 
stood to  mean  a  single  formula,  at  first  usually  only  a  power  of  the  variable ; 
but  afterwards  it  was  regarded  as  defined  by  any  one  analytical  expression, 
and  was  extended  by  Euler  to  include  the  case  in  which  the  function  is  given 
implicitly  by  a  formal  relation  between  the  two  variables.  In  connection  with 
the  problem  of  the  determination  of  the  forms  of  vibrating  strings,  which  led 
to  the  discussion  of  functions  represented  by  trigonometrical  series,  the  con- 
ception arose  of  a  single  function  defined  in  different  intervals  by  means  of 
different  analytical  expressions.  The  arbitrary  nature  of  a  function  given  by 
a  graph  was  distinctly  recognized  by  Fourier ;  thus  the  notion  of  a  function 
was  emancipated  from  the  restriction  that  an  a  priori  representation  of  it  is 
necessary  by  a  single  formula. 

The  idea  that  a  function  can  be  defined  completely,  in  the  case  when  the 
domain  of  the  independent  variable  is  a  finite  continuous  interval,  by  means  of 
a  graph,  arbitrarily  drawn,  leaves  out  of  account  the  essentially  unarithmetic 
nature  of  geometrical  intuition.  A  curve  that  is  drawn  is  indistinguishable 
by  the  perception  from  a  sufficiently  great  number  of  discrete  points ;  and 
thus  all  that  is  really  given  by  an  arbitrarily  drawn  graph  consists  of  more  or 
less  arithmetically  inexact  values  of  the  ordinates  at  those  points  of  the  ic-axis 
at  which  we  are  able  to  measure  ordinates.  In  order  that  a  curve  may  be 
really  known,  sufficiently  to  serve  for  the  purpose  of  defining  a  function,  a  series 
of  rules  must  be  prescribed,  by  means  of  which  the  values  of  the  ordinates  can 
be  formally^  determined  at  all  points  of  the  #-axis.  It  is  sometimes  said,  in  order 
to  illustrate  the  generality  of  the  functional  relation,  that  a  function  is  definable 


206, 20?]  The  functional  relation  259 

in  the  form  of  a  table  which  specifies  values  of  y  corresponding  to  values  of  x, 
this  table  being  of  a  perfectly  arbitrary  character.  The  inadequacy  of  such 
illustration  is  manifest,  if  we  consider  that,  even  if  the  table  were  an  endless 
one,  as  has  been  remarked  in  §  187,  no  aggregate  of  y- values  can  be  defined  by 
an  endless  set  of  numbers,  apart  from  the  production  of  a  norm  by  which  those 
numbers  are  defined.  Moreover,  even  if  the  table  were  subject  to  a  definite 
norm,  it  could  only  theoretically  suffice  to  define  a  function  of  a  variable  whose 
domain  consisted  of  an  enumerable  set  of  points,  and  would  be  totally  in- 
applicable to  the  case  in  which  the  variable  has  a  continuous  domain,  unless 
some  special  restrictive  assumptions  as  to  the  nature  of  the  function  be  in- 
troduced, by  means  of  which  the  values  of  the  function  are  made  determinate 
at  the  remaining  points  of  the  continuous  domain. 

It  thus  appears  that  an  adequate  definition  of  a  function  for  a  continuous 
interval  (a,  b)  must  take  .the  form  first  given  to  it  by  Dirichlet*,  viz.  that  y  is 
a  single-valued  function  of  the  variable  x,  in  the  continuous  interval  (a,b),  when 
a  definite  value  of  y  corresponds  to  each  value  of  x  such  that  a  ^  x  £  b,  no  matter 
in  what  form  this  correspondence  is  specified.  A  particular  function  is  actually 
defined  when  y  is  arithmetically  defined  for  each  value  of  ac. 

No  elaborate  theory  is  required  for  functions  which  retain  their  complete 
generality,  in  accordance  with  the  abstract  definition  given  above,  since  few 
deductions  of  importance  can  be  made  from  that  definition  which  will  be  valid 
for  all  functions.  When,  however,  the  nature  of  a  function  is  in  some  way  re- 
stricted, either  in  the  whole  domain,  or  in  the  neighbourhoods  of  special  points 
of  that  domain,  there  is  room  for  the  development  of  a  theory  which  shall  deal 
with  the  peculiarities  that  follow  from  such  restriction  upon  the  complete 
generality  of  functions. 

207.  The  functions  defined  in  accordance  with  the  above  definition  are 
known  as  single-valued  functions,  since,  to  each  value  of  x  in  the  domain  of  x, 
there  corresponds  a  single  value  of  y.  The  definition  may  be  so  generalized  as 
to  be  applicable  to  multiple-valued  functions.  This  is  done  by  replacing  the 
requirement  that,  to  each  value  of  x  in  the  domain  of  x,  there  shall  correspond 
a  single  value  of  y,  by  the  more  general  statement  that,  to  each  value  of  x 
there  shall  correspond  a  definite  aggregate  of  values  of  y.  The  aggregate  of 
values  of  y  may,  for  any  particular  value  of  x,  consist  of  a  finite,  or  of  an  in- 
finite, set  of  numbers.  A  particular  function  is  then  defined  when  the  aggregate 
of  values  of  y  is  arithmetically  determinate  for  each  value  of  x,  in  accordance 
with  the  criteria  for  the  determinancy  of  a  linear  aggregate  which  have  been 
developed  in  the  theory  of  aggregates.  Although  the  Theory  of  Functions,  as 
developed  in  the  present  work,  is  mainly  concerned  with  single- valued  functions, 
it  is  necessary,  or  at  least  convenient,  in  the  course  of  the  examination  of  par- 
ticular functions  and  classes  of  functions,  to  make  use  of  auxiliary  functions 

*  See  Dirichlet's  Werke,  vol.  i,  p.  135. 

17—2 


260  Functions  of  a  real  variable  [CH.  v 

which  are  multiple- valued  at  certain  points  of  the  domain  of  the  independent 
variable.  Moreover,  Dirichlet's  definition,  in  its  original  form,  has  the  incon- 
venience that  it  excludes  from  the  category  of  functions  those  represented  by 
analytical  expressions  which,  for  particular  values  of  the  independent  variable, 
cease  to  define  a  single  number.  For  example,  an  infinite  series  which,  for  par- 
ticular values  of  the  variable,  either  diverges,  or  ceases  to  converge  to  a  single 
definite  limit,  does  not  define  a  single-valued  function  in  accordance  with 
Dirichlet's  definition,  for  the  whole  domain  of  the  variable,  and  yet  it  is  con- 
venient so  to  extend  the  meaning  of  the  term  function  that  a  function  may  be 
defined  for  the  whole  domain  by  such  a  series. 

The  distinction  has  been  considered  in  detail  by  Brod6n*  between  those 
functions  for  which  the  relation  between  the  dependent  variable  y  and  the 
independent  variable  x  is  formally  the  same  for  the  whole  domain  of  x,  and 
those  functions  for  which  the  domain  of  x  is  divisible  into  a  plurality  of  parts, 
for  which  the  forms  of  the  relation  between  x  and  y  are  different.  He 
remarks  that  the  distinction  is  one  relating  to  the  character  of  the  definitions 
rather  than  to  the  nature  of  the  functions  themselves;  in  the  former  case  the 
function  is  said  to  be  homonomically  defined,  and,  in  the  latter  case,  to  be 
heteronomically  defined.  Broden  has  given  a  formal  proof  that,  when  a 
function  is  heteronomically  defined,  the  number  of  parts  into  which  the 
domain  of  x  can  be  divided,  so  that  the  relations  of  y  to  x  in  any  one  part  are 
completely  independent  of  the  relations  in  the  other  parts,  must  be  finite. 

The  Theory  of  Functions  of  a  Real  Variable  is  concerned  with  the 
classification  of  functions,  according  as  they  possess  various  special  properties, 
e.g.  continuity,  differentiability,  integrability,  throughout  the  domain  of  the 
independent  variable,  or  at,  or  near,  special  points  which  form  part  of  that 
domain.  The  theory  requires  the  introduction  of  precise  arithmetical  defini- 
tions of  the  scope  and  meaning  of  these  characteristic  properties,  and  is  largely 
concerned  with  the  determination  of  criteria  which  shall  suffice  to  decide,  in 
the  case  of  a  function  defined  in  some  special  manner,  what  can  be  inferred 
as  regards  the  possession  by  such  function  of  properties  other  than  those  that 
are  immediately  apparent  from  the  definition  itself.  Much  of  the  theory  is- 
concerned  with  a  minute  examination  of  functions,  and  of  classes  of  functions,, 
which  possess  properties  that  do  not  occur  in  the  case  of  those  functions  which 
are  employed  in  ordinary  analysis  and  in  its  applications  to  Geometry  and 
Physics ;  and  the  theory  has  in  consequence  frequently  been  described  as  the 
Pathology  of  Functions.  It  appears  however,  from  the  theory  itself,  that 
many  of  those  peculiarities,  which  from  the  point  of  view  of  traditional  Analysis, 
would  be  described  as  exceptional,  have  no  claim  to  be  so  described ;  that  in 
fact  it  is  in  the  functions  of  ordinary  Analysis  that  the  abnormalities  really 
occur,  such  functions  occupying  an  exceptional  position  in  relation  to  a, 

*  Ada  Univ.  Lund.,  vol.  xxxni,  1897,  "  Functionentheoretische  Bemerkungen  und  Satze." 
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scientific  analysis  of  the  properties  of  functions  in  general.  An  important 
result  of  the  labours  of  those  who  have  developed  the  modern  theory  of 
functions  of  a  real  variable  has  been  that  restrictive  assumptions,  which  had 
previously  been  unconsciously  made  in  the  processes  of  ordinary  Analysis,  have 
been  placed  in  a  clear  light  ;  and  it  has  been  shewn  that  modes  of  reasoning 
which  had  their  origin  in  an  uncritical  application  of  ideas  obtained  from 
spatial  intuition  would  fail  to  yield  correct  results  when  applied  to  cases  of 
unrestricted  generality  ;  the  unsoundness  of  the  logical  basis  of  such  reasoning 
being  thereby  demonstrated. 

In  ordinary  Analysis  the  domain  of  the  independent  variable  is  taken 
to  be  a  limited,  or  unlimited,  continuous  interval.  In  the  theory  of  functions, 
on  the  other  hand,  it  has  been  found  advantageous  to  consider  also  the 
properties  of  functions  defined  for  a  domain  which  is  not  a  continuous  one. 
It  appears,  in  particular,  that  a  non-dense  perfect  set  of  points,  or  more 
generally  any  closed  set,  is  well  suited  to  be  the  domain  of  a  function,  inasmuch 
as,  for  such  domains,  the  principal  peculiarities  of  functions,  such  as  continuity, 
differentiability,  &c.,  are  capable  of  precise  formulation,  and  can  serve  for 
purposes  of  classification,  exactly  as  in  the  case  of  functions  defined  for  a 
continuous  domain.  Much  of  the  recent  progress  in  the  subject  is  due  to 
a  recognition  of  the  parity  of  all  perfect  sets  of  points,  not  only  as  regards 
their  internal  structure,  but  also  in  relation  to  their  fitness  for  forming  the 
domains  in  which  functions  can  be  defined,  without  loss  of  any  of  the  charac- 
teristic properties  that  serve  for  the  classification  of  functions  of  a  real  variable, 
or  of  several  such  variables. 

EXAMPLES. 

1.    A  function/^)  may  be  defined  for  the  interval  (0,  1)  as  follows  : 

for->^^—  —  .    f(x}  =  -x*.  and  for  #  =  0,  f(0)  =  l, 
n~        %  +  1  n 

n  denoting  any  positive  integer.  In  this  case,  the  norm  by  which  the  function  is  defined  is 
expressible  by  an  enumerable  set  of  formulae  which  are  however  not  independent  of  one 
another. 


2.    A  function  may  be  defined  as  follows  : 

for  !>* 
and  in  general, 


for  !>*•>     ,  /(,;)=!  ;    for      >#>,  /(*)=|  ;    for      >*  >    , 


for  -  >  r  >• 

31    =A  = 


where  Pn  denotes  the  nth  of  the  prime  numbers  2,  3,  5,  7,  ....     If  the  function  is  to  be 
defined  at  the  point  ,r=0,  this  may  be  done  by  assigning  to  /(O)  any  arbitrarily  chosen 

value  we  please.     It  will  be  observed  that  the  values  ~  are  in  this  case  not  representable 

*fi 

by  a  single  expression  which  involves  n  and  .r  only,  though  they  are  definite  single-valued 
functions  of  these. 
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3.    Any  number  x,  of  the  interval  (0,  1),  except  0,  can  be  uniquely  expressed  in  the  form 


where  bn  has,  for  every  value  of  n,  one  of  the  values  0,  1,  and  it  is  stipulated  that  all  the  bn 
are  not  to  be  zero  from  and  after  any  fixed  value  of  n. 

i 

A  multiple-valued  function*  may  be  defined  by  7/=#»l,  where  n  has  all  positive  integral 
values  for  which  bn=  1.  This  is  a  homonomic  definition,  although  no  analytical  expression 
of  a  unitary  character  can  be  given  for  the  representation  of  y. 

FUNCTIONS  OF  A  VARIABLE  AGGREGATE. 

208.  The  conception  of  the  functional  relation,  as  it  has  been  described 
above,  has  been  restricted  to  that  of  a  determinate  correlation  of  the  points  of 
a  domain,  linear,  plane,  spatial,  or  ^-dimensional,  forming  the  domain  of  the  in- 
dependent variable,  with  a  set  of  numbers,  or  points,  forming  the  domain  of  the 
dependent  variable.  The  modern  development  of  the  notion  of  the  functional 
relation  has  extended  it  to  the  -more  general  case  in  which  a  prescribed 
family  of  objects  of  any  specified  kind  takes  the  place  of  the  field  of  the 
independent  variable.  If  {0}  denotes  a  family,  or  aggregate,  of  objects  defined 
in  accordance  with  some  norm,  and  if  we  have  also  a  norm  by  means  of  which 
a  definite  number  is  made  to  correspond  to  each  member  0,  of  the  family,  a 
variable  y,  identifiable  with  each  such  number,  is  regarded  as  a  function  of 
the  objects  0  of  the  given  family.  The  ordinary  definition  considered  above 
of  a  function  of  a  variable  x,  or  of  a  number  p  of  variables,  is  the  particular 
case  of  the  more  general  conception  of  a  function,  which  arises  when  the 
objects  0  are  single  numbers,  or  points,  belonging  to  a  prescribed  linear,  or 
p-dimensional,  domain.  Another  case  of  the  functional  relation  which  has 
recently  become  of  considerable  importance  in  Analysis  is  that  in  which  each 
of  the  objects  0  consists  of  an  enumerable  set  of  numbers  x(l),  #(2),  ...  x(n}  ,  .... 
Thus  the  functions  considered  may  be  regarded  as  functions  of  a  point  in 
space  of  an  infinite  number  of  dimensions  ;  the  field  of  the  independent 
variable  being  any  specified  domain  in  such  space.  .The  particular  case  in 
which  the  numbers  x(l),  x&),  ...  x(n],  ...  are  so  restricted  that  the  sum  of  their 
squares  forms  a  convergent  series,  in  which  case  the  domain  of  the  point  is 
said  to  be  in  Hilbertian  space,  has  recently  been  applied  in  connection  with 
the  theory  of  integral  equations.  A  theory  of  functions  in  Hilbertian  space 
can  be  developed  in  this  connection. 

In  arithmetical  analysis  the  most  general  conception  of  a  function  is  that 
in  which  the  independent  variable  has  as  its  field  a  family  of  sets  of  points, 
either  linear,  plane,  spatial,  or  jo-dimensional.  Having  given,  by  means  of 
an  adequate  definition,  a  family  of  sets  of  points,  either  discrete  or  continuous, 
a  function  of  the  sets  is  defined  by  a  norm  which  assigns  a  definite  number 
to  each  set  of  points  of  the  given  family.  An  important  case  is  that  in  which 

*  Brod^n,  loc.  cit.,  p.  4. 
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the  sets  of  points  are  all  measurable ;  the  measure  of  a  set,  as  defined  in 
accordance  with  the  theory  developed  in  Chapter  ill,  is  then  a  function  of 
the  variable  set  in  a  given  family  of  such  measurable  sets. 

In  connection  with  the  modern  theory  of  integration,  more  general  functions 
of  measurable  sets  of  points  will  emerge.  The  theory  of  functions  of  curves  of 
prescribed  families,  developed  by  Volterra  and  others,  comes  under  this  head. 

The  Calculus  of  Variations  deals  with  the  theory  of  maxima  and  minima 
of  functions  of  continuous  sets  of  points ;  the  functions  taking  the  analytical 
form  of  integrals  which  determine  a  set  of  numbers  corresponding  to  the 
continuous  sets  of  points  belonging  to  prescribed  families. 

In  the  present  work  the  ordinary  case  of  functions  of  one  variable,  or  of  a 
finite  number  of  variables,  will,  for  the  most  part,  be  the  only  one  dealt  with, 
but  incidentally,  as  in  the  case  of  the  theory  of  integration,  functions  of  sets  of 
points  will  be  considered. 

The  present  Chapter  is  concerned,  in  the  first  instance,  with  the  properties 
of  functions  of  a  single  variable,  of  which  the  domain  is  an  interval,  or  other 
linear  set  of  points.  The  case  of  functions  of  two  or  more  variables,  i.e.  of 
points  belonging  to  a  domain  in  plane,  or  higher  dimensional,  space,  will  also 
be  considered;  especially  in  those  respects  in  which  the  properties  of  such 
functions  are  not  an  immediate  extension  of  the  properties  of  functions  of  a 
single  variable. 

THE  UPPER  AND  LOWER  BOUNDARIES  AND  LIMITS  OF  FUNCTIONS. 

209.  A  function  y  =/  (x),  being  defined  for  the  domain  of  x,  we  have  seen 
that  the  values  of  y  form  a  set  of  points,  determined  as  usual  upon  a  linear 
interval,  which  is  called  the  functional  image  of  that  set  of  points  which  forms 
the  domain  of  x.  In  case  the  set  of  points,  which  represent  the  values  of  y, 
is  a  bounded  set,  the  function  f  (x}  is  said  to  be  bounded  in  the  domain  of  x. 

When  the  set  of  values  of  y  is  bounded,  either  boundary  may  be  a  limiting 
point,  or  only  an  extreme  point,  of  the  set.  For  convenience,  and  in  accordance 
with  usage,  the  terms  upper  limit  and  lower  limit  will  be  applied  to  denote 
the  upper  and  lower  boundaries  of  the  derivative  of  this  set.  Thus  the  upper 
and  lower  limits  are  the  extreme  limiting  points  of  y  in  the  domain  of  x. 
Accordingly  we  may  say  that : 

If  the  set  of  points  y,  which  represents  the  functional  image  of  a  function 
f  (x),  defined  for  a  given  domain  of  x,  have  an  upper  and  a  lower  boundary, 
tlien  the  function  f  (x)  is  said  to  be  a  bounded  function,  and  the  boundaries  are 
said  to  be  the  upper  and  lower  boundaries  of  f  (x)  in  the  domain  of  x.  The 
upper  and  lower  limits  of  the  function  are  the  upper  and  lower  boundaries  of 
the  limiting  points  of  the  set  of  points  y. 

The  upper  limit  of  a  function  y  (a;)  for  its  given  domain  may,  or  may  not, 
be  attained,  i.e.  there  may,  or  may  not,  be  a  value  of  x,  in  the  domain  of  x, 
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for  which  the  functional  value  is  equal  to  the  upper,  or  to  the  lower  limit. 
This  is  the  case  whether,  or  not,  the  upper,  or  the  lower,  limit  is  identical  with 
the  upper,  or  the  lower,  boundary.  An  upper,  or  a  lower,  boundary  that  is  not 
identical  with  the  upper,  or  the  lower,  limit,  must  be  attained. 

In  case  y  have  no  upper  boundary,  or  no  lower  boundary,  for  the  domain 
of  x,  the  function  f(x)  is  said  to  be  an  unbounded  function.  In  this  case  there 
exist  values  of  the  function,  of  one  sign,  or  of  both  signs,  which  are  numerically 
greater  than  any  arbitrarily  assigned  number  A. 

When  y  has  no  upper  boundary  in  the  domain  of  x,  the  function  is  said  to 
have  the  improper  boundary  +  oo  ,  in  the  domain  of  x.  Similarly,  when  y  has 
no  lower  boundary,  it  is  said  to  have  the  improper  boundary  —  oc .  It  is 
frequently  said,  for  the  sake  of  brevity,  that  the  upper  or  the  lower  boundary 
of  the  function  is  infinite. 

The  excess  of  the  upper  boundary  of  a  function,  in  its  domain,  over  its  lower 
boundary,  is  called  the  fluctuation  (Schwankung)  of  the  function  in  the  domain. 

In  case  the  upper  or  the  lower  boundary  is  infinite,  the  function  is  said  to 
have  an  infinite  fluctuation  in  its  domain. 

Instead  of  the  whole  of  the  domain  of  x,  we  may  consider  that  part  which 
lies  in  a  given  interval  (a,  b),  closed  or  open,  and  the  preceding  definitions 
may  be  applied  to  this  portion  of  the  domain ;  thus  : 

The  upper  boundary  of  a  function  f  (x)  in  an  interval  (a,  b),  closed  or  open, 
is  the  upper  boundary  of  the  function  when  only  those  points  of  the  domain  of 
x  which  lie  in  the  interval  (a,  b)  are  taken  into  account.  A  similar  definition 
applies  to  the  case  of  the  upper  limit,  and  also  to  the  cases  of  the  lower 
boundary  and  the  lower  limit. 

The  excess  of  the  upper  boundary  of  f(x),  in  the  closed,  or  open,  interval 
(a,  b),  over  its  lower  boundary  in  that  interval,  is  called  the  fluctuation  of  f(x) 
in  the  closed,  or  open,  interval  (a,  b). 

In  case  one,  or  both,  of  the  boundaries  is  infinite,  the  fluctuation  of  the 
function  in  (a,  6)  is  said  to  be  infinite. 

If  the  upper  boundary  of  f(x)  in  (a,  b)  is  attained,  i.e.  if  there  exists  a 
value  c  of  x  such  that  f(c)  is  the  upper  boundary,  where  c  is  a  point  of  the 
domain  in  (a,  b),  then  this  upper  boundary  is  said  to  be  the  upper  extreme  of 
the  function  in  (a,  b)',  and 'a  similar  definition  applies  to  the  lower  extreme. 

If  the  end-points  a,  b  of  the  interval  be  left  out  of  account,  in  case  they 
belong  to  the  domain  of  x,  the  fluctuation  is  called  the  fluctuation  in  the  open 
interval  (a,  b).  This  is  sometimes  spoken  of  as  the  inner  fluctuation  of  the 
function  in  (a,  b),  and  is  determinable  as  the  limit  of  the  fluctuation  in  the 
interval  (a  +  e,  b  —  e),  when  e  is  indefinitely  diminished. 

Precisely  similar  definitions  are  applicable  to  the  case  of  a  function  of  p 
variables ;  a  cell  (a,  b)  taking  the  place  of  an  interval  (a,  b). 
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210.  In  accordance  with  the  definition  which  has  been  given  for  a  function 
in  any  domain,  the  value  of  the  function  at  any  particular  point  of  the  domain 
has  a  definite  finite  value.  It  may  happen  that  a  point  P,  of  the  domain 
of  x,  is  such  that,  in  any  arbitrarily  small  neighbourhood  of  P,  either  the 
upper,  or  the  lower,  boundary  of  the  function,  or  both,  may  not  exist  ;  so  that, 
in  any  neighbourhood  of  P  however  small,  there  exist  functional  values 
numerically  greater  than  any  number  that  may  be  assigned.  In  that  case,  the 
point  P  is  said  to  be  an  infinity,  or  point  of  infinite  discontinuity  of  the 
function',  although  the  function  has  a  definite  finite  value  at  the  point 
P  itself. 

Although  f(x)  is  not  properly  defined  at  a  point  P  (#0),  unless  a  definite 
numerical  value  be  assigned  to  /*(#„),  nevertheless  an  improper  definition  of  the 

functional  value  at  the  point  P  is  sometimes  admitted,  of  the  form  -j-  —  -  =  0  ; 


in  this  case  the  function  is  said  to  possess  an  infinity  at  P.  This  infinite  dis- 
continuity is  said  to  be  removable,  provided  that,  when  the  functional  value 
at  P  is  altered  to  some  finite  value,  the  function  have  finite  upper  and  lower 
boundaries  in  a  sufficiently  small  neighbourhood  of  P. 

There  are  other  cases  in  which  an  improper  definition  of  the  functional  value 
at  a  point  sc0  of  the  domain  of  so  is  admitted.  The  function  may  be  defined  by 
means  of  an  infinite  series,  of  which  the  terms  are  given  functions  of  x.  This 
series  may  diverge  at  the  particular  point  #0;  but  it  is  nevertheless  frequently 
convenient  to  regard  the  series  as  defining  the  function  for  all  values  of  x  in 
some  interval  which  includes  x0.  The  functional  value  at  x0  is  then  regarded 
as  infinite,  +  oo  ,  or  —  oc  . 

In  accordance  with  strict  arithmetic  theory,  the  function  is  regarded  as 
undefined  at  points  where  no  definite  finite  value  of  the  function  is  specified. 
For  the  most  part,  in  the  theory  which  will  be  developed  here,  this  restriction 
will  be  rigorously  adhered  to.  It  will  be  found,  however,  that  in  cases,  such  as 
in  the  theory  of  infinite  series,  in  which  it  is  convenient  to  admit  improper 
definitions  of  functions  at  particular  points,  no  essential  change  in  the  main 
results  of  the  theory  will  be  necessary. 

In  some  cases  it  will  be  found  convenient  to  remove  the  restriction  that 
at  each  point  of  the  domain  of  the  independent  variable  the  function  shall  be 
single-valued,  and  to  define  the  function  in  such  a  manner  that,  at  single  points 
or  at  each  point  of  some  set  belonging  to  the  domain  of  x,  the  function  may 
possess  finite,  or  infinite,  multiplicity.  It  will  be  found,  in  the  cases  in  which 
it  is  convenient  to  make  this  extension  of  the  meaning  of  a  function,  that  no 
difficulty  arises  as  regards  the  use  of  results  primarily  applicable  to  functions 
which  are  single-valued  at  all  points  of  the  domain  of  the  variable,  without 
exception. 
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THE  CONTINUITY  OF  FUNCTIONS. 

211.  Let  the  domain  of  the  independent  variable  x  be  continuous,  and 
either  bounded  or  unbounded;  and  denote  the  function  y,  at  the  point  x,  by 
f(x).  When  the  domain  of  x  consists  of  the  points  of  a  continuous  interval 
(a,  b),  that  interval  will  be  taken  as  closed  unless  the  contrary  is  stated. 

The  function  f(x)  is  said  to  be  continuous  at  the  point  a.  of  the  domain  of  x, 
if,  corresponding  to  any  arbitrarily  chosen  positive  number  e  whatever,  a  positive 
number  8,  dependent  on  e,  exists,  such  that  j/(a  +  7;)  —/(a)  |  <e,  for  all  positive 
or  negative  values  of  77  which  are  numerically  less  than  8,  and  which  are  such  that 
a  +  rj  is  in  the  domain  of  x.  At  an  end-point  of  a  limited  domain,  the  values  of  ?? 
will  have  one  sign  only. 

In  accordance  with  this  definition,  a  neighbourhood  (a  —  8,  a  +  8)  of  the 
point  a  exists,  such  that  the  function,  at  any  point  in  the  interior  of  this 
interval,  differs  numerically  from  its  value  at  or,  by  less  than  e.  It  follows  that 
the  inner  fluctuation  of  the  function  in  (a  —  8,  a  +  8)  is  less  than  2e,  and  it  is 
obvious  that  the  fluctuation  in  any  interval  interior  to  (a  —  B,  a  +  8)  is  less  than 
2e.  The  condition  of  continuity  of  the  function  f(x)  at  the  point  a.  may  thus 
be  stated  to  be  that  a  neighbourhood  of  the  point  can  be  determined  in  which 
the  fluctuation  of  the  function  is  as  small  as  lue  please. 

The  above  definition  of  continuity  at  a  point  is  that  due  to  Cauchy,  and 
is  a  particular  case  of  the  definition  of  continuity  for  a  function  of  any  number 
of  variables.  If  we  denote  by  f(xw,  x®, ...  xip]]  a  function  of  the  variables 
xa),  x(Z] ,  ...  x(p],  defined  for  any  continuous  domain,  the  condition  of  continuity 
at  the  point  (a'1',  a(2), ...  a.(p])  is  that,  corresponding  to  every  arbitrarily 
chosen  positive  number  e,  a  number  8,  dependent  on  e,  can  be  found,  such 
that  j/(a(I)  +  A(1),  a(2)  +  A(->,  ...  «<*>  +  h^>)-f(aw,  a(2),  ...  a(P>)  <  e,  provided 
h<u,  h(~}, ...  h(p}  have  any  values  which  are  numerically  less  than  8.  In  this 
case,  a  neighbourhood  (a  —  8,  a  +  8),  of  a  point  of  a  linear  domain,  is  replaced 
by  an  equilateral  cell,  which  is  a  square  in  the  case  of  a  two-dimensional 
domain. 

The  definition  of  continuity  has  been  stated  by  Heine*  in  a  form  which 
depends  upon  the  notion  of  a  convergent  sequence  of  numbers  or  of  points. 
Let  (P1}P<,,...Pn,...)  be  a  convergent  sequence  of  points  in  the  given  domain, 
and  of  which  P  is  the  limiting  point.  The  condition  of  continuity  of  the 
function  at  P  is  that,  for  every  such  convergent  aggregate  which  has  P  as 
limiting  point,  the  numbers/(P1),/(P2), . ..f(Pn),  •  - .  form  a  convergent  sequence 
which  represents  the  number  /(P).  It  is  easily  seen  that  a  function  which  is 
continuous  in  accordance  with  Cauchy's  definition  is  also  continuous  in  accord- 
ance with  that  of  Heine.  For  if  the  sequence  P1}  P2,  . . .  Pn, . . .  of  points  of  the 

*  Crelle's  Journal,  vol.  LXXIV,  p.  182. 
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domain  of  x  converges  to  P(o),  we  may  define  a  sequence  of  numbers 
*?! » »?2.  •  •  •  Vn,  •  •  •  corresponding  to  these  points,  such  that  tjn  ~  0,  as  n  **  oo  ,  and 
such  that  PPn<77n;  it  then  follows  by  Cauchy's  definition  that,  for  a  suffi- 
ciently large  value  of  n,  \f(x}  —  /(«)  |  <  6,  for  all  the  values  of  x  which  corre- 
spond to  the  points  Pn,  Pn+\,  ••••  Since  this  holds  for  all  values  of  e,  we  see 
that  the  sequence  {/(-?«)}  converges  to /(a). 

That  a  function  /(#),  which  is  continuous  at  the  point  a,  in  accordance 
with  Heine's  definition,  is  also  continuous  in  accordance  with  that  of  Cauchy, 
cannot*  be  proved  in  the  general  case  without  the  employment  of  Zermelo's 
axiom  (see  §  197).  To  establish  this  equivalence,  with  the  assumption  of  the 
axiom,  let  Hie>  (77)  denote  the  set  of  points  within  the  interval  (a  —  77,  a  +  77)  for 
which  \f(x}  —/(a)  |  =  e.  Assuming  that  there  exists  a  set  of  points  H(e)  (77),  of 
the  domain  of  x,  with  e  fixed,  however  small  77  may  be,  (which  will  be  the  case  if 
Cauchy's  definition  is  not  satisfied),  we  may  consider  a  monotone  sequence  {i)n} 
of  values  of  77  converging  to  zero.  It  is  impossible  that  the  sets  H(e}  (t]n)  should 
be  all  identical,  from  and  after  a  fixed  value  of  n,  for  the  intervals  (a  —  r)n,a.  +  yn) 
have  no  points  common  to  an  infinite  number  of  them  except  the  point  a  which 
does  not  belong  to  the  sets.  We  may  consequently,  without  loss  of  generality, 
assume  that  the  sets  H((]  (ijn)  are  all  different  from  one  another.  Considering 
the  sets  #»  (77,)  - #«>  (77,),  #w  fa)  -H«>  fa), . . .  H«  fa)-H^fal+1), . . .,  a  set  of 
points  p^po,--. pn,  •  •  •  exists  in  accordance  with  the  axiom,  such  that  pn  belongs 
to  H(e}  (r]n)  —  H(f}  (?7n+i),  for  every  value  of  n.  We  have  now  \f(pn)  —/(a)  I  =  e, 
for  every  value  of  n;  and  thus  a  sequence  [pn]  of  points  converging  to  a  exists, 
such  that  {f(pn)}  does  not  converge  to  /(a),  and  hence  Heine's  definition  of 
continuity  at  the  point  a  is  not  satisfied.  It  follows  that,  if  Heine's  definition 
is  satisfied,  the  set  H(f]  (77)  cannot  exist  for  all  values  of  77;  and  thus  that  for  a 
sufficiently  small  value  of  77,  Cauchy's  condition  \f(x)  —  f(a)  \  <  e  is  satisfied  if 
x  —  a  <  77,  for  all  points  x,  of  the  domain  of  x. 

A  function  which  is  not  continuous  at  a  point  a  may  satisfy  the  condition 
that,  in  a  neighbourhood  of  a/ on  the  right,  the  fluctuation  of  the  function  is 
as  small  as  we  please  when  the  neighbourhood  is  small  enough;  the  function  is 
then  said  to  be  continuous  on  the  right  at  a.  A  similar  definition  applies  to 
continuity  on  the  left. 

A  function  is  said  to  be  continuous  in  the  interval  (a,  b)  if  it  satisfies  the 
condition  of  continuity  at  every  point  in  the  interval. 

For  a  function  that  is  continuous  in  the  closed  interval  (a,  b),  the  upper 
boundary  is  identical  with  the  upper  limit,  and  the  lower  boundary  is  identical 
with  the  lower  limit  of  the  function  in  the  closed  interval.  This  does  not 
necessarily  hold  for  an  open  interval. 

*  See  Sierpinski,  Comptes  Rendus,  Paris,  vol.  CLXIII,  p.  088. 
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212.  The  domain  of  the  independent  variable  has  hitherto  been  considered 
to  be  continuous;  it  is  however  clear  from  a  consideration  of  the  definition  of 
continuity,  either  in  Cauchy's  or  in  Heine's  form,  that  the  definition  is  applic- 
able in  case  the  domain  of  the  independent  variable  is  not  continuous,  but 
consists  of  any  set  of  points  which  contains  limiting  points  that  belong  to  the 
set.  It  is,  of  course,  only  at  such  a  limiting  point  that  the  question  of  con- 
tinuity arises;  for,  at  an  isolated  point  of  the  aggregate,  there  are  no  values  of 
the  function,  other  than  that  at  the  point  itself,  in  any  sufficiently  small  neigh- 
bourhood of  the  point.  If  P  be  a  point  of  the  domain  of  x  which  is  a  limiting 
point  of  the  domain,  the  function  is  continuous  at  P  when  Cauchy's  definition 
of  continuity  is  satisfied,  those  points  only,  in  any  neighbourhood  of  P,  being 
taken  into  account,  which  belong  to  the  domain  for  which  the  function  is 
defined.  If  the  function  be  continuous  at  every  limiting  point  of  the  domain 
of  x  it  is  said  to  be  continuous  relatively  to  the  given  domain;  and  thus  the 
notion  of  continuity  of  a  function  is  applicable  whatever  be  the  domain  of  the 
independent  variable,  except  when  it  consists  of  an  isolated  set  of  points. 

Let  Pj,  P2,  P3,  ...  be  a  convergent  sequence  of  points  of  the  domain  of  x, 
of  which  Pw  is  the  limiting  point;  and  let  Pw  also  belong  to  the  domain  of  x. 
Supposing  the  functional  image,  corresponding  to  f(x),  to  contain  the  points 
Qi>  Q<2>  Qs>  •••  which  correspond  to  P1}  P2,  P3,  ...,  let  Ql}  Q2,  Q3,  .,.  form  a  con- 
vergent sequence  of  which  the  limiting  point  Qu  corresponds  to  Pw.  If  this 
condition  be  satisfied,  however  the  convergent  sequence  be  chosen  in  the  domain 
of  x,  the  aggregate  (Q),  of  values  of  y,  is  said  to  be  a  continuous  functional 
image  of  the  domain  (P)  of  x. 

It  is  clear  that  the  continuous  functional  image  of  a  closed  domain  is  itself 
closed.  For,  corresponding  to  the  points  of  a  convergent  sequence  (Qu  Q2,  Q3, . . .), 
in  (Q),  there  corresponds  an  aggregate  (P1;  P2J  P3, . . .),  in  (P),  which  must  have 
at  least  one  limiting  point,  -and  all  such  limiting  points  belong  to  the  domain 
(P),  and  must  correspond  to  the  limiting  point  of  (Q^ ,  Q2,  Qs ,...),  which  therefore 
belongs  to  the  aggregate  (Q).  Moreover  if  (P)  be  perfect,  the  continuous 
functional  image  (Q)  is  perfect  also:  for,  corresponding  to  any  particular  point 
Q',  of  (Q),  we  may  take  a  point  P',  of  (P),  for  which  Q'  is  the  image.  P'  is  the 
limiting  point  of  a  convergent  sequence  of  points  of  (P),  and  to  this  convergent 
sequence  there  corresponds  a  convergent  sequence  in  (Q),  of  which  Q'  is  the 
limiting  point.  It  has  thus  been  shewn  that  (Q)  contains  no  isolated  points, 
and  therefore  (Q)  is  perfect. 

If  (Q)  be  a  continuous  functional  image  of  the  closed  set  (P),  and  if  only 
one  point  of  (P)  correspond  to  each  point  of  (Q),  then  (P)  is  a  continuous 
functional  image  of  (Q). 

To  the  points  of  any  convergent  sequence  (Q1}  Q.,, ...),  in  (Q),  of  which  Q^ 
is  the  limiting  point,  there  corresponds  a  convergent  sequence (P1,P,i,P3, ...), 
in  (P),  of  ^hich  Pw  is  the  limiting  point;  and  Pw  is  the  functional  image  of  Qm. 
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213.  The  theorem  has  been  given  by  Weierstrass  that,  if  (a,  b}  be  any 
interval  containing  points  of  the  domain  of  a  function,  then  one  point  at  least 
exists  in  the  interval,  which  is  such  that,  in  any  arbitrarily  small  neighbourhood 
of  that  point,  the  upper  boundary  of  the  function  is  the  same  as  the  upper 
boundary  of  the  function  in  the  whole  interval  (a,  b}. 

This  theorem  holds  for  all  functions  without  restriction,  and  it  makes  no 
difference  whether  the  whole  interval  (a,  6),  or  only  a  set  of  points  in  that 
interval,  belongs  to  the  domain  of  the  independent  variable. 

Let  a  system  {Dn}  of  nets  with  half-closed  meshes  be  fitted  on  to  the  interval 
(a,  b).  If  M  be  the  upper  boundary  of  the  function  in  that  part  of  the  domain  of 
the  independent  variable  contained  in  the  closed  interval  (a,  b),  it  is  clear  that 
in  none  of  the  meshes  d1}  of  Z)j ,  can  the  upper  boundary  of  the  function  be  greater 
than  M,  and  that  in  one  at  least  of  these  meshes  the  upper  boundary  of  the 
function  must  be  M.  Take  the  mesh  c^  of  lowest  rank  in  Dl  for  which  this  is 
the  case.  In  one  at  least  of  the  meshes  of  D*  that  are  contained  in  d1}  the  upper 
boundary  of  the  function  must  be  M;  if  there  are  more  meshes  than  one  that 
satisfy  this  condition,  take  that  one  which  is  of  lower  rank  than  the  others,  and 
let  this  mesh  be  d.2.  Proceeding  in  this  manner,  we  obtain  a  sequence  of  meshes 
di,d2,  ...  dn,  ...  each  containing  the  next,  and  such  that  in  each  of  them  the 
upper  boundary  of  the  function  for  all  the  parts  of  the  domain  in  the  mesh  is  M. 
The  single  point  P  defined  by  this  sequence  of  meshes  satisfies  the  prescribed 
condition ;  for  any  neighbourhood  A,  of  P,  will  contain  all  the  meshes  dn,  from 
and  after  some  value  of  n\  therefore  in  this  neighbourhood  the  upper  boundary 
of  the  function  is  M. 

A  similar  result  holds  for  the  lower  boundary  of  a  function. 

It  is  clear  that  this  proof  can  be  applied,  in  the  case  of  functions  of  more 
variables  than  one,  to  prove  the  corresponding  theorem  that  one  point  exists 
in  a  domain  such  that  in  its  arbitrarily  small  neighbourhood  the  upper  boundary 
of  such  a  function  is  the  same  as  in  a  cell. 

In  the  case  of  a  function  which  is  continuous  in  the  interval  (a,  b),  it 
follows  from  the  foregoing  theorem  that  the  upper  and  lower  boundaries  of  the 
function  in  (a,  b)  are  both  finite,  and  thus  that  a  function  which  is  continuous 
in  an  interval  is  bounded  in  that  interval. 

For  consider  that  point  aclt  in  (a.  b),  in  the  arbitrarily  small  neighbourhood 
of  which,  (#1  —  €,%!  +  e),  the  upper  boundary  has  the  same  value  as  for  the  whole 
interval  (a,  b).  Since  the  function  is  continuous  at  xl}  corresponding  to  a  given 
number  B,  a  number  e  can  be  determined  such  that  \f(%)—f(%i)  \  <  8,  provided 
x  lies  in  (x^  —  e,  xl  +  e);  consequently  the  upper  boundary  off  (as}  in  this  interval 
must  be  finite,  and  hence  f(x)  has  a  finite  upper  boundary  in  (a,  b}.  It  may 
be  shewn  in  a  similar  manner  that  the  function  has  a  finite  lower  boundary. 
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A  function  which  is  continuous  in  the  closed  interval  (a,b)  is  suck  that  its 
upper  limit  and  its  lower  limit  are  each  actually  attained  at  one  point  at. least 
in  the  interval. 

For  suppose,  if  possible,  'that  f(x^  has  a  value  A,  different  from  M;  and 
consider  an  arbitrarily  small  interval  (x^  —  e,  xl  +  e)  for  which  M  is  the  upper 
limit  of  the  values  of  the  function;  then  points  exist  in  this  interval  for  which 
the  function  differs  by  less  than  an  arbitrarily  small  number  S,  from  M.  These 
values  of  the  function  would  differ  from  f(x^  by  an  amount  which  is  not 
arbitrarily  small,  and  this  would  be  inconsistent  with  the  condition  of  continuity 
of  the  function  at  the  point  xl.  It  follows  that  we  must  have  f(xl}  —  M. 
Similarly  it  may  be  shewn  that  the  lower  limit  m  is  reached  at  least  once  in 
the  interval  (a,  b). 

A  function  that  is  continuous  in  an  open  interval  (a,  b)  may  have  no  upper, 
or  no  lower,  boundary. 


CONTINUOUS  FUNCTIONS  DEFINED  FOR  A  CONTINUOUS  INTERVAL. 

214.  It  will  now  be  shewn  that,  iff(x)  be  continuous  in  the  closed  interval 
(a,b),  and  iff(a),  f(b)  have  opposite  signs,  then  there  is  at  least  one  value  of  x 
in  the  interval,  for  which  f(x)  vanishes. 

Let  us  suppose  that  f(a}  <  0,  /(&)  >  0  ;  and  let  a  system  of  nets  be  fitted 
on  to  the  interval  (a,  6).  Since  f  (b)  >  0,  in  a  sufficiently  small  neighbourhood 
(b  —  e,  6)  of  the  point  b,  f(x)  must  be  everywhere  positive.  If  n  be  sufficiently 
large,  one  or  more  meshes  of  Dn  are  interior  to  (b  —  e,  b),  and  in  these  meshes 
f(x)  is  positive.  There  exists  consequently  a  mesh  dn  of  lowest  rank  in  Dn, 
for  which  f(x)  is  negative  at  the  left  hand  end-point,  and  such  that  /(#)  is 
positive  at  the  right  hand  end-point ;  unless  /  (x)  =  0  at  the  left  hand  end- 
point,  in  which  case  a  point  such  as  the  theorem  requires  is  determined :  we 
assume  that  this  latter  case  does  not  arise.  There  is  then  among  those  meshes 
of  Dn+l  which  are  contained  in  dn,  one  of  lowest  rank,  such  that/(#)  is  negative, 
or  zero,  at  its  left  hand  end-point,  and  is  positive  at  its  right  hand  end-point. 
Proceeding  in  this  manner,  we  obtain  a  sequence  dn,  dn+l, . . .  dn+m, . . .  of  meshes, 
each  of  which  contains  the  next,  and  such  that,  either,  for  some  value  of  m, 
f(x)  is  zero  at  the  left  hand  end-point  of  dn+m,  or  else,  such  that,  for  the  un- 
ending sequence  of  values  of  m,  f(x)  is  negative  at  the  left  hand  end-point  of 
dn+m,  and  is  positive  at  the  right  hand  end-point.  In  this  latter  case,  there 
exists  a  point  c,  in  all  the  meshes  {dn+m}.  Iff(c)  were  negative,  in  a  sufficiently 
small  closed  neighbourhood  of  c,  f(x)  would  be  everywhere  positive;  and  this 
is  not  possible,  because  dn+m  would  be  interior  to  that  neighbourhood,  provided 
m  were  sufficiently  large.  In  a  precisely  similar  manner,  it  is  seen  that  /(c) 
cannot  be  positive.  Therefore  f(c)  =  0,  and  thus  the  theorem  is  established. 
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From  this  theorem  we  can  deduce  that,  whatever  values  f  (a),  f(b)  may  have, 
there  must  be  in  the  interval  (a,  b)  at  least  one  value  ofx,  for  which  f(x)  has  any 
prescribed  value  lying  between  f  (a)  andf(b). 

Let  this  value  be  C,  and  suppose/(a)  <  C  <f(b)  ;  then  the  function/(#)  —  C 
is  continuous  in  the  given  interval,  is  negative  when  x  =  a,  and  positive  when 
x  =  b  ;  thus  it  vanishes  at  least  once  in  the  interval  (a,  b). 

A  continuous  function  has  frequently  been  defined  as  a  function  such  that, 
if  /(a),  f(b)  be  its  values  at  any  two  points  a  and  6,  then  the  function  passes 
through  every  value  intermediate  between  f  (a)  and  /"(&),  as  x  changes  from  a 
to  b.  The  property  contained  in  this  definition  has  been  shewn  above  to  hold 
of  every  function  which  is  continuous  in  accordance  with  Cauchy's  definition  ; 
but  the  converse  theorem  does  not,  in  general,  hold.  The  definition  just  re- 
ferred to  is  accordingly  not  equivalent  to  that  of  Cauchy,  which  is  here  adopted 
as  the  basis  of  the  treatment  of  continuous  functions.  As  an  example  of  the 
non-equivalence  of  the  two  definitions,  we  may  consider  the  function  defined 

by  y  =  sin  -  ,  for  x  ^  0,  and  by  y  =  0,  for  x  =  0.    For  this  function  there  are 
oc 

values  of  x  between  a  (<  0)  and  b  (>  0)  for  which  f(x)  has  any  assigned  values  c 
lying  between  /(a)  and/(6)  ;  but  the  function  is  not  continuous,  in  accordance 
with  Cauchy's  definition,  in  any  interval  (a,  6)  which  contains  the  point  0.  It 
is,  in  fact,  easily  seen  that  the  point  0  is  a  point  of  discontinuity  of  the  function  ; 
for  an  arbitrarily  small  neighbourhood  of  the  point  0  contains  points  at  which 
the  function  has  all  values  in  the  interval  (—1,1). 

As  another  example*  of  a  function  which  satisfies  the  condition  referred  to, 
but  is  discontinuous  in  accordance  with  Cauchy's  definition,  let  the  number  x 
in  the  interval  (0,  1)  be  expressed  as  a  decimal  'a^a^a^  ...  an...  ;  then  con- 
sider the  decimal  'alo3a5a7  —  If  this  last  decimal  is  not  periodic,  we  take 
/(ar)  =  0;  if  it  is  periodic,  and  the  first  period  commences  at  am-\,  we  take 
f(x)  =  'a^a^n^a^^  —  The  function  so  defined  for  the  interval  (0,  1),  of  x, 
has  every  value  between  0  and  1,  in  every  arbitrarily  small  interval  in  the 
domain  of  x  ;  thus  the  function  is  discontinuous  at  every  point.  A  value  of  x 
for  which  f(x)  has  any  prescribed  value  'Pipz  ...pn...  is 


where  'a^az  ...  is  any  periodic  decimal,  the  first  period  of  which  begins  at  aM_1} 
and  A,  B,  ...  K  are  arbitrarily  chosen  digits.  Nevertheless  there  are  values  of 
x,  between  a  and  /3,  at  which  the  function  takes  any  assigned  value  intermediate 
between  /(a)  and/(/3). 

*  See  Lebesgue,  Lemons  sur  I  'integration,  p.  90. 
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CONTINUOUS  FUNCTIONS  DEFINED  AT  POINTS  OF  A  SET. 

215.  It  will  now  be  shewn  that,  if  a  function  f  (x),  having  prescribed  values 
at  each  point  of  an  infinite  set  of  points  in  the  interval  (a,  b),  be  continuous  in 
that  interval,  then  the  values  of  the  function  are  determinate  at  each  point  of  the 
derivative  of  the  set. 

Suppose  «j,  «2,  os, ...  on, ...  to  be  a  convergent  sequence  of  points,  for  which 
X-L  is  the  limiting  point,  and  suppose  /  («i), /(er2),  •••  /(«»)»  •••  to  be  known;  it 
will  be  shewn  that  these  functional  values  form  a  convergent  sequence  whose 
limit  is  f{$i}.  An  interval  (#a  —  S,  #x  +  B)  can  always  be  found,  corresponding 
to  any  fixed  number  e,  such  that  the  function  at  any  point  of  this  interval 
differs  from  /(#i)  by  less  than  the  arbitrarily  small  number  e ;  this  follows 
from  the  continuity  of  the  function.  A  number  n  can  be  found  such  that  all 
the  points  an,  an+l,  an+2, ...  lie  within  the  interval  (x-i  —  B,  a;1  +  B).  It  follows 
that  |/(#i)—  /(«n)  j  and  |/(#i)—/(a»+i)  i  >  etc-  are  an"  ^es8  tnan  «>  which  is 
arbitrarily  small ;  hence  f(x1)  is  the  limit  of  the  sequence  f(al),f(a2)) ...,  and 
thus  f(%i)  is  determinate.  From  this  special  case  it  follows  that,  for  all  the 
limiting  points  of  a  given  set  of  points  in  (a,  b),  the  values  of  the  continuous 
function  are  determinate.  It  thus  appears  that  the  function  is  determinate 
for  all  points  which  belong  to  the  derivative  of  the  given  set,  for  the  points  of 
which  the  values  of  the  function  are  known. 

In  particular,  if  a  continuous  function  have  prescribed  values  for  points  of  a 
set  which  is  everywhere  dense  throughout  the  interval  (a,  b),  then  its  values  are 
determinate  for  all  points  of  the  interval. 

A  special  case  of  such  a  set  would  be  that  of  the  rational  points  within  the 
interval.  It  follows  that  a  continuous  function  whose  values  are  known  for  all 
the  rational  points  in  an  interval  is  determinate  for  all  the  irrational  points. 
A  continuous  function  which  is  known  to  be  constant  for  all  the  rational  points 
has  the  same  constant  value  for  all  the  irrational  points  in  the  interval. 

It  is  clear  that,  if  cells  be  employed,  instead  of  intervals,  the  method  in  the 
proof  is  applicable  to  shew  that  a  continuous  function  of  any  number  of  variables 
is  determinate  at  each  point  of  the  derivative  of  a  set  of  points  at  which  the 
values  of  the  continuous  function  are  assigned. 

A  generalization  of  the  above  theorem  is  that,  if  a  function  is  continuous 
with  reference  to  a  domain  which  consists  of  a  set  (P),  and  is  known  for  all 
points  of  a  sub-set  which  is  everywhere  dense  in  (P),  then  the  function  is 
determinate  for  every  point  of  (P).  This  may  be  seen  by  remembering  that 
every  point  of  (P)  is  a  limiting  point  of  the  sub-set,  and  applying  the  same 
reasoning  as  before. 

216.  From  the  theorem  established  above,  that  a  continuous  function  is 
determinate  when  its  values  at  an  everywhere  dense  enumerable  set  of  points 
are  prescribed,  we  may  deduce  that  the  cardinal  number  of  the  aggregate  of  all 
continuous  functions  of  a  real  variable  is  the  cardinal  number  c,  of  the  continuum. 
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We  may  suppose  the  values  of  a  function  to  be  prescribed  at  the  rational 
points.  The  cardinal  number  of  the  aggregate  of  all  functions  defined  for  the 
rational  points  only  is  the  cardinal  number  of  the  ways  of  distributing  on  the  ag- 
gregate of  rational  numbers  the  aggregate  of  numbers  of  the  continuum.  This 
number  is  ca,  which  has  been  shewn  in  §  183  to  be  equal  to  c.  Only  some  of  the 
distributions  of  this  kind  are  such  as  will  give  rise  to  continuous  functions; 
hence  the  aggregate  of  all  continuous  functions  is  a  part  of  the  aggregate  of 
all  possible  distributions  on  the  set  of  rational  numbers  of  the  numbers  of  the 
continuum.  It  follows  that  the  cardinal  number  of  the  aggregate  of  all  con- 
tinuous functions  is  ^  c.  Again,  this  cardinal  number  is  =  c ;  for  among  the 
continuous  functions  are  those  which  are  constant,  and  everywhere  equal  to 
any  assigned  number  of  the  continuum;  and  thus  the  aggregate  of  all  con- 
tinuous functions  contains  a  part  which  has  the  cardinal  number  c.  Since  the 
cardinal  number  is  ^  c,  and  also  ^  c,  it  is  equal  to  c. 

This  theorem  was  established  by  Borel*,  who  also  shewed  that  the  aggre- 
gate of  all  analytical  functions  of  two  or  more  variables  has  the  cardinal 
number  c. 

The  cardinal  number  of  the  aggregate  of  all  functions  of  a  real  variable  is 
that  of  all  distributions  of  the  continuum  upon  itself;  this  is,  in  accordance 
with  the  definition  in  §  150,  denoted  by  cc,  for  which  we  may  write/. 

Each  particular  distribution  of  the  numbers  of  the  continuum  on  themselves 
is  definable  by  a  definite  norm,  and  corresponding  to  each  such  distribution 
there  is  a  definite  function  for  a  continuous  domain.  Let  the  aggregate  of 
all  such  functions  be  denoted  by  F:  it  will  then  be  proved  that  the  cardinal 
number/,  of  F,  is  >  c. 

First,  F  has  a  part  which  is  equivalent  to  the  Continuum.  This  is  at  once 
seen,  since  the  functions  /(#)  =  <v  where  a/ is  any  number  of  the  continuum, 
constitute  such  a  part.  It  follows  that  /  =  c. 

Next,  let  it  be  assumed,  if  possible,  that  F  is  equivalent  to  a  part  of  the 
continuum.  As  has  just  been  proved,  such  a  part  cannot  have  a  cardinal 
number  <  c ;  we  therefore  assume  that  F  is  equivalent  to  the  set  of  numbers 
of  the  continuum.  This  amounts  to  the  assumption  that  F  can  be  ordered  in 
the  same  type  as  the  continuum,  so  that,  to  any  assigned  number  £  of  the  con- 
tinuum, there  corresponds  a  definite  set  of  rules  R$,  which  defines  a  function 
ft  (a;).  The  correspondence  Between  £  and  R$  must  itself  be  defined  by  a  set  of 
rules,  so  that  when  £  is  assigned,  R^ ,  and  therefore  the  function/^  (a?),  is  defined. 
The  aggregate  {ft  (a?)}  must  contain  every  definable  function  of  a  real  variable. 
The  number  £  being  assigned,  ft  (x)  is  producible ;  and  its  existence  implies 
that,  at  any  assigned  point  £',  the  functional  value  ft  (£')  can  be  determined 
arithmetically.  We  may  take,  for  example,  £'  =  f ;  and  thus,  if  £  is  assigned, 

*  See  Lemons  sur  la  theorie  des  fonctions,  p.  127. 
H.  18 
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ft  (£)  is  known.  We  may  regard  ft  (£)  as  a  function  of  £;  for  its  value  at  any 
point  £  can  be  arithmetically  determined,  and  it  is  therefore  an  element  of  the 
aggregate  F  of  all  functions.  With  this  understanding  as  to  /$  (£),  choose  a 
fixed  number,  say  unity ;  then  the  function  <£  (£)  =f$  (£)  +  1  has  a  definite 
norm ;  for  we  have  only  to  add  to  the  rules  by  which  /j  (£)  is  defined,  the 
further  rule,  that,  at  each  point  £,  unity  is  to  be  added  to  the  value  of  /|  (£). 
We  have  now  a  new  definable  function  <£  (x) ;  but  this  cannot  possibly  belong 
to  the  aggregate  F,  for  if  it  do  so  belong,  there  must  be  some  one  point  £, 
of  the  continuum,  with  which  it  corresponds ;  but  <£  (x)  cannot  be  identical 
with/ft(o;),  for  <£  (£)  and/^di)  differ  by  unity.  Since  <f>  (£)  is  not  contained 
in  F,  contrary  to  the  hypothesis,  it  follows  that  F  cannot  be  equivalent  to  the 
continuum,  and  thus  the  theorem,  f>c,  is  established.  It  has  therefore  been 
shewn  that  f>c,  and  consequently  that — the  aggregate  of  all  functions  of  a 
real  variable  has  a  cardinal  number  f,  greater  than  c. 

UNIFORM   CONTINUITY. 

217.  It  will  now  be  shewn  that,  if  the  domain  of  x  be  a  continuous  interval, 
then  a  continuous  function  is  uniformly  continuous  through  the  domain  of  x. 
It  will  be  proved  that  a  number  8  can  be  found,  corresponding  to  any  given  ev 
such  that,  for  all  values  of  x,  the  fluctuation  off(x)  within  the  neighbourhood 
{x  —  8,  x  +  8),  or  for  all  of  this  neighbourhood  which  lies  within  the  domain,  is 
less  than  the  number  e.  The  essential  point  is  that  8  is  independent  of  x. 

Consider  a  symmetrical  system  {Dn},  of  nets  with  closed  meshes,  fitted  on  to 
the  interval  (ft,  6). 

If  e  be  a  prescribed  positive  number,  there  must  be  some  smallest  integer 
n},  such  that,  in  each  of  the  meshes  of  the  net  Dni,  the  fluctuation  of/(#)  is 
<  ^e.  For  suppose  that  no  such  integer  ^  exists ;  then,  however  large  n  may 
be  taken,  there  is  one  mesh  at  least  of  Dn  in  which  the  fluctuation  of  f(x)  is 
=  %e.  Let  Dn'  be  the  set  of  all  the  meshes  of  Dn  for  which  this  is  the  case;  the 
sequence  of  sets  of  meshes  D/,  D2',  . . .  Dn' ,  ...,  each  of  which  contains  the  next,, 
defines  a  closed  set  of  points  in  all  of  them.  Let  P  be  such  a  point,  and  let 
a  neighbourhood  A',  of  P,  be  constructed,  within  which  the  fluctuation  of  the 
function  is  <  |e.  This  neighbourhood  A'  will  contain,  in  its  interior,  a  mesh  of 
Dn,  from  and  after  some  fixed  value  of  n,  and  in  each  of  these  the  fluctuation 
of  /(#)  is  =  |e,  and  thus  there  is  a  contradiction  in  the  assumption  that  such 
a  point  as  P  exists.  Therefore  Dn  cannot  exist  for  all  values  of  n,  and  thus,  for 
some  integer  nl}  the  fluctuation  in  every  net  of  Dni  is  <  ^-e. 

If  x  be  any  point  of  the  closed  interval  (a,  b),  and  the  meshes  of  Dni  be  all 
of  breadth  28,  the  interval  (x  -  8,  x  +  8),  or  the  part  of  it  that  is  in  (a,  b),  is  in 
the  interval  formed  by  at  most  two  consecutive  meshes  of  DHl ,  and  the  inner 
fluctuation  of  f(x)  in  (x  —  8,  x  +  8)  is  consequently  <  e. 
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It  has  thus  been  shewn*  that  it  is  unnecessary,  for  a  function  of  a  real  variable, 
to  draw  a  distinction,  as  has  sometimes  been  done,  between  functions  which  are 
uniformly,  and  those  which  are  non- uniformly,  continuous  in  the  continuous 
domain  of  x\  for  all  continuous  functions  are  uniformly  continuous. 

The  theorem  may  also  be  stated  in  the  following  form : 

Iff(x}  be  continuous  in  the  closed  interval  (a,  b),  then,  corresponding  to  any 
arbitrarily  chosen  positive  number  e,  a  number  77  can  be  determined,  such  that  the 
condition  f(an1)  —  f(x2)  |  <  e>  **  satisfied,  for  any  two  points  x1}  x2,  in  (a,  b),  such 
that  xl  —  x2 1  <  77. 

The  following  theorem  can  be  immediately-  deduced : 

If  a  function  be  continuous  in  a  finite  interval,  then  the  interval  can  be 
divided  into  a  finite  number  of  sub-intervals,  in  every  one  of  which  the  fluctua- 
tion of  the  function  is  less  than  a  prescribed  positive  number. 

It  is  in  fact  clear  that,  if  e  be  the  prescribed  number,  the  condition  is 
satisfied  when  the  interval  is  subdivided  in  any  manner  such  that  the  length 
of  the  greatest  of  the  sub-intervals  is  <  rj. 

Another  proof  of  the  above  theorem,  in  an  extended  form,  will  be  given  in 
§  234,  by  employing  the  Heine-Borel  theorem. 

It  is  clear  that  the  above  proof  applies  also  to  the  case  in  which  the  domain 
of  x  is  not  a  continuum,  but  is  any  closed  set;  because  the  essential  point  of 
the  proof  depends  upon  the  limiting  points  all  belonging  to  the  domain.  Those 
meshes  of  the  nets  which  contain  no  points  of  the  domain  of  x  are  disregarded. 
For  domains  which  are  not  closed  the  proof  does  not  apply;  thus  a  function 
which  is  continuous,  relatively  to  an  aggregate  which  is  not  closed,  is  not 
necessarily  uniformly  continuous. 

In  the  case  of  a  function  of  p  variables,  when  the  domain  of  the  variables 
is  a  closed  set,  the  neighbourhood  (#!  —  &,  x.2  —  8,  ...xp  —  8;  x1-^-8,  x2  +  8,  ... 
Xp  +  8),  of  the  point  (XL,  xz,  ...,  xp),  may  be  (§  49)  denoted  by  (x  —  8,  as +  8). 
The  condition  of  uniform  continuity  takes  then  the  same  form  as  in  the  case 
p=l.  If  we  take  the  cell  in  which  the  domain  is  contained  to  have  its  edges 
all  equal,  and  employ  a  system  of  nets  in  which  the  edges  of  each  mesh  are 
all  equal,  the  above  proof  can  be  applied  to  shew  that  a  function  that  is  con- 
tinuous in  a  domain  that  consists  of  a  closed  set  of  points  is  uniformly  con- 
tinuous. 

*  This  theorem  was  first  stated  and  proved  by  Heine;  see  Crelles  Journal,  vol.  LXXI  (1870), 
p.  361,  and  vol.  LXXIV  (1872),  p.  188. 
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ABSOLUTE   CONTINUITY. 

218.  A  function  f(x),  defined  in  the  closed  interval  (a,  b\  is  said  to  be  abso- 
lutely continuous  in  (a,  b)  if,  corresponding  to  an  arbitrarily  chosen  positive 
number  e,  another  positive  number  can  be  so  determined  that,  in  every  enumerable, 
or  finite,  set  of  non-overlapping  intervals  (xl,  #/),  (#2,  #2'),  ...  (xn,  #„'),  ...,  in 
the  interval  (a,  b),  and  such  that  the  total  measure  of  the  intervals  is  <  77,  the 
sum,  or  limiting  sum,  2  |/(X-)  ~f(xr'}  is  <  e. 


r=l 


It  is  clear  that  a  function  which  is  absolutely  continuous  in  (a,  b)  is  also 
continuous,  in  accordance  with  the  definition  of  §  211;  but  the  converse  does 
not  hold.  We  may  take  the  set  of  intervals  to  consist  of  a  single  interval  of 
length  <  77;  thus  the  condition  of  uniform  continuity  is  satisfied. 

It  is  easily  seen  that  an  equivalent  form  of  the  definition  is  that,  in  every 
non-overlapping  set  of  intervals  of  total  measure  <  77,  the  sum,  or  limiting  sum, 
of  the  fluctuations  of  the  function  in  the  intervals  is  <  e. 


THE   CONTINUITY    OF    UNBOUNDED    FUNCTIONS. 

219.  Let  it  be  assumed  that  the  continuous  function  <£  (x),  defined  in  the 
interval  (a,  b),  of  x,  has  1  and  —  1  for  its  upper  and  lower  boundaries,  these 
being,  in  virtue  of  the  continuity  of  the  function,  also  its  upper  and  lower  limits. 

Consider    the    function  f(x)  =  -  —     ,,  >.    ,   from   which    it    follows   that 

1  —    (f>  (x) 


f(x) 
&  (x\  —  —  J  v  /  —  •  the  functions  <b  (x),  f(x)  have  the  same  sign  at  each  point 

i  +     *   ' 


x.  When  <£(#)  =  1,  we  may  suppose  the  improper  value  oc  to  be  assigned  to 
f(x);  and  when  </>(#)  =  —  1,  we  may  suppose  the  improper  value  —  oo  to  be 
assigned  tof(x).  In  this  manner,  by  adjoining  to  the  real  numbers  the  two 
elements  oo  ,  —  oo  ,  we  ensure  that  a  (1,  1)  correspondence  exists,  without  ex- 
ception, between  the  values  of  0  (x),f(x).  Thus,  to  the  closed  interval  (—1,  1), 
there  now  corresponds  the  closed  interval  (—00,00),  and,  in  the  correspondence 
so  defined,  the  relative  order  of  two  elements  is  preserved  without  change ; 
thus,  as  <£(#)  increases  steadily  from  —  1  to  1,  f(x)  increases  steadily  from 
—  oo  to  oc  . 

The  function  f(x}  is  continuous  for  any  finite  value  a,  of  x,  which  is  not 
such  that  </>  (a)  has  one  of  the  values  1,  —  1,  0. 

A    ( ™\  A  //v\ 

For  f(x)-  /•(«)  =  7, 


provided  f(x)  and  /(a)  have  the  same  sign.     A  neighbourhood  of  a  can  be  so 
determined  ^that,  at  every  point  of  that  neighbourhood,  1  —   <f>(x)    >  h,  where 
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h  is  some  positive  number  so  chosen  that  1  —  |  $  (a)  >  h,  and  also  such  that 
j  (f)  (#)  _  <£  (a)  |  <  h*€)  where  e  is  an  arbitrarily  chosen  positive  number.  In 
this  neighbourhood  we  have  then  !/(#)—/(«)  <  e,  and  thus  f(x)  is  con- 
tinuous at  o. 

If  <£  (o)  =  0,  we  have  j  <£  (a)  <  e,  in  a  sufficiently  small  neighbourhood  of  o, 
and  therefore  |/(a)  |  <  y—  -  <  2e,  if  e  <  J-.  Hence  /(#)  is  continuous  at  a. 

At  a  point  a  at  which  <£  («)  =  !,  the  function  /(#)  has  the  value  oc  .  If 
lY  be  an  arbitrarily  large  number,  a  neighbourhood  of  a  exists  in  which 

N 
<£  (x)  >  -,, — Y  >  in  virtue  of  the  continuity  of  $  (x),  at  a;  so  that  /(a)  >  N  in 

that  neighbourhood.  In  a  similar  manner,  it  may  be  shewn  that,  at  a  point  a, 
at  which /(a)  =  —  1,  a  neighbourhood  of  a  exists  such  that /(a)  <  —  N. 

It  is  convenient  to  extend  the  definition  of  continuity  of  a  function  at  a 
point  in  such  wise  that  the  points  at  which  f(x)  has  the  values  x ,  —  oo  are 
points  of  continuity  of  the  function,  corresponding  to  the  continuity  of  the 
function  </>  (x)  at  such  points. 

We  are  thus  led  to  the  following  definition  of  the  continuity  of  an  un- 
bounded function  at  a  point  in  the  neighbourhood  of  which  the  function  has 
indefinitely  great  values. 

A  function  f  (x)  is  said  to  be  continuous,  in  the  extended  sense,  at  a  point  a, 
when,  corresponding  to  any  arbitrarily  chosen  positive  number  N,  a  neigh- 
bourhood of  a.  exists,  such  that,  at  every  point  of  that  neighbourhood,  f(x)  >  N. 
It  is  also  said  to  be  continuous  at  a,  if  the  above  conditions  be  replaced  by 
f(x)  <  —  N.  The  value  of  the  function  at  a  may  be  regarded  as  oo  ,  in  the  first 
case,  and  —  oo  in  the  second  case. 

An  unbounded  function  f(x)  is  regarded  as  continuous  in  the  interval 
(a,  6),  for  which  it  is  defined,  if  it  is  continuous  at  every  point  f(x)  in  the 
neighbourhood  of  which  the  function  is  bounded,  and  is  continuous,  in  the 
extended  sense,  at  every  other  point. 

Thus,  in  the  above  case,  the  unbounded  function  f(x),  continuous  in  this 
sense,  corresponds  to  <j>  (x)  which  is  continuous  in  the  mure  restricted  sense  of 
the  term.  It  can  easily  be  seen  that  the  continuity  of  $  (x)  is  a  consequence 
of  that  of  f(x). 

In  the  above  extension  of  the  conception  of  a  continuous  function  to  the 
case  of  unbounded  functions,  the  points  oo  and  —  oo  have  been  regarded  as 
distinct. 

It  is  however  possible  to  regard  these  points  as  not  distinct  from  one 

another,  so  that  when  we  consider  the  functions  f(x),  - — -   a  value  at  which 

J\ac!) 

f(x)  =  0  corresponds  to  a  single  value  +  x> ,  of  -^-—.  . 

J  \x) 
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In  this  case  a  function  f{x)  is  said  to  be  continuous  at  a  point  a  for  which, 
corresponding  to  each  arbitrarily  chosen  positive  number  N,  a  neighbourhood  of 
a  exists,  in  the  whole  of  which  \f(x)  \  >  N. 

It  is  clear  that  this  condition  may  be  satisfied  in  cases  in  which  the  con- 
dition of  continuity  is  not  satisfied  when  oo  and  —  oo  are  regarded  as  distinct 
values  of  a  function. 


THE   LIMITS    OF    A    FUNCTION    AT    A   POINT. 

220.  Let  a  be  a  limiting  point  of  the  set  of  points  which  forms  the  domain 
of  the  independent  variable  x  ;  the  point  a  may,  or  may  not,  itself  belong  to 
the  domain  of  x.  Let  (a,  a  +  h)  be  a  neighbourhood  of  a  on  the  right,  and  let 
U(h\  L(h)  denote  the  upper  and  lower  limits  of  a  given  function  f(x)  for  all 
the  points  of  the  domain  of  x  which  are  interior  to  the  interval  (a,  a  +  h).  It 
will  be  observed  that  /(a),  if  it  exists,  is  not  reckoned  amongst  the  functional 
values  of  which  U  (h),  L  (h)  are  the  upper  and  lower  limits. 

Let  a  sequence  of  diminishing  values  be  assigned  to  h,  which  converges  to 
zero;  denoting  this  sequence  by  h1}  h^,  hs,  ...;  the  corresponding  numbers 
U(hi\  U  (h2),  ...  U(hn\  ...  form  a  sequence  of  which  the  members  do  not  in- 
crease, and  therefore  they  have  in  general  a  definite  lower  limit,  which  is 
called  the  upper  limit  of  f(x)  at  a  on  the  right.  It  is  easily  seen  that  this 
limit  is  independent  of  the  particular  sequence  {hn},  employed  in  defining  it.  It 
may  happen  that  all  the  upper  limits  U  (h)  are  infinite,  in  which  case  we  say 
that  the  upper  limit  of  f(x)  at  a  on  the  right  is  +  oc  ;  or  it  may  happen  that 
the  sequence  U(h^),  U(h2),  ...  U(hn),...  has  no  lower  limit,  in  which  case  we 
say  that  the  upper  limit  of  f(x}  at  a  on  the  right  is  —  oo  .  In  any  case,  the 
finite,  or  infinite,  upper  limit  of  f(x)  at  ct,  on  the  right,  is  denoted  by/(a  +  0). 

The  numbers  £(/h),  L(h*\  .  .  .  L(hn),  .  .  .  form  a  sequence  of  which  the  elements 
do  not  diminish,  and  they  have  an  upper  limit,  which  is  called  the  lower  limit 
of  f(x},  at  a  on  the  right,  and  may,  as  in  the  former  case,  have  infinite  values 
oo  ,  or  —  oc  .  This  limit  is  denoted  by  /(a  4-  0). 

Corresponding  definitions  apply  to  the  left  of  the  point  a  ;  and  the  limits 


of  /(#)  at  a  on  the  left  are  denoted  by  /(a  —  0),  /(a  —  0)  respectively.  In  case 
the  point  a  is  a  limiting  point  of  the  domain  of  x,  on  one  side  only,  the  two 
limits  of  the  function  at  a  on  the  other  side  are  non-existent. 

In  defining  f(a  +  6)  it  is  immaterial  whether  we  employ  the  upper  limit, 
or  the  upper  boundary,  of  the  function  in  the  open  interval  (a,  a  +  h).  If  U(h} 
be  the  upper  limit,  and  U  (h}  the  upper  boundary,  all  except  a  finite  number 
of  values  of  f(x)  in  the  open  interval  (a,  a  +  h)  are  such  that  f(x}  <U(h)  +  e/4; 
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where  €h  is  an  arbitrarily  chosen  positive  number.  A  number  li  <  h  can  be  so 
chosen  that  U(h')  £  U(h'}  <  U(h)  +  eh.  If  this  be  done  in  such  a  manner  that 
lim  eh=  0,  it  follows  that  U(h}  and  U  (h)  have  the  same  lower  limit,  as  h  ~  0. 


A  similar  remark  applies  to /(«  +  0),  /(a  —  0),  /(«  —  0). 

The  definitions  of  the  upper  and  lower  limits  at  a  point  a  may  be  stated 
shortly  as  follows: 

The  upper  limit  f  (a.  +  0)  of  a  function  at  a,  on  the  right,  is  the  limit  of 
the  upper  limit  of  f(x}  in  the  open  interval  (a,  a  +  h),  when  h  is  indefinitely 
diminished. 

The  lower  limit  f  '(a  +  0)  of  a  function,  at  a,  on  the  right,  is  the  limit  of 
the  lower  limit  of  f(x)  in  the  open  interval  (a,  a  +  h),  when  h  is  indefinitely 
diminished. 

The  definitions  for  the  left  of  a  may  be  stated  in  a  precisely  similar  manner. 


It  is  to  be  observed  that  the  four  functional    limits  /(«  +  0),  /(a  +  0), 


/(«  -  0),  /(«  -  0)  are  entirely  independent  of /(a),  in"  case  a  belongs  to  the 
domain  for  which  /O)  is  defined.  Any  arbitrary  alteration  in  the  value  of 
f(d)  will  not  affect  these  four  limits  of  /(a?)>  at  a. 

The  conditions  that  the  point  a  may  be  a  point  of  continuity  of  the 
function/O)  are  that/(a  +  0),  /(aj+0),/(a^rO),/(a  -  0),/(a)  should  all  have 
the  same  finite  value. 


It  may  happen  that  /(a),  /(a  +  0),  /(«+_0)  have  one  and  the  same  finite 
value,  but  that  either,  or  both,  of  /(a  —  0),  /(a  —  0)  have  not  this  value;  in 
this  case/(#)  is  said  to  be  continuous  at  a,  on  the  right.  Continuity  at  a,  on 
the  left,  is  defined  in  a  similar  manner. 

If  the  four  functional  limits  at  a  be  all  finite  and  equal,  but  f(a)  have  a 
different  value,  then  the  function  is  said  to  have  a  removable  discontinuity  at 
the  point  a.  In  this  case  the  function  would  be  made  continuous  at  a  merely 
by  properly  altering  the  value  of /(a). 

The  four  functional  limits  at  the  point  x  =  0  are  usually  denoted  by 
/(+0),  /(+  0),/(-"0),  /(-  0)  respectively. 


If,  at  a  point  «,/(«)  =/(a  +  0)  =/(a  +  0) •=/(«  -  0)  =/(«  -  0)  =  +  oe  ,  or 
if  /  (a),  and  all  the  four  limits,  are  —  oc  ,  the  function  is  said  to  be  continuous 
at  a,  in  the  extended  sense  of  the  term ;  the  distinction  between  +  oc  and 
—  oo  being  preserved. 


In  case  /(a)  =/(a  +  0)  =/(«  +  0)  =  +  oo  ,  or  —  oc  ,  the  function  is  said  to 
be  continuous  at  a,  on  the  right,  in  the  extended  sense  of  the  term. 
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The  four  numbers  f(x  +  0),  /(a  +  0),  /0»-0),  /(#-0),  which  are  de- 
pendent on  a;,  may  be  regarded  as  denning  functions  of  #  called  the  upper, 
and  the  lower,  right  hand,  or  left  hand,  limiting  functions  associated  with  /(#). 

The  upper  associated  function  A  (x)  may  be  defined  to  be  a  function 
whose  value  at  x  is  the  greater  of  the  two  numbers /(#  +  0),  f(x  —  0);  and  the 
lower  associated  function  a  (x)  may  be  defined  to  be  a  function  whose  value 
at  x  is  the  lesser  of  the  two  numbers  f(x  +  Q),  f(x  —  0). 


The  greatest  of  the  three  numbers  f(x),f(x  +  0),f(x  -  0)  may  be  regarded 
as  the  value,  at  x,  of  a  function  M  (x),  called  the  maximal  function  associated 
with  f  (a). 

The  least  of  the  three  numbers  f(x),  f(x  +  0),  f(x  —  0)  may  be  regarded 
as  the  value,  at  x,  of  a  function  m  (x},  called  the  minimal  function  associated 
with  f(x). 

221.  If  the  upper  and  lower  limits  of  f(x)  at  a,  on  the  right,  have  t,he 
same  value,  this  common  value  is  called  the  limit  of  f(x)  at  a,  on  the  right, 
and  is  denoted*  by/(oc+  0).  If  the  upper  and  lower  limits  of  f(x)  at  a,  on 
the  left,  have  the  same  value,  this  is  called  the  limit  of  f(x)  at  a,  on  the  left, 
and  is  denoted  by  /(a  —  0).  Either  of  the  limits,  on  the  right,  or  left,  at  a 
point,  when  such  limit  exists,  may  be  either  finite  or  infinite. 

The  limit  at  x  =  Q,  on  the  rigKt,  is  denoted  by  /(+0);  and  the  corre- 
sponding limit  on  the  left  is  denoted  by  /(— 0). 

The  limit  at  a  point  P,  on  one  side,  may  be  also  defined  as  follows: — Let 
(Pj,  P2,  Ps, ...)  be  any  convergent  sequence  of  points  belonging  to  the  domain 
of  x,  which  is  such  that  a,  or  P,  is  its  limiting  point,  and  such  that  all  the 
points  of  the  sequence  are  on  the  one  side  of  P.  The  values  of  f  (x)  at 
Pj,  P2,  Pz,  ...  form  an  aggregate  which  may  be  a  convergent  sequence;  let  us 
suppose  it  to  be  so,  and  also  that  its  limit  has  a  value  which  is  independent 
of  the  particular  sequence,  which  is  however  subject  to  the  conditions  above 
stated.  In  that  case,  this  limit  is  denoted  by  f(a  +  0),  or  by /"(a—  0),  as  the 
case  may  be,  and  is  called  the  limit  off(x)  at  a,  on  the  right  or  left. 

It  may  be  observed  that  the  necessary  and  sufficient  condition  for  the 
existence  of  a  definite  finite  limit,  on  the  right,  at  a,  is  that,  corresponding  to 
every  arbitrarily  small  number  e,  a  neighbourhood  (or,  a  +  8)  can  be  found,  such 
that  the  difference  of  the  values  of  the  function  at  every  pair  of  points  of  the 
domain  of  x,  which  are  in  the  interior  of  this  interval,  is  numerically  less 
than  e. 

*  This  notation  was  introduced  by  Dirichlet ;  see  Werke,  vol.  i,  p.  156. 
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The  necessary  and  sufficient  condition  that  /(a  +  0)  should  exist,  and  be 
equal  to  +  oo  ,  is  that,  if  A  be  an  arbitrarily  chosen  positive  number,  &  can  be 
so  determined  that  at  every  point  interior  to  (a,  a  +  8)  the  condition  f(x)  >  A 
is  satisfied.  In  order  that  /(a  +  0)  may  exist,  and  be  equal  to  —  oo  ,  the  corre- 
sponding condition  is  that  f(x)  <  —  A. 

It  is  possible  that  one  of  the  limits  /(a  +  0),  /(a  —  0)  may  exist  and  not 
the  other.  If  the  domain  of  x  be  either  a  continuous  interval,  or  a  perfect  set 
of  points,  a  may  be  taken  to  be  at  any  point  of  the  domain. 

When  the  condition  for  the  existence  off(ct  +  0),  or  off(ct  —  0),  at  a  point 
a,  is  not  satisfied,  the  convergent  sequence  (Pl ,  P2 ,  . . .  Pn, . . .),  of  which  P  (a)  is 
the  limiting  point,  may  be  such  th&tf(P1),f(P2),  ...f(Pn), ...  is  either  not  a 
convergent  sequence,  or  else  that  its  limit  depends  upon  the  particular  choice 
of  the  points  P1}  P2, ...  Pn, ....  In  this  case  the  fluctuation  off(x)  within  an 
arbitrarily  small  neighbourhood  (a,  a  +  8),  on  the  one  side  of  a,  is  either  a 
finite  number  which  has  not  zero  for  its  limit,  when  8  is  indefinitely  diminished, 
or  else  it  is  indefinitely  great,  however  small  8  may  be. 

222.  If  a?j,  xz,  uc3, ...  xn,  ...  be  a  convergent  sequence  of  points  belonging 
to  the  domain  of  x,  with  «  for  its  limiting  point,  then  the  sequence 
f(xi\f(x2\  •••f(xn)>  •••  may  not  be  convergent;  but,  if  it  be  convergent,  its 
limit  may  have  either  (1)  a  single  value  independent  of  the  mode  in  which  the 
convergent  sequence  is  chosen,  in  which  case  a  is  either  a  point  of  continuity 
of  f(x),  or  a  point  of  removable  discontinuity  of  the  function;  or  (2)  one 
of  two  values,  in  which  case  both  the  limits  f(a  +  0),  f(a  —  0)  exist;  or 
(3)  one  of  a  finite,  or  an  indefinitely  great,  number  of  values,  which  all  lie 
between  the  greatest  and  least  of  the  four  functional  limits  at  a. 

If  the  sequence  f(x1),f(x2),  ...f(xn),  ...  be  not  convergent,  by  omitting  a 
certain  set  of  the  numbers  of  the  sequence,  we  shall  obtain  a  convergent 
sequence,  or  else  one  which  diverges  to  +  oo ,  or  to  —  x  .  We  can  therefore 
without  loss  of  generality  suppose  that  xl,  x2,...xn,  ...  are  so  chosen  that 
this  is  the  case. 

The  aggregate  of  all  possible  values  of  the  limits  of  such  a  sequence 
f(xl},f(x^),  ...f(xn),  ...,  for  all  sequences  {#„}  which  converge  to  a,  each  of 
which  limits  has  a  finite  value,  or  is  GO  ,  or  —  x,  is  called  the  aggregate  of 
functional  limits  at  the  point  or.  Of  the  numbers  a-l}  x.2,  ...  xn,  ...  an  infinite 
number  are  on  one  side  of  ot;  we  may  therefore  without  loss  of  generality 
assume  that  xl,  xz,  ...  xn, ...  are  all  on  the  same  side  of  a. 

It  will  be  shewn  that  the  aggregate  of  functional  limits  at  a  is  a  closed  set 
(which  may  be  finite),  provided,  in  case  any  functional  limit  is  infinite,  we 
regard  the  point  x  ,  or  —  x  ,  as  belonging  to  the  set. 

Suppose  U  is  the  limit  to  which  such  a  sequence  converges.  If  the 
aggregate  of  all  values  of  C7"is  not  finite,  let  C/"n,  U2, ...  UH,  ...  be  a  convergent 
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sequence  of  values  of  U,  of  which  Um  is  the  limit.  It  will  be  shewn  that  Uu  is 
itself  a  value  of  U.  First  suppose  that  Z7W  is  finite.  Let  Ur  be  the  limit  of  a 
sequence  {f(xn(r])},  for  n=-l,  2,  3,  ...;.  we  may  choose  r  so  large  that 


for  this,  and  all  greater,  values  of  r.    We  can  then  choose  n  so  that 


for  that,  and  all  greater,  values  of  n.    It  follows  that,  for  all  sufficiently  large 
values  of  r  and  n,    UM  —f(xn(r})  \  <  e. 

As  e  is  arbitrarily  small,  we  can  obtain  a  sequence  of  numbers  x,  such 
as  ccn(r\  for  which  f(scn{r))  converges  to  UM,  whilst  the  numbers  xn(r]  converge 
to  a.  Hence  Z7W  belongs  itself  to  the  aggregate  of  functional  limits. 

In  case  U^  is  infinite,  we  can  choose  r  so  that  Ur  >  A,  an  arbitrarily 
chosen  number;  then,  as  before,  n  can  be  so  chosen  that  Ur—f  (xn(r))  \  <  £e, 
and  thus  \f(xn(r))  >  A.  —  ^-e.  Taking  an  increasing  sequence  of  values  of  A, 
and  a  diminishing  sequence  of  values  of  e,  we  obtain  a  sequence  of  points 
scn(r},  for  which  \f(xn(T)}  \  is  divergent. 

It  thus  appears  that  the  aggregate  of  functional  limits  at  a  point  a  is 
closed,  in  the  ordinary  sense,  when  the  upper  and  lower  functional  limits,  at  a, 
are  all  finite;  and  that,  when  this  is  not  the  case,  the  set  will  be  closed  if  we 
regard  one,  or  both,  of  the  points  oo  ,  —  oo  as  belonging  to  it. 

The  aggregate  of  functional  limits  may  be  finite,  or  may  consist  of  a 
closed  set,  of  any  type  as  regards  density. 

223.  If  the  domain  of  x  be  unbounded,  in  one,  or  in  both,  directions,  it  may 
happen  that  a  point  ^(>0)  of  the  domain  can  be  found,  corresponding  to 
every  arbitrarily  chosen  positive  number  e,  such  that  the  difference  between 
the  values  of  f(x),  for  any  two  values  of  x  which  are  both  greater  than  x1}  is 
numerically  less  than  e.  In  this  case  the  function  has  a  definite  limit,  as  x  is 
increased  indefinitely  in  its  domain  ;  and  this  is  called  the  limit  of  /  (x)  for 
x~  oo  .  Under  a  corresponding  condition  f(x]  may  have  a  definite  limit  for 

OC"^  —  00  . 

If,  as  x  increases,  a  point  x±  of  the  domain  of  x,  corresponding  to  each 
assigned  positive  number  A,  chosen  as  great  as  we  please,  can  be  found,  such 
that/(V)  >  A,  for  all  values  of  x  which  belong  to  the  domain,  and  are  >  x1}  then 
the  limit  of  /(#)  is  said  to  be  oo  ,  as  x  is  increased  indefinitely.  Iff(x)  <  —  A, 
for  all  such  values  of  a^,  then  the  limit  of  f(x)  is  said  to  be  —  oo  .  Similar 
definitions  apply  to  the  case  in  which  x  has  indefinitely  great  values  in  the 
negative  direction. 

In  case  the  limit  /(a  +  0),  at  a  point  a  on  the  right,  do  not  exist  as  a 
definite  number,  and  be  not  infinite  with  a  fixed  sign,  it  is  frequently 
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convenient  to  regard  /(a  +  0)  as  still  existent,  but  indeterminate,  and 
capable  of  all  values  belonging  to  some  closed  set  of  which  /(a  +  0), /(a  4-0) 
are  the  extreme  values.  It  is  then  said  that  /(a  +  0)  is  indefinite  in  value, 
and  that /(a  +  0), /(a  +  0)  are  its  limits  of  indeterminancy.  A  similar  remark 
applies  to  /(a  —  0),  which  may  also  be  either  definite,  or  indefinite,  with 
/(a  —  0),/(a  -  0)  as  its  limits  of  indeterminancy.  One,  or  both,  of  the  limits  of 
indeterminancy,  in  either  case,  may  be  infinite. 

THE  DISCONTINUITIES  OF  FUNCTIONS. 

224.  Let  us  suppose  the  domain  of  x  to  include  all  points  in  a  sufficiently 
small  neighbourhood  of  a  point  a ;  or,  in  any  case,  let  a  be  a  limiting  point  of 
the  domain  of  a?. 

The  fluctuation  of  the  function  f(x)  in  the  closed,  or  open,  neighbourhood 
(a  —  8,  a +  8)  of  the  point  a,  depends  in  general  upon  8,  but  cannot  increase  as 
8  is  diminished.  It  therefore  has  a  lower  limit,  for  values  of  S  which  converge 
to  zero.  This  limit,  which  may  be  zero,  finite,  or  indefinitely  great,  is  called 
the  saltus  (Sprung),  or  measure  of  discontinuity,  of  the  function  /(#),  at  a; 
thus: 

The  saltus,  or  measure  of  discontinuity,  of  a  function  /(#),  at  a  point  a,  is 
the  limit  of  the  fluctuation  of  the  function  in  a  neighbourhood  (a  —  8,  a  +  8),  as 
8  converges  to  zero. 

The  upper  boundary  of  the  function  f  (x}  in  the  interval  (a  —  8,  a  +  8)  has  a 
lower  limit,  as  8  is  indefinitely  diminished,  which  is  called  the  maximum  M(a), 
of  the  function  f  (a;),  at  a  (see  §  220). 

The  lower  boundary  of  the  function,  in  the  same  interval,  has  an  upper  limit, 
as  8  is  indefinitely  diminished,  which  is  called  the  minimum  m(a\  of  f(x),  at  a. 
Either  the  maximum  or  the  minimum  at  a  point  may  be  indefinitely  great. 

The  saltus  of  f(x)  at  a  is  easily  seen  to  be  the  excess  of  the  maximum  at 
a  over  the  minimum. 

It  is  clear  that  the  maximum  of  /(#),  at  a,  is  the  greatest  of  the  numbers 


/(«  +  0), /(a  —  0), /(a),  and  that  the  minimum  is  the  least  of  the  numbers 
/(a  +  0),  /(a  —  0),  /  (a);  and  thus  that  the  saltus  at  a  is  the  excess  of  the  greatest 

over  the  least  of  the  numbers  /(a  +  0),  /(a  +  0),  /(a  -  0),  /(a  -  0),  /(a). 

At  a  point  of  continuity  of  f(oc)  the  saltus  is  zero.  Any  point  at  which 
the  saltus  has  a  finite  value  (>0),  or  is  indefinitely  great,  is  called  a  point  of 
discontinuity  of  /(#),  and  in  the  latter  case  it  is  said  to  be  &  point  of  infinite 
discontinuity. 
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If  the  closed  neighbourhood  (a,  a  +  8),  on  the  right  of  a,  be  taken,  the  lower 
limit  of  the  fluctuation  in  this  neighbourhood  when  8  is  indefinitely  diminished 
is  called  the  saltus  at  a,  on  the  right.  This  is  equivalent  to  the  excess  of  the 
greatest  over  the  least  of  the  three  numbers  /(ot  +  0),  /  (a  +  0),  /(a).  A  cor- 
responding definition  applies  to  the  saltus  at*  a  on  the  left. 

225.  The  points  of  discontinuity  of  a  function  may  be  classified  as  follows:  — 
(1)  If  both  the  limits  /(oc  +  0),  /(a—  0)  exist,  and  have  values  which  differ 
from  one  another,  the  point  a  is  said  to  be  a  point  of  discontinuity  of  the  first 
kind,  or  a  point  of  ordinary  discontinuity. 

The  difference  between  the  greatest  and  least  of  the  three  numbers 
J  (a  +  0),  f  (a  —  0),  /'(a)  is  the  saltus,  or  measure  of  discontinuity,  of  the  function 
at  a.  If  a  be  not  a  point  of  the  domain  of  x,  f  (a.  +  0)—  /(«  —  0)|  measures 
the  saltus  at  a;  and  if  a  be  a  point  of  the  domain,  and  f(a)  lies  between 
/(a  +  0)  and/(a  —  0),  then  the  saltus  is  also  measured  by  J  /(«  -f  0)  —/(a  —  0)  |. 

When  /(a)  does  not  lie  between  /(a  +  0)  and  /(a  —  0),  the  function  is  said 
to  have  an  external  saltus  at  a. 

In  every  case,  the  saltus  on  the  right  is  measured  by  /(a  4-0)—  /(a) 
and  that  on  the  left  by  |/(«  -  0)  -/(a)  |  . 

Whether  there  be  an  external  saltus  at  or,  or  not,  the  number 


is  said  to  measure  the  oscillation  (Schwingung)  at  a.  The  oscillation  at  a 
point  differs  from  the  saltus  in  that  the  functional  value  /(a),  at  the  point,  is 
in  the  former  case  disregarded. 

If  f  (a)  =  f  (a  —  0),  whilst  f  (a)  £  f  (a  +  Q),  the  function  is  said  to  be 
ordinarily  discontinuous  at  a  on  the  right.  If  f(a)^f(a  —  0),  whilst 
f  (a)  —  f  (a  +  Q],  the  function  is  said  to  have  an  ordinary  discontinuity  at  a 
on  the  left. 

*/ 

It  may  happen  that/(a  +  0),  /(a  —  0)  have  equal  values  which  differ  from 
/(a).  In  that  case  the  discontinuity  at  a  is  removable  (see  §220);  since  by 
merely  altering  the  functional  value  at  the  one  point  a,  the  function  can  be 
made  continuous  at  the  point. 

(2)  If  neither  of  the  limits  f(a  +  Q),f(a  —  0)  exists,  the  discontinuity  at 
a  is  said  to  be  of  the  second  kind. 

The  oscillation*  at  a  is  measured  by  the  excess  of  the  greater  of  the  two 
numbers  /'(a+0),  f(a  —  0)  °ver  the  lesser  of  the  two  numbers  f(a 
/(a  —  0),  the  value  off  (a)  being  left  out  of  account. 


The  differences  /(a  +  0)  -/(g  +  0),  /(a  -  0)  -/(a  -  0)  may  be  spoken  of 
as  the  oscillation  at  a.  on  the  right,  and  on  the  left,  respectively. 

*  This  definition  of  the  "  Schwingung  "  is  given  by  Pasch  in  his  Einleitung  in  die  Differential- 
und  Integralrechnung,  p.  139. 
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By  Dini*,  a  definition  of  the  saltus  is  adopted  which  differs  from  the  one 
which  we  have  employed ;  he  takes  the  greatest  of  the  four  differences 
|/(«  ±  0)-/(a)  as  the  measure  of  the  saltus,  the  greater  of  the  two 
differences  |  f(a  +  0 )  —  /(«)  \  being  taken  as  the  measure  of  the  saltus  on  the. 
right. 

(3)  It  may  happen  that  one  of  the  two  limits /(a +  0), /(a— 0)  exists, 
whilst  the  other  does  not.    In  this  case,  the  point  a  may  be  said  to  be  a  point 
of  mixed  discontinuity. 

If  /(a)  exist,  and  be  equal  to  that  one  of  the  two  limits /(«  4-  0),  /(a  —  0) 
which  exists,  then  the  function  is  continuous  at  a  on  one  side,  and  has  a  dis- 
continuity of  the  second  kind  on  the  other  side. 

(4)  If  one  or  more  of  the  four  limits  f(a±  0)  be  indefinitely  great,  the 
point  a  is  one  of  infinite  discontinuity. 

Under  infinite  discontinuities  is  sometimes  included  the  case  in  which 
/(a)  is  defined  by  !//(«)  =  0,  or  when  f(x}  is  defined  as  the  limiting  sum  of 
a  series  which,  for  the  value  a,  becomes  divergent. 

226.  In  an  arbitrarily  small  neighbourhood  (a,  a  +  h},  on  the  right  of  a 
point  a  at  which  the  limits  /(a  +  0),  /(« +  0)  have  different  values,  there 
must  be  an  infinite  number  of  points  at  which  f(x)  >/(a  +  0)  —  e,  where  e  is 
an  arbitrarily  small  fixed  number. 

For,  if  there  were  only  a  finite  number  of  such  points  in  (a,  a  +  h),  h  could 
be  chosen  so  small  that  all  such  points  would  be  excluded  from  the  neigh- 
bourhood; thus,  in  a  sufficiently  small  neighbourhood  (a,  a  +  h),  we  should  have 
at  every  internal  point  /(#)  ^f(a  +  0)  —  e ;  and  then  the  upper  limit  at  a  on 
the  right  could  not  be/(a+  0).  In  a  similar  manner  it  can  be  shewn  that,  in 
the  arbitrarily  small  neighbourhood  (a,  a  +  h},  there  must  be  an  infinite  number 
of  points  at  which  /(#)  <  /(«  +  0)  +  e. 

In  this  case  we  say  that,,  in  the  arbitrarily  small  neighbourhood  of  a,  on 
the  right,  the  function  makes  an  infinite  number  of  finite  oscillations.  In 
case  of  the  infinity  of  one,  or  of  both,  of  the  limits  /(a  +  6)  and /(a  +  0),and, 
in  the  latter  case,  only  if  they  be  of  opposite  signs,  the  function  makes  an 
infinite  number  of  infinite  oscillations  in  the  arbitrarily  small  neighbourhood 
of  a.  A  similar  remark  applies  to  the  case  in  which  /"(a  —  0),  /(a  — 0)  have 
unequal  values.  It  has  thus  been  shewn  that : 

A  point  of  discontinuity  of  the  second  kind  is  one  such  that,  in  its  arbi- 
trarily small  neighbourhood,  the  function  makes  an  infinite  number  of  finite  or 
infinite  oscillations. 

*  See  Grundlagen,  p.  55. 
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In  an  arbitrarily  small  neighbourhood  on  either  side  of  a  point  of 
discontinuity  of  the  first  kind,  the  function  may  make  'an  infinite  number 
of  oscillations;  but  since  the  neighbourhood  can  be  chosen  so  small  that 
the  fluctuation  of  the  function  in  its  interior  is  arbitrarily  small,  the 
oscillations,  when  they  are  infinite  in  number,  are  arbitrarily  small,  sufficiently 
near  the  point. 

EXAMPLES. 

1 .  Let  /  (07)  =  sin  x\x,  when  x  ^  0,  and  f(x)  =  A,  when  x = 0. 

In  this  case  /(  +  0)=/(  —  0)  =  1,/(0)  =  A  ;  thus  f(x)  has  a  removable  discontinuity  at 
^•  =  0,  unless  -4  =  1,  in  which  case  the  function  is  continuous  in  any  interval. 

2.  Let /(#)  =  - ;  we  have  then  f(a  +  0)  =  co ,  f(a  —  0)  =  —  »,  and  f(a)  is  undefined. 

iZ-'  —  QJ 

3.  Let  f(x)  =  (x  —  a)  sin ;   then  /(«  +  0)  =  0,  /(«  —  0)=0.    This  function  is  con- 

•X?  —  (JL 

tinuous  at  x=a,  and  makes  an  infinite  number  of  oscillations  in  any  neighbourhood  of 
that  point. 

4.  If  f(x}  =  — '- — cosec — —  .  then 

•>   ^     '        v —  //  v  —  a.' 


/(«  +  0)  =  oc,   /(o  +  0)=-oo>    /(o-6)  =  x,    /(a-0)=-QO. 
This  function  has  an  infinite  discontinuity  of  the  second  kind  at  the  point  a. 

i  1 

5.    If  f(x)  =  ex,    we   have  /(  +  0)  =  «  ,  /(  -0)  =  0.    If  /(#)=*—  i,    then  /(  +  0)  =  0, 

l-ex 


6.  If  /(^)  =  sin  x,  lim  f(x)  is  indeterminate,  the  limits  of  indeterminancy  being  +  1,  -  1. 

X~x 

In  the  case  /(a?)  =  o?  sin  x,  the  corresponding  limits  of  indeterminancy  are  +00,  -  oo  . 

7.  Let  y  =  E(x\  where  E  (x}  denotes  the  integral  part  of  x.    This  function  is  dis- 
continuous when  x  has  an  integral  value  n;   we  then  have  E(n  —  0)  =  n—l,  E(n)  =  ny 
j£(w+0)=n. 

8.  Let  (o?)  denote  the  positive  or  negative  excess  of  ,v  over  the  nearest  integer  i  and 
when  M  exceeds  an  integer  by  ^,  let  (o;)  =  0.    This  function  is  continuous  except  for  fames 
j;  =  n  +  ^,  where  n  is  an  integer.    We  have  (;i  +  |)  = 


ORDINARY  DISCONTINUITIES. 

227.  Although  the  points  of  discontinuity  of  the  first  kind  have,  in  ac- 
cordance with  usage,  been  spoken  of  above  as  ordinary  discontinuities,  it  will 
be  shewn  that  they  are,  in  a  sense,  more  exceptional  than  discontinuities  of 
the  second  kind.  The  following  theorem  will  be  established: 

The  set  of  points  at  which  a  function  f(x)  has  ordinary  discontinuities  is 
enumerable  (or  finite,  or  absent};  whereas  the  set  of  points  of  discontinuity  of 
the  second  kind  may  be  imenumerable. 
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Let  k  be  a  positive  number,  and  let  Sk  denote  that  set  of  points  of  ordinary 
discontinuity  in  (a,  b)  at  which  the  saltus  of  f(x)  is  >  k.  If  the  set  Sk  be  not 
finite,  let  a  be  a  point  of  its  derivative.  Any  neighbourhood  of  a  contains  points 
of  Sk,  and  therefore  contains  pairs  of  points,  both  on  the  same  side  of  a,  such 
that  the  difference  of  the  values  of  f(x)  at  the  points  of  such  a  pair  is  nu- 
merically >  k.  It  follows  that  the  point  a  must  be  a  point  of  discontinuity  of 
the  second  kind,  and  thus  that  a  is  not  a  point  of  Sk ;  and  therefore  the  set  S^ 
is  an  isolated  set,  and  is  consequently  enumerable.  If  kl}  k2,  ...  be  a  sequence 
of  diminishing  numbers  that  converges  to  zero,  all  the  sets  S^,  Sk2,  ...  are 
enumerable  (or  finite,  or  non-existing),  and  therefore  their  outer  limiting  set, 
which  contains  all  the  points  at  which  f(x)  has  an  ordinary  discontinuity,  is 
enumerable  (or  finite,  or  absent). 


THE  SYMMETRY  OF  FUNCTIONAL  LIMITS. 

f 

228.  The  question  as  to  the  nature  of  the  set  of  points  at  which  a  function/(#) 
has  functional  limits,  on  one  side,  which  differ  in  value  from  the  corresponding 
functional  limits  on  the  other  side  has  been  investigated*  by  W.  H.  Young. 

The  general  result  obtained  is  that: 

Except  at  points  of  an  enumerable  set,  the  upper  and  lower  limits  of  a  function, 
on  the  right,  are  equal  to  the  upper  and  lower  limits  respectively,  on  the  left,  and 
there  is  no  external  saltus. 

Let  us  consider  the  set  of  points  G^,  A  at  which 

f(x  +  0)  -/(tf^O)  >  k,   f(x~+0)  >  A  ; 

where  A  and  k  are  positive  numbers.  If  the  set  G^A  is  not  enumerable,  it  con- 
tains a  component  H,  each  point  of  which  is  a  limiting  point  on  both  sides 
(§  89).  It  follows  that,  at  each  point  of  H,  f(x  —  0)  ^  A  ;  and  therefore 
f(x  +  0)  >  A  +  k,  at  each  point  of  H.  Similarly,  it  is  seen  that,  at  each  point  of  H, 
Ji(x  +  0)  >  A  +  2k ;  and  by  repeating  the  process  we  have  f(x  +  0)  >  A  +  nk, 
whatever  value  n  may  have.  It  then  follows  that  at  the  points  of  H,  f(x  -f  0)  =  +  oo  , 
and  therefore  f(x  —  0)  =  +  oo  ;  which  is  contrary  to  the  condition 


f(x+0)-f(x-0)>k. 

Therefore  the  set  G^  A  is  enumerable.  Giving  to  A  the  values  in  a  sequence  of 
decreasing  numbers  diverging  to  —  oc  ,  we  see  that  the  set  of  all  points  at  which 
f(x  +  0)  —  f(x  —  0)  >  k  is  enumerable.  Next,  letting  k  have  the  values  of  the 
numbers  in  a  decreasing  sequence  converging  to  zero,  we  see  that  the  set  of 
points  at  which  f(x  +-  0)  -f(x—  0)  >  0  is  enumerable.  Similarly,  it  may  be 
shewn  that  the  set  of  points  at  which  f(x  —  6)  —f(x  +  0)  >  0  is  enumerable. 
Therefore,  except  at  the  points  of  an  enumerable  set,  we  have  f(x  +  0 )  =f(x  —  0). 

*  See  Quarterly  Journal  of  Math.,  vol.  xxxix,  p.  67. 
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By  considering  the  function  —f(x),  for  which  an  upper  limit  is  obtained 
by  changing  the  sign  of  the  corresponding  lower  limit  of  f(x),  we  see  that 
f(x  +  0)  —f(x  —  0),  except  at  the  points  of  an  enumerable  set. 


Therefore  f(x  +  0)  -f(x  -  0)  and  f(x  +  0)  =/(a?-0),  except  for  values  of 
x  belonging  to  an  enumerable  set. 


In  the  proof  of  the  above  theorem  /(a?)  may  be  substituted  for  /(#+  0), 
without  affecting  the  reasoning.  It  then  follows  that  the  set  of  points  at  which 
f(%)>f(a;  —  Q)  is  an  enumerable  set.  By  changing  f(x)  into  —/(so),  it  then 
follows  that  the  set  at  which  /(#)</(#  —  ())  is  also  enumerable.  Similarly 

the  sets  at  which  f(x}  >f(x  +  0),  and  at  which  f(x)  </Q  +  0),are  enumerable. 
Hence,  except  at  the  points  of  an  enumerable  set,  we  have 


and 


The  theorem  has  now  been  completely  established. 
Also,  it  follows  that  : 

The  set  of  points  at  which  there  is  a  removable  discontinuity  is  enumerable 
(or  finite,  or  absent}. 

FUNCTIONS  CONTINUOUS  IN  AN  OPEN  INTERVAL. 

229.  The  theorem  that  a  function  that  is  continuous  in  a  closed  interval 
(a,  b)  attains  its  upper  and  lower  boundaries  (which  are  also  limits),  each  at  one 
point  in  the  interval,  may  be  extended  to  the  case  of  a  function  which  is  con- 
tinuous only  in  an  open  interval  (a,  b). 

We  have  in  that  case  the  following  theorem  : 

If  a  function  f(x)  be  continuous  in  an  open  interval  (a,  b),  and  if  f(a  +  0), 


f(b  —  0)  are  both  less  than  the  upper  boundary  of  the  function  in  the  interval, 
there  is  one  interior  point  at  least  at  which  the  function  attains  the  value  of  its 
upper  boundary.  Similarly  iff(a  +  0),  f(b  —  0)  are  both  greater  than  the  lower 
boundary  of  the  function  in  the  interval,  there  is  one  interior  point  at  least  at 
which  the  function  attains  the  value  of  its  lower  boundary. 

For  e,  e'  may  be  so  chosen  that,  for  a  <  x  ^  a  +  e,  and  for  b  —  e  £  x  <  b,  the 
upper  boundaries  of  the  function  are  less  than  in  (a,  b}.  The  original  theorem 
may  then  be  applied  to  the  closed  interval  (a  +  e,  b  —  e),  and  the  result  follows. 

The  theorem  of  §  214  may  also  be  extended  to  the  case  of  a  function  that  is 
continuous  in  an  open  interval  (a,  b).  We  have  the  following  theorem : 

If  f(x)  be  continuous  in  the  open  interval  (a,  b),  and  if  any  one  of  the 
functional  limits  at  a  is  of  opposite  sign  to  any  one  of  those  at  b,  there  is  at  least 
one  point  of  the  open  interval  at  which  f(x)  has  the  value  zero. 
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Let  f(a  +  0)  £  A  £f(a  +  0),  and  f(b  -  0)  ^  £  £/(6  -  0),  where  A  and  E 
ajre  any  functional  limits  at  a  and  b,  not  necessarily  extreme  limits;  and 
suppose  A  <  0,  5  >  0.  Positive  numbers  e,  e'  can  be  so  determined  that 


if  the  theorem  of  §  214  be  applied  to  the  closed  interval  (a  +  e,  b  —  e')  the  result 
follows. 


If  A,  B  satisfy  the  conditions  f(a  +  0)  ^  A  S/(a  +  0),  f(b  -  0) 


is  a£  £eas£  one  va£we  q/"  a;  m  </ie  open  interval  at  which  f(oc)  has  a  prescribed 
value  G,  lying  between  A  and  B. 

For  the  theorem  just  proved  can  be  applied  to  the  function  f(x)  —  C. 

If  f(x)  be  continuous  in  the  open  interval  (a,  b),  the  aggregate  of  functional 
limits,  on  the  right  at  a,  contains  every  number  in  the  interval  (f(a  +  0),  /(a  +  0)). 


Let  c  be  a  number  such  that /(a  +  0)  >  c  >f(a  +  0),  and  let  e  be  less  than 


t  (a  +  0)  -  c,  and  than  c  -/(a  +  0) ;  thus  /(a  +  0)  -  e  >  c  >/(a  +  0)  +  e. 

In  an  interval  (a,  a  +  h),  where  a  +  h<b,  there  is  a  set  of  points  GV1',  at  each 
point  of  which /(a  +  0)  +  e  >/(#)  >/(«  +  0)  -  e,  and  another  set  Gh(z),  at  each 
point  of  which  /(a  +  0)  +  e  >f(x)  >/(a  +  0)  —  e.  Let  £  be  the  upper  boundary 
of  the  set  GV1',  and  £2  the  upper  boundary  of  the  set  GV2).  We  have  then, 
since  /(#)  is  continuous  at  ^  and  £2, 


In  the  closed  interval  of  which  £1;  £2  are  the  end-points,  since  c  lies  between 
),  f(%z),  there  is  a  set  of  points  at  which  the  function  is  equal  to  c.  This 
set  must  be  closed ;  if  £  be  its  upper  boundary,  we  have/(£)  =  c ;  where  £  is  a 
definite  point  in  (a,  a  +  h).  If  now  we  assign  to  h  successively  the  values  in  a 
monotone  sequence  that  converges  to  zero,  we  obtain  a  sequence  of  points  £ 
that  converges  to  a,  and  at  each  point  of  the  sequence  the  value  of  f(x)  is  c. 
Thus  the  aggregate  of  functional  limits  at  a  includes  the  number  c.  This 
sequence  of  values  of  f  is  determinate,  and  ks  construction  does  not  involve 
the  use  of  the  principle  of  selection. 

If  we  assign  to  e  the  values  in  a  monotone  sequence  that  converges  to  zero, 
there  will  be  a  corresponding  sequence  of  values  of  ^  such  that,  in  that  sequence, 
/(&)  converges  to  f(a  +  0).  Similarly  the  sequence  /(£>)  will  converge  to 
/(q  +  0). 


H. 
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SEMI-CONTINUOUS   FUNCTIONS. 

230.  The  condition  of  continuity  of  a  function/  (x),  at  a  point  ^.namely  that, 
if  e  be  any  prescribed  positive  number,  an  open  interval  (x—h,  x  +  h)  exists 
such  that,  for  any  point  x  in  it,  f(x')  —f(x)  \  <  e,  can  be  divided  into  two 
separate  conditions,  viz.  that /(a/)  <f(x)  +  e,  and  that 

/(#)  >/(*)-* 

It  is  possible  that,  at  a  point  x,  one  of  these  conditions  may  be  satisfied  and 
not  the  other.  This  consideration  gives  rise  to  the  definition  of  a  property 
called  semi-continuity. 

If  (f>(x)  be  a  function  defined  for  a  continuous  domain,  and  if,  corre- 
sponding to  every  arbitrarily  chosen  positive  number  e,  an  open  neighbour- 
hood (x  —  h,  x  +  h)  of  a  particular  point  (x)  can  be  determined  such  that, 
for  every  point  x'  in  this  open  interval,  the  condition,  <f> (x)  <  $ (x)  +  e,  is 
satisfied;  then  the  point  x  is  said  to  be  a  point*  of  upper  semi-continuity  of 
the  function  <£  (x). 

If  an  open  neighbourhood  of  the  point  x  can  be  determined,  for  each  e, 
such  that  </>  (x'}  >  (f>  (x)  —  e,  then  the  point  x  is  said  to  be  a  point  of  lower 
semi-continuity  of  the  function  <£  (x). 

That  a  point  x  may  be  a  point  of  continuity  of  the  function  (f>  (x),  it  is 
necessary  that  both  the  above  conditions  be  satisfied. 

A  function  may  have  upper,  or  lower,  semi-continuity  at  a  point,  on  the 
right,  or  on  the  left. 

If  every  point  of  the  domain  (a,  b),  for  which  the  function  <£  (x)  is  defined, 
is  a  point  of  upper  semi-continuity,  then  the  function  <£  (x)  is  said  to  be  an 
upper  semi-continuous  function. 

A  similar  definition  applies  to  a  lower  semi-continuous  function. 

It  is  clear  that,  if  </>  (x)  be  a  lower  semi-continuous  function,  then  —  </>  (x) 
is  an  upper  semi-continuous  function.  Thus  the  properties  of  the  one  class  of 
functions  may  easily  be  extended  to  the  other  class. 

If  f  (x)  be  a  function,  defined  for  the  interval  (a,  b),  and  if  M  (x),  m  (x) 
denote  the  maximum  and  the  minimum  off(x)  at  the  point  x,  then  M  (x)  is  an 
upper  semi-continuous  function,  and  m  (x)  is  a  lower  semi-continuous  function. 

For  a  neighbourhood  (x  —  h,  x  +  h)  of  any  point  x  can  be  determined,  such 
that  the  value  (see  §  224)  of  f(x),  at  every  point  in  this  neighbourhood,  is 
less  than  M(x)  +  e,  where  e  is  a  prescribed  positive  number.  At  every  point 
in  (x  —  hl}x  +  Aj),  where  h1  is  chosen  <  h,  the  value  of  the  function  M  (x)  is  less 
than  M(x)  +  e.  Since  this  holds  for  every  value  of  e,  the  function  M  (x)  is 
upper  semi-continuous  at  x. 

*  See  Baire's  memoir  "  Sur  les  fonctions  des  variables  reelles,"  Annali  di  Mat.  (3  A),  vol.  in, 
1899. 
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It  is  clear  that  the  function  m(x),  where  m  (x)  denotes  the  minimum  of 
f(x)  at  the  point  x,  is  a  lower  semi-continuous  function. 

The  saltus  M  (x)  —  m  (.«),  of  the  function  f  (x),  may  be  taken  to  be  the 
value  of  a  function,  w  (x),  which  is  called  the  saltus-f unction  off(x). 

The  saltus-f  unction  <o  (x)  of  any  function  f  (x)  is  an  upper  semi-continuous 
function. 

For  M  (x)  and  —  m  (x)  are  both  upper  semi-continuous  functions,  and  it  is 
easily  seen  that  the  sum  of  two  such  functions  belongs  to  the  same  class. 

Instead  of  considering  the  functions  M(x),  m  (x)  we  may  consider  the  upper 
and  lower  associated  functions  A  (x},  a  (x)  (see  §  220).  It  can  be  proved  in  the 
same  manner  as  in  the  cases  of  M  (x),  m  (x)  that  the  upper  associated  function 
A  (x)  is  upper  semi-continuous,  and  that  the  lower  associated  function  a  (x)  is 
lower  semi-continuous. 


The  function  f(x  +  0)  is  upper  semi-continuous  on  the  right,  and/(#  +  0) 
is  lower  semi-continuous  on  the  right. 

If  </>  (x)  be  any  upper  semi-continuous  function,  then  the  set  of  points  for 
which  (f)  (x)  =  a.  is  a  closed  set,  where  a.  is  any  fixed  number. 

For  let  S  be  a  set  of  points  at  which  the  condition  </>(#)  ^  a  is  satisfied, 
and  let  P  be  a  limiting  point  of  S.  Let  us  suppose  that,  if  possible,  <£  (P)  <  a ; 
then  a  neighbourhood  of  P  can  be  determined,  such  that  at  every  point  in  it 
the  value  of  </>  (x}  is  less  than  a,  and  hence  this  neighbourhood  cannot  contain 
any  of  the  points  of  S;  but  this  is  contrary  to  the  hypothesis  that  P  is  a 
limiting  point  of  S.  It  follows  that  <£  (P)  =  a,  and  thus  that  the  set  of  points  for 
which  <f>  (x)  =  a  contains  all  its  limiting  points,  and  is  therefore  a  closed  set. 

It  can  be  shewn,  in  a  similar  manner,  or  it  can  be  deduced  from  the  above 
theorem,  that  the  set  of  points  for  which  ty  {x)  ^  a,  is  a  closed  set;  where  ty  (x)  is 
any  lower  semi-continuous  function. 

If  we  apply  the  theorem  proved  above  to  the  saltus-function  o>  (x),  of  any 
function  f(x),  we  obtain  the  following  theorem: 

Having  given  any  function  f(x}  defined  for  a  continuous  domain,  the 
saltus-function  <w  (x)  is  such  that  the  set  of  points  for  which  to  (x)  ~  a,  forms  a 
closed  set. 

231.  In  an  open  interval,  f(x)  and  the  minimal  function  m  (x)  have  the 
same  lower  boundary. 

For,  if  there  are  points  at  which  f(x}  is  between  A  —  e,  A  +  e;  at  any  such 
point  m  (x)  ^  A+e.  If  A  is  the  lower  boundary  off(x),  we  see  that  the  lower 
boundary  of  m  (x)  is  ^  A,  since  e  is  arbitrarily  small.  Again  if  there  are  points 
at  which  m(x)  is  between  A'  —  e,  A'  +  e,  there  must  be  a  point  at  which  f(x) 
is  also  between  these  numbers;  hence  if  A'  is  the  lower  boundary  of  m  (x),  the 
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lower  boundary  off(x)  is  <  A'+  e,  arid  is  therefore  ^  A'.  The  two  inequalities 
shew  that  f(x)  and  m  (#)  have  the  same  lower  boundaries. 

If  0  (x)  is  an  upper  semi- continuous  function,  defined  in  the  interval  (a,  b), 
the  set  of  points  at  which  w  (x)  —  a-  is  non-dense  in  the  interval  (a,  b). 

We  have  in  this  case  M  (x)  =  $  (x),  and  thus  «o  (x)  =  <£  (x)  —  m  (x}.  If 
possible,  let  there  be  an  open  interval  in  which  at  every  point  4>(x)  —  m(x)  =0-; 
and  thus  m(x)^<f>  (x)  —  a-.  If  «  be  a  point  of  the  interval,  and  e  ( <  cr)  be  arbi- 
trarily chosen,  a  point  #,  in  an  arbitrarily  small  neighbourhood  of  a,  contained 
in  the  interval,  exists  such  that  <£  (a?j)  <  m  (a)  +  e  <  </>  (a)  —  (a  —  e).  In  an 
arbitrarily  small  neighbourhood  of  xlt  also  contained  in  the  open  interval,  a 
point  tfo  can  be  determined  such  that  <f>  (x.2)  <  </>  (x^  —  (cr  —  e),  or  that 

</>(*2)<0(a)-2(cr-e). 

Proceeding  in  this  manner,  a  point  xn,  arbitrarily  near  to  a,  can  be  so  determined 
that  (f)(xn)<  (j>  (a)  —  n  (cr  —  e),  for  any  integer  n;  and  thus  <f)(xn)  is  an  arbi- 
trarily great  negative  number.  It  follows  that  there  exists  an  everywhere 
dense  set  of  points  at  Avhich  <£(#)<—  A,  where  A  is  an  assigned  positive 
number.  For,  if  ft  be  a  point  at  which  <£  (x)  <  —  A',  where  A'  >  A,  a  neighbour- 
hood of  /3  exists  in  which  </>  (x)  <  —  A'  +  e  <  —  A,  if  e  be  chosen  <  A'  —  A. 
This  neighbourhood  contains  no  points  of  the  set  for  which  <£(#)  =  —  A. 
Since  the  points  (3  are  everywhere  dense,  we  see  that,  in  any  interval,  a  sub- 
interval  exists  which  contains  no  points  at  which  </>(#)  ^  —  A,  and  thus  the 
set  of  such  points  is  non-dense.  Giving  to  A  the  values  in  a  divergent 
sequence  of  increasing  numbers,  and  remembering  that  0  (x)  is  finite  at 
every  point,  we  see  that  the  points  of  the  continuum  (a,  b)  are  divided  into 
the  sum  of  sets  of  a  sequence  of  non-dense  sets.  But  this  is  impossible,  since 
the  continuum  does  not  form  a  set  of  the  first  category.  It  follows  that  there 
cannot  exist  an  everywhere  dense  set  of  points,  corresponding  to  each  positive 
number  A,  such  that  <f>(x)<—  A.  Thus  no  open  interval  exists  in  which 
to  (x)  =  cr,  and  therefore  the  set  of  points  at  which  o>  (x)  =  cr  is  non-dense.  We 
have  now  the  equivalent  form  of  the  above  theorem : 

The  points  of  discontinuity  of  an  upper  semi-continuous  function  form  a 
set  of  the  first  category. 

• 

232.  If  (f)  (x)  be  an  upper  semi-continuous  function,  and  if,  at  every  point 
of  the  interval  (a,  b),  the  minimum  of  <f>  (x)  be  zero,  then  there  exists  a  set  of 
points,  everywhere  dense  in  the  interval,  at  which  (f)  (x)  is  itself  zero. 

To  the  interval  (a,  b),  the'  field  of  the  variable  x,  let  there  be  fitted  on 
a  system  of  nets  with  closed  meshes.  Let  P  be  the  centre  of  a  'mesh  dn,  of  Dn; 
then  since  the  minimum  of  <£  (x)  at  P  is  zero,  a  mesh  dni  (^1  >  n),  of  the 
sequence  of  meshes  which  defines  P,  must  be  such  that  in  every  point  of  dni 
we  have  0  ^  <£  (x)  <  e/2.  Another  mesh  d^,  contained  in  dni,  can  be  so  deter- 
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mined  that,  for  all  points  of  it,  0  ^  <f>  (x)  <  e/22 ;  and  so  on.  We  have  then  a 
sequence  of  meshes  dni,  dnt,  ...  each  of  which  contains  the  next,  and  such 
that,  in  dnf,  we  have  0  S  <£(V»  <  e/2r.  This  sequence  defines  a  point  P  (x),  in  all 
of  them,  for  which  0  ^  <f>  (x)  <  e/2r,  for  every  value  of  r.  Therefore.  <f>  (x)  =  0; 
and  every  mesh  dn  contains  a  point  such  as  x.  It  is  thus  seen  that  <£  (x} 
vanishes  at  the  points  of  an  everywhere  dense  set. 

In  particular,  we  see  that,  if  w  (x)  be  the  saltus-function  of  any  given  func- 
tion /(#),  and  if  w  (x)  has  its  minimum  equal  to  zero  at  every  point  of  the  domain 
of  x,  then  o>  (x)  vanishes  at  an  everywhere  dense  set  of  points.  The  points  of  this 
set  are  the  points  of  continuity  off(x). 

233.  If  infinite  values  of  a  function  are  admitted,  the  function  f(x)  may 
still  be  regarded  as  semi-continuous,  when  it  is  infinite. 

Consider  a  function  /(«•)  which  has  infinite  values ;  the  function 


is  bounded,  and  the  values  1,  —  1  correspond  to  values  -I-  x  ,  —  x  respectively, 
f(x\ 

We  have          /<«0 -/(«)- n      ,f^7^' 


provided  $  (x),  $  (x'}  have  the  same  sign  ;  hence  if  </>  (x)  —  $(#)<  e,  then 


tl  _,  (a)  I,. 

Therefore,  if  <£  (x)  is  upper  semi-continuous  at  x,  so  also  isf(x).   In  case  <f>  (x) 
is  positive,  and  </>  (x')  negative,  </>  (x'}  <  <£  (x),  and  then  f(x)  <  f(x). 

Iff(x)  =  oo  ,  we  have  <jj  (x)  =  1,  and  at  all  other  points  <£  (x')  ^  $  (x);  there- 
fore a  point  where  f(x)  =  +  x  is  a  point  where  $  (x)  is  upper  semi-con- 
tinuous ;  and  similarly  where/(#)  =  —  x  ,  0  (x)  is  lower  semi-continuous. 

It  is  therefore  convenient  to  regard  f(x)  as  upper  semi-continuous,  where 
it  has  the  value  +  x  ,  and  lower  semi-continuous  where  it  is  —  x  ;  the  corre- 
spondence with  <£  (x)  being  thus  preserved. 

234.  The  following  theorem  is  a  generalization*  of  the  theorem  of  §  217, 
that  a  continuous  function  is  uniformly  continuous  in  its  closed  domain. 

If  f(x)  be  bounded  in  the  interval  (a,  b),  and  if  k  be  a  number  greater  than 
the  upper  boundary  of  <a(x)  in  (a,  6),  there  exists  a  positive  number  a,  such  that, 
in  every  closed  sub-interval  in  (a,  b)  of  length  not  exceeding  a,  the  fluctuation-  of 
f(x)  is  <  k. 

It  is  convenient  to  assume  that,  for  values  of  x  that  are  <  a,  f(x)  is  defined 
to  be  equal  to  /(a),  and  that,  for  values  of  x  that  are  >b,f(x)  =f(b}. 

*  See  Baire,  loc.  cit.,  p.  15. 
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If  x  be  any  point  in  (a,  b),  the  fluctuation  of  f(x)  in  the  interval  (x  —  h,  x+h) 
is  <  k,  provided  h  be  sufficiently  small.  The  upper  boundary  of  all  values  of 
h  for  which  this  is  the  case  may  be  denoted  by  h.  If  6  be  a  fixed  number 
such  that  0  <  6  <  1,  the  fluctuation  of  the  function  in  the  closed  interval 
(x  —  6h,  x  +  6h)  is  <  k.  Corresponding  to  each  point  x  in  (a,  b),  a  definite 
interval  may  be  assigned,  viz.  the  interval  (as  —  6h,  x  +  Oh),  or  the  part  of 
it  which  is  interior  to  (a,  b).  By  the  Heine-Borel  theorem,  a  finite  set 
of  these  intervals  exists,  such  that  every  point  of  (a,  b)  is  interior  to  at  least 
one  of  these  intervals.  The  end-points  of  the  intervals  of  this  finite  set  A 
form  a  finite  set  of  points  in  (a,  b) ;  let  a  be  the  smallest  of  the  distances  be- 
tween consecutive  points  of  this  set.  Any  closed  interval  whatever,  in  (a,  b), 
of  length  not  exceeding  a  is  in  one  of  the  intervals  of  the  finite  set  A.  There- 
fore, in  such  an  interval,  the  fluctuation  of  f(x)  is  <  k. 

The  corresponding  theorem  for  the  case  of  a  bounded  function  defined  in 
a  cell  of  two  or  more  dimensions  may  be  stated  as  follows : 

If  a  function  be  bounded  in  a  cell  (a,  b)  of  any  number  of  dimensions,  and 
if  k  be  a  number  greater  than  the  upper  boundary  of  the  saltus-function  in  the 
whole  cell,  there  exists  a  positive  number  a  such  that,  in  every  closed  cell  con- 
tained in  (a,  b),  of  span  not  exceeding  a,  the  fluctuation  of  the  function  is  <  k. 

The  proof  is  a  modification  of  that  given  above  for  the  case  of  a  linear  inter- 
val. There  is  shewn  to  exist  a  finite  set  of  closed  cells,  such  that  in  each  of  them 
the  fluctuation  of  the  function  is  <  k,  and  such  that  each  point  in  the  cell  (a,  b) 
is  interior  to  one  of  the  cells  of  the  finite  set.  If  we  project  on  the  edges  of 
(a,  b)  all  the  edges  of  the  cells  of  this  finite  set,  we  have  on  each  edge  of 
(a,  b)  a  finite  set  of  points.  The  number  a  can  then  be  taken  to  be  the  least 
distance  between  consecutive  points  of  these  finite  sets,  when  all  the  points 
on  all  the  edges  of  (a,  b)  are  taken  into  account.  Any  cell  whatever,  in  (a,  b), 
of  span  not  exceeding  a  will  be  in  one  of  the  cells  of  the  finite  set,  and  there- 
fore the  fluctuation  of  the  function  in  such  a  cell  is  <  k. 

The  definitions  of  semi-continuity  given  above,  and  the  theorems  estab- 
lished, are  applicable  when  the  domain  of  the  variable  x  is  not  an  interval 
(a,  b),  but  any  closed  set  of  points.  In  that  case,  we  regard  functional  values 
in  any  interval  as  only  existing  at  those  points  in  the  interval  which  belong 
to  the  domain  of  x. 

Moreover,  the  definitions  and  theorems  are  applicable  to  the  case  of 
functions  of  a  number  p  of  variables.  In  this  case,  instead  of  an  interval 
(x  —  h,  x  +  h)  used  in  defining  semi-continuous  functions  and  the  saltus- 
function,  we  may  employ  the  "sphere" 

(p  _  xwy-  +  (p>  _  ^(2))2  +  . . .  +  (g»)  -  aAp)y*  <  h*,  or  ^  h\ 

according  as  the  neighbourhood  is  open  or  closed,   or  else  a  cell   may  be 
employed. 
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EXAMPLE. 


If*  f(x)  be  any  function,  <£  (x)  its  maximum,  and  ty(x}  its  minimum  at  the  point  #,  and 
<7  (#)  be  any  continuous  function,  then  <£  (x)  +g  (x),  \js  (x)  +g  (x}  are  the  maxima  and  minima 
at  x,  of  f(x)  +g  (x}.  Also  the  functions  <£  (x)  -/(#),  f(x)  —  ^  (x}  have  each,  in  any  domain, 
the  minimum  zero. 

APPROXIMATE   CONTINUITY. 

235.  A  function  f(x)  is  saidf  to  be  approximately  continuous  at  a  point  a 
of  the  domain  (a,  b)  of  the  variable  x,  when,  corresponding  to  each  arbitrarily 
chosen  number  e,  the  set  of  points  x  for  which  \f(x)  —/(a)  j  <  e  has  the  metric 
density  unity  at  the  point  a. 

The  necessary  and  sufficient  condition  that  the  function  f(x)  should  be  ap- 
proximately continuous  at  the  point  a  is  that  it  should  be  continuous,  at  a,  rela- 
tively to  a  set  of  points  G  which  has  the  metric  density  unity  at  the  point  a. 

The  condition  in  the  theorem  is  sufficient ;  for  there  then  exists  a  number 
h,  such  that,  in  the  interval  (a  —  h,  a  +  h),  the  condition  j /(#)—/(«)  <  e  is 
satisfied  for  every  point  x  that  belongs  to  G.  This  part  of  G  has  the  same 
metric  density  at  a  as  G  itself. 

To  shew  that  the  condition  is  necessary,  let  it  be  assumed  that  the  set 
G  (a,  e)  of  points  at  which  \f(x)  —/(a)  \  <  e  has  the  metric  density  1  at  the 
point  a,  whatever  value  e  may  have. 

Let  {en}  be  a  sequence  of  decreasing  values  of  e  that  converges  to  zero ; 
for  each  €n  a  number  hn  (<  An-i)  can  be  so  determined  that  the  measure  of 
the  part  of  G(a,  en)  in  the  interval  (a  —  hn,  a  +  hn)  is  >2hn(l  —  en).  The 
numbers  hn  may,  if  necessary,  be  so  altered  that  hn+l/hn<  l/en,  for  every 
value  of  n. 

Let  Fn  be  the  part  of  G  (a,  en)  that  is  in  the  two  intervals  (a  +  enhn+1 ,  a  +  hn), 
(a  -  hn,  a  —  6nhn+l);  then  Fn+m  is  a  part  of  G(a,  en+m),  and  therefore  of  G  (a,  en). 
If  G  be  the  set  M(Fl}F2, ...  Fn,...\  then  the  set  G  has  the  metric  density  1 
at  the  point  a,  since  m (Fn)>  2hn (1  —  2en).  Let  a;1,a;2,...xm,...  be  a  sequence  of 
numbers  converging  (increasing,  or  decreasing)  to  a,  and  such  that  all  of  them 
are  points  of  G.  Each  one  of  them  xm  belongs  to  one  of  the  sets  Flt  F2. ...  Fn,  ...', 
say  to  Fjn-  It  is  clear  that  the  numbers  m  increase  as  m  does  so,  and  that  m  ~  oc 
as  m  ~  oo  . 

We  have  then  |/(#m)  —  /(«)  |  <  e^  ;  and  therefore  the  sequence  {f(xm)}  con- 
verges to  /(a).  Hence  f(x)  is  continuous  relatively  to  G;  also  G  has  the 
metric  density  1  at  the  point  a. 

*  Baire,  loc.  cit.,  p.  9. 

t  Denjoy,  Bull,  de  la  Soc.  Math,  de  France,  vol.  XLIII,  p.  165. 
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THE   CLASSIFICATION   OF   DISCONTINUOUS  FUNCTIONS. 

236.  Let  us  suppose  a  function  to  be  defined  for  all  points  in  a  continuous 
interval  (a,  b) ;  at  each  point  x,  the  saltus  of  the  function  has  a  finite  value,  or 
is  indefinitely  great,  its  value  being  zero  at  a  point  of  continuity.  With  a  view 
to  the  classification  of  functions,  in  accordance  with  the  distribution  of  the 
points  of  continuity,  and  of  discontinuity,  in  the  interval  (a,  b),  the  question 
arises,  what  is  the  most  general  distribution  of  the  points  of  continuity  ? 

The  answer  to  this  question  is  contained  in  the  theorem : 

The  points  of  continuity,  if  they  exist,  of  a  function  defined  for  a  continuous 
interval,  form  an  ordinary  inner  limiting  set. 

To  prove  this  theorem,  let  e  be  a  fixed  positive  number,  and  enclose  each 
point  P,  of  continuity  of  a  function  /(#),  in  an  interval  so  chosen  that  the 
fluctuation  of  f(x)  therein  is  <  e ;  all  the  points  of  continuity  are  then  en- 
closed in  a  set  of  intervals  which  in  general  overlap ;  let  these  intervals  be 
replaced  by  the  equivalent  set  Aer  of  non-overlapping  intervals.  Imagine  these 
sets  A6  constructed,  corresponding  to  a  sequence  of  diminishing  values  of  e 
which  converges  to  zero ;  there  exists  then  a  set  of  points  which  are  interior 
to  intervals  of  all  these  sets  of  intervals,  since  this  set  of  points  includes  all 
the  points  of  continuity  of  f(x).  If  Q  be  any  point  which  belongs  to  the 
inner  limiting  set  so  defined,  Q  must  be  a  point  of  continuity  of  f(x) ;  for, 
corresponding  to  any  arbitrarily  small  number  en,  Q  is  in  the  interior  of  some 
interval  in  which  the  fluctuation  of  the  function  is  <  en,  and  thus  Q  is  a  point 
of  continuity  of  the  function. 

In  accordance  with  the  theorems  which  have  been  obtained  in  §  97,  re- 
lating to  ordinary  inner  limiting  sets,  the  points  of  continuity  of  a  function 
may  form  an  enumerable  set  which  contains  no  component  dense  in  itself,  or 
else  they  form  a  set  of  the  cardinal  number  of  the  continuum.  In  the  latter 
case  the  set  is  a  residual  set,  provided  it  be  everywhere  dense.  For  the  closed 
set  at  which  <w  (#)  ^  e,  will,  for  every  value  of  e,  be  non-dense ;  and  the  points 
of  discontinuity  accordingly  form  a  set  of  the  first  category. 

These  results  lead  to  the  following  classification*  of  functions  : 

(1)  A  function  may  have  no  points  of  continuity ;  it  is  then  said  to  be 
totally  discontinuous. 

(2)  The  points  of  continuity  may  form  an  enumerable  set  which  has  no 
component  dense  in  itself. 

(3)  The  set  of  points  of  continuity  may  be  of  the  cardinal  number  of  the 
continuum,  and  be 

*  See  a  paper  by  W.  H.  Young,  Wiener  Sitzungsber.,  vol.  cxn,  Abt.  n  a,  p.  1307. 
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(a)  non-dense ; 

(b)  everywhere  dense  and  unclosed,  in  which  case  the  function  is  said 
to  be  a  point-wise  discontinuous  function ; 

(c)  everywhere  dense  and  closed,  in  which  case  the  function  is  con- 
tinuous ; 

(d)  everywhere  dense  in  each  interval  of  a  set,  and  non-dense  in  each 
interval  of  another  set  external  to  the  former  one. 

This  last  case  (d)  is  not  essentially  distinct  from  the  former  ones. 

By  Hankel  and  others  the  term  "  totally  discontinuous  "  has  been  applied 
to  all  functions  which  are  neither  continuous  nor  point-wise  discontinuous. 

237.  It  has  been  shewn  by  W.  EL  Young  that  a  function  can  be  con- 
structed which  is  continuous  at  every  point  of  any  given  ordinary  inner  limit- 
ing set  of  points,  and  is  discontinuous  at  every  other  point  of  the  interval. 

Let  E  denote  an  ordinary  inner  limiting  set,  and  let  the  function  f(x)  be 
defined  as  follows : 

(1)  At  every  point  x,  of  E,  let/(#)  =  oc. 

(2)  It   has   been   shewn   that   a   sequence   of   sets   of   non-overlapping 
open  intervals  can  be  constructed  such  that  the  only  points,  each  of  which  is 
in  an  interval  of  every  set,  are  the  points  of  E.    Let  Q  be  a  limiting  point  of 
E  which  does  not  belong  to  E ;  then  a  number  n  exists  such  that  Q  is  in  an 
interval  of  the  (n  —  l)th  set,  but  not  in  one  of  the  intervals  of  the  wth  set.    Let 
this  interval  of  the  (n  —  l)th  set  be  of  length  dQ;   and  at  the  point  Q  let 
f(z)  =  XQ  +  endQ,  where  e  is  a  fixed  positive  number  less  than  unity;  in  the 
case  n  =  1,  we  put  dq  =  e. 

(3)  If  R  be  a  point  which  does  not  belong  either  to  E  or  to  its  derivative, 
it  must  lie  between  two  definite  points  A,  B  both  of  which  belong  to  E  or  to 
E',  and  such  that  no  point  of  E  or  of  E'  lies  between  A  and  B.   If  XR  be  a 
rational  number,  let  f(xR}  =  XA  or  XB,  according  as  R  is  nearer  to  A  or  to  B; 
when  XR  is  irrational,  \etf(xR)  =  a;E;  and  if  R  be  the  middle  point  of  AB,  let 
f(xR}  =  xR. 

It  is  clear  that  the  function  so  defined  is  discontinuous  at  every  internal 
point  of  the  interval  AB,  and  at  the  end-point  A  it  is  continuous,  or  dis- 
continuous, on  the  right,  according  as  A  does,  or  does  not,  belong  to  E ; 
a  similar  result  holds  for  B.  It  has  thus  been  shewn  that  the  function  is 
discontinuous  at  every  point  which  does  not  belong  to  E. 

To  shew  that,  at  any  point  P,  of  E,  the  function  is  continuous,  consider 
those  intervals,  one  of  each  set  in  the  sequence,  which  contain  the  point  P : 
the  lengths  of  these  intervals  will  have  a  lower  limit  d,  which  may  be  zero. 


298  Functions  of  a  real  variable  [OH.  v 

In  every  interior  point  of  d  we  have  /(#)  =  x  ;  and  thus,  if  P  be  interior 
to  d,  P  is  a  point  of  continuity  of  the  function.  If  P  be  an  end-point  of  d,  it 
is  certainly  continuous  on  the  side  towards  the  interval;  and  we  have  to 
shew  that  it  is  also  continuous  on  the  other  side.  Choose  an  arbitrarily  small 
number  cr,  and  an  arbitrarily  large  integer  m  ;  then  a  number  n1>m  can 
be  found  such  that  the  Mjth  interval,  and  all  subsequent  intervals  of  the  sequence 
which  contain  P,  are  of  length  between  d  and  d  +  a.  The  piece  of  one  of  these 
intervals  which  is  not  a  portion  of  d  is  of  length  «r;  suppose  that  Q  is  a 
point  in  this  piece  which  belongs  to  E'  but  not  to  E  :  then 

i  /Op)  -/Oe)    =    Xp-xQ-  endQ    <  \  a-  +  em  (d  +  er)  | 
since  n  ^  n^  >  m,  and  dQ  <  d  +  a.   From  this,  there  follows 
f(xp)  -/(XQ)  <  2o-  +  emd  <  3<r, 


if  m  be  chosen  sufficiently  great.  If  R  be  an  interior  point  of  an  interval 
AB  which  contains  in  its  interior  no  point  of  E',  or  of  E,  and  if  the  points 
A,  B  be  both  so  near  to  P  that  their  distances  from  it  are  less  than  cr,  we 
have  f(xp}—f(xR)  '  <  cr,  in  virtue  of  the  definition  of  f(xR};  also  if  only 
one  of  the  ends  A,  B  be  within  the  interval  of  length  <  d  +  cr,  which  has  been 
chosen,  then  an  interval  further  on  in  the  sequence  can  always  be  found 
such  that  the  middle  point  of  AB  is  exterior  to  it,  and  thus  the  inequality 
f(xp)—f(xR)  \  <  cr  holds  as  in  the  former  case.  It  has  now  been  shewn 
that,  for  any  arbitrarily  chosen  a,  a  neighbourhood  of  P  can  be  found  such 
that,  for  all  points  x  in  it,  |  f(xp)—f(x]  \  <  3cr  ;  therefore  P  is  a  point  of 
continuity  of  the  function.  The  case  in  which  d  =  0  does  not  require  separate 
treatment. 


EXAMPLES. 

1.*  Let  G  denote  a  non-dense  perfect  set  of  points  in  the  segment  (0,  1),  such  that 
the  end-points  of  the  complementary  intervals  are  rational  points.  Let/(#)  be  defined 
thus  : — At  every  irrational  point  inside  an  interval  complementary  to  G,  let  f(.v)  =  x  ;  at 
every  rational  point  of  such  interval,  let  /  (x)  be  equal  to  the  value  of  x  at  the  middle 
point  of  the  interval ;  and  at  every  point  external  to  a  complementary  interval,  let/(#)  =  1. 
This  function  is  discontinuous  except  at  the  middle  points  of  the  intervals  complementary 
to  G  ;  thus  the  set  of  points  of  continuity  is  an  enumerable  set  which  contains  no  com- 
ponent dense  in  itself. 

2.*  With  the  same  non-dense  perfect  set  as  in  Ex.  1,  let  AB  be  a  complementary 
interval  of  G,  and  M  its  middle  point.  At  every  rational  point  of  AM  except  J/,  let 
f(.c)  =  xA,  and  at  every  rational  point  of  MB  except  M,  let  f(x}=x^ ;  also  at  all  points  of 
(0,  1),  except  those  for  which  the  functional  value  has  been  already  specified,  let  f(x)  —  x. 
In  this  case  the  points  of  continuity  are  non-dense,  and  of  the  power  of  the  continuum. 

*  See  W.  H.  Young,  loc.  cit. 
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POINT- WISE  DISCONTINUOUS  FUNCTIONS. 

238.  A  function  being  defined  for  the  continuous  domain  (a,  6),  it  has 
been  shewn  in  §  230  that,  if  k  be  any  fixed  positive  number,  those  points,  at 
which  the  saltus  of  the  function  is  =  k,form  a  closed  set. 

This  theorem  follows  immediately  from  the  property  of  semi-continuous 
functions  established  in  §  230,  by  considering  the  saltus-function.  It  may  also 
be  proved  directly  as  follows : 

If  P  be  a  limiting  point  of  the  set  for  which  the  saltus  is  ~  k,  then  in  any 
arbitrarily  small  neighbourhood  of  P  there  are  points  of  the  set ;  hence  the 
fluctuation  of  the  function  in  this  neighbourhood  is  =  k,  and  therefore  the 
saltus  at  P  is  =  k. 

Moreover,  such  a  limiting  point  P,  of  the  set  of  points  at  which  the  saltus 
is  ^  k,  must  be  a  point  of  discontinuity  of  the  second  kind,  at  least  on  one 
side  of  P.  If,  at  P,  the  function  have  a  limit  on  the  right,  a  neighbourhood 
PQ  can  be  found  such  that  the  inner  fluctuation  in  PQ  is  <  k;  hence  inside  PQ 
there  can  be  no  point  at  which  the  saltus  is  ^  k ;  and  therefore  P  is  not  a 
limiting  point  on  the  right,  of  the  set  for  which  the  saltus  is  =  k.  A  similar 
remark  applies  to  the  left  of  P. 

We  have  already,  in  §  236,  defined  a  point- wise  discontinuous  function  as 
one  of  which  the  points  of  continuity  are  everywhere  dense  and  unclosed  in 
the  domain  of  the  function;  this  definition  is  that  given  by  Dini*,  and  is 
equivalent  to  the  following  definition  given  by  Hankelf : 

A  point-wise  discontinuous  function  is  one  for  which  those  points  at  which 
the  saltus  is  =  k,  an  arbitrarily  chosen  positive  number,  form  a  non-dense  set 
K,  whatever  value  k  may  have. 

That  this  set  is  closed  has  been  shewn  above. 

To  prove  the  equivalence  of  the  two  definitions,  let  it  be  assumed  that 
in  any  arbitrarily  chosen  sub-interval  (a,  /3),  a  point  of  continuity  xl  can  be 
found.  A  neighbourhood  can  be  found  for  xly  internal  to  (a,  {3),  in  which  the 
fluctuation  of  the  function  is  <  k,  and  this  neighbourhood  can  contain  no 
point  at  which  the  saltus  is  =  k ;  hence  the  points  at  which  the  saltus  is  ~  k 
form  a  non-dense  set  K,  since,  interior  to  any  sub-interval,  a  sub-interval  can 
be  found  which  contains  no  point  of  the  set  K. 

Conversely,  choose  a  descending  sequence  of  values  of  k,  say  &1;  k.2,  k3, ... 
which  converges  to  zero,  and  let  K1,KZ,KS,...  be  the  corresponding  non- 
dense  closed  sets,  each  of  which  necessarily  contains  the  preceding  one ;  then 
the  set  M  (Kl,  Kz,  K3, ...)  is  the  set  of  all  the  discontinuities  of  the  function. 

*  See  Grundlagen,  p.  81. 

t  Math.  Annalen,  vol.  xx,  p.  90.  This  is  a  reproduction  of  Hankel's  Univ.  Programm, 
Tubingen,  1870,  entitled  "  Untersuchungen  iiber  die  unendlich  oft  oscillierenden  und  unstetigen 
Functionen." 
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In  accordance  with  §  93,  this  set  is  of  the  first  category,  and  the  comple- 
mentary set,  which  is  the  set  of  points  of  continuity  of  the  function,  is  every- 
where dense,  and  has  the  cardinal  number  of  the  continuum,  being  a  set  of 
the  second  category,  of  the  kind  that  is  called  a  residual  set. 

It  will  be  observed  that  the  set  of  all  the  points  of  discontinuity  may  be 
either  everywhere  dense,  or  non-dense,  in  the  whole  or  part  of  the  domain  of 
the  variable.  This  set  may  be  finite,  enumerably  infinite,  or  of  the  power  of 
the  continuum. 

In  accordance  with  the  theorem  proved  in  §  231,  an  upper  semi-continuous 
function  can  only  be  point-wise  discontinuous. 

The  set  K,  although  non-dense,  is  not  necessarily  of  content  zero.  By 
Harnack*,  the  term  point-wise  discontinuous  function  was  only  used  for  such 
functions  as  possess  the  property  that  the  set  K,  for  each  value  of  k,  has 
content  zero.  It  will  be  seen  that  this  latter  case  is  of  special  importance  in 
connection  with  the  theory  of  Riemann  integration. 

It  has  already  been  shewn,  in  §  236,  that  the  points  of  continuity  of  the 
point-wise  discontinuous  function  form  an  ordinary  inner  limiting  set ;  and  if 

ft},  {«,...{«,-  ' 

be  the  sets  of  intervals  complementary  to  the  closed  sets  Klt  K2>  ...  Kn,  ..., 
they  form  a  sequence  of  sets  of  non-overlapping  intervals  which  define  the  set 
of  points  of  continuity  as  their  inner  limiting  set. 

The  whole  theory  of  point- wise  discontinuous  functions  is  applicable  to 
the  case  in  which  the  domain  of  the  variable  is  not  a  continuum,  but  is  any 
perfect  set.  In  this  case  also,  the  points  of  continuity  of  a  point-wise 
discontinuous  function  are  everywhere  dense  relatively  to  the  perfect  domain, 
and  the  points  at  which  the  measure  of  discontinuity  is  =  k  form  a  closed 
set,  non-dense  relatively  to  the  domain  of  the  variable.  That  this  is  the  case 
may  be  shewn  by  making  the  points  of  the  perfect  set  correspond  in  order  to 
the  points  of  a  continuous  interval,  as  explained  in  §  96.  The  points  of  dis- 
continuity, and  those  of  continuity,  relatively  to  the  perfect  domain,  are  sets 
of  the  first  and  second  category  respectively,  relative  to  that  domain. 

239.  An  important  class  of  functions  is  that  in  which  each  function  has 
ordinary  discontinuities  only ;  the  domain  of  the  functions  being  either  a  con- 
tinuous interval,  or  a  perfect  set  of  points. 

A  function  which  has  only  ordinary  discontinuities  is  point-wise  discontinuous. 

The  set  of  points  at  which  the  saltus  is  ^  k  must  be  finite.  For  if  it  were 
not  finite  it  would  have  a  limiting  point  which  would  be  a  point  of  discontinuity 
of  the  second  kind ;  and  this  does  not  exist.  It  thus  follows  that  the  set  of 
points  of  discontinuity  is  a  non-dense  enumerable  set. 

*  Math.  Annalen,  vol.  xix,  p.  242,  and  vol.  xxiv,  p.  218. 
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A  monotone  function  is  one  such  that,  for  every  pair  of  values  xlf  xz  of  the 
variable,  such  that  #2>ff,,  the  condition  /(#2)  =/(#•])  is  satisfied;  or  else,  for 
ever)/  such  pair  the  condition  f  (x2)  ^f(x^)  is  satisfied. 

In  the  former  case  the  function  is  said  to  be  monotone  non-diminishing,  and 
in  the  latter  case  monotone  non-increasing.  Since  a  monotone  function  has 
no  oscillations  in  the  neighbourhood  of  any  point,  every  discontinuity  is  an 
ordinary  one.  It  follows  that : 

A  monotone  function  is  point-wise  discontinuous  (or  continuous).  Its  points 
of  discontinuity  form  an  enumerable  set.  The  points  at  which  the  saltus  is 
^  k  (>  Q)form  a  finite  set. 

The  domain  of  the  variable  being  either  a  continuous  interval  or  any  perfect 
set,  let  us  suppose  that,  at  every  point,  the  oscillation  (see  §  225)  of  the  function, 
both  on  the  right,  and  on  the  left,  is  <  k',  there  can  then  only  be  a  finite  number 
of  points  at  which  the  saltus  is  =  k  ;  i.e.  the  set  K  is  finite. 

For  if  K  were  not  finite,  it  must  contain  a  limiting  point  P,  which  has  been 
shewn,  in  §  238,  to  be  a  point  of  discontinuity  of  the  second  kind.  Any  arbitrarily 
small  neighbourhood  of  P,  on  one  side  at  least,  must  therefore  contain  points 
at  which  the  saltus  is  ^  k,  and  hence  the  oscillation  at  P  on  this  side  could 
not  be  <  k. 

The  domain  can  therefore  be  divided  into  a  finite  number  of  parts  within 
each  of  which  there  is  no  point  at  which  the  saltus  is  =  k ;  and  it  follows  that 
the  domain  can  be  divided  into  a  finite  number  of  parts,  within  each  of  which 
the  fluctuation  of  the  function  is  <  k. 

If,  at  each  point  of  a  set  which  is  everywhere  dense  in  the  domain  of  the 
variable,  there  exist  a  limit  of  the  function  on  one  side  at  least,  then  the  function 
is  either  point-wise  discontinuous,  or  else  it  is  continuous. 

In  any  interval,  containing  points  of  the  domain,  a  point  can  be  found  which 
has  a  neighbourhood,  on  one  side  at  least,  in  which  the  inner  fluctuation  is  <k ; 
within  such  neighbourhood  the  saltus  is  everywhere  <  k ;  hence  the  points 
of  K  are  non-dense  in  the  domain,  and  thus  the  function  is  either  point-wise 
discontinuous,  or  else  it  is  continuous. 

If  a  function  be  defined  for  a  continuous  interval,  and  all  the  points  of  dis- 
continuity be  ordinary  ones,  at  least  on  one  side,  then  the  set  K,  of  points  at  which 
the  saltus  is  =  k,  is  a  set  of  content  zero. 

The  set  K  can  be  resolved  into  a  perfect  set  G  and  an  enumerable  set ;  the 
set  G  contains  points  which  are  limiting  points  on  both  sides,  and  at  such 
points  the  oscillation  both  on  the  right  and  on  the  left  must  be  ~  k ;  it  follows 
that  the  set  G  is  non-existent,  and  that  K  is  therefore  an  enumerable  closed 
set,  which  has  necessarily  content  zero. 
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The  theorem  still  holds  if  there  be  points  of  discontinuity  of  the  second  kind 
which  form  a  closed  set  of  content  zero,  for  these  points  may  be  enclosed  in 
a  finite  number  of  intervals  whose  sum  is  arbitrarily  small ;  the  theorem  can 
then  be  applied  to  each  of  the  remaining  intervals  of  the  domain. 

240.  It  has  been  shewn  in  §  222  that  the  aggregate  of  functional  limits  of 
a  function,  at  any  point,  is  a  closed  set. 

A  point-wise  discontinuous  function*  can  be  so  constructed  that,  at  a  point 
XQ,  the  aggregate  of  functional  limits  is  a  prescribed  closed  set  G. 

To  establish  this  theorem,  we  observe  that,  if  G  be  unenumerable,  it  may 
-be  replaced  by  an  enumerable  set  6?!,  everywhere  dense  in  G.  Let 

V    V    V 

*  1>     '  2>     V  3)    •  •  • 

denote  the  set  GI,  arranged  in  the  order- type  o>. 

Next,  choose  an  enumerable  sequence  xl,x.2,  ...  xn,  ...  of  values  of  x  having 
the  single  limiting  point  x0 ;  -and  arrange  the  sequence  {a?n}  in  the  order-type 
o)2.  The  sequence  [xn]  may  thus  be  split  up  into  an  enumerable  set  of  sequences 
{xlr},  {x2r},  ...  [xgr],  ...  each  of  which  has  the  limit  XQ.  The  function /(a?)  may 
be  defined  by  the  specifications,  that  f(x)  =  0,  for  all  values  of  x  which  do  not 
belong  to  the  sets  [xlr],  [xzr],  ...;  and  that  y  (#«»•)  =  vsr,  where  vsl,  vs2,  ...  vsr,  ... 
is  a  sequence  chosen  so  as  to  converge  to  the  limit  Vs.  The  function  f(x)  so 
defined  is  continuous  at  every  point  except  a-0,  x1}  ...  xn,  ...,  and  it  has  the 
required  property  ;  since  the  points  of  6^,  and  therefore  of  G,  are  all  values  of 
the  aggregate  of  functional  limits  at  the  point  a?0. 


EXAMPLES. 

l.t  If  /(#),  (f>  (x)  be  two  point-wise  discontinuous  functions  denned  for  the  same  interval, 
there  is  an  everywhere  dense  set  of  points  at  each  of  which  both  functions  are  continuous. 
This  theorem  follows  at  once  from  the  fact  that  the  points  common  to  two  residual  sets  also 
form  a  residual  set,  and  that  this  holds  for  every  sub-interval  contained  in  the  given  interval. 

2.  Let  (x)  denote  the  positive  or  negative  excess  of  x  above  the  integer  nearest  to  it,  and 
if  x  be  half-way  between  two  successive  integers,  let  (x)  =  0.  Let  a  function  J  /  (x)  be  defined 
for  the  interval  (0,  1)  as  the  limit  of 

.(#)      (2#)  ,(3#),        ,(rc*) 
1    H"    4    "    9  %2   ' 

when  n  is  indefinitely  increased.  The  function  /(#)  is  a  point-  wise  discontinuous  function, 
in  which  the  set  K,  of  points  at  which  the  saltus  is  >  k,  is  finite  for  each  positive  value  of  k 
It  can  be  proved  that,  if  x  =  mj2n,  where  m  and  2n  are  relative  primes,  then 

m+Q\-f(m\        ^  f 

+       ~J  ~        2  '        J 


*  This  theorem  was  given  by  Bettazzi,  see  Rendiconti  di  Palermo,  vol.  vi,  p.  173. 
t  See  Volterra,  Giornalc  di  Mat.,  vol.  xix,  1881. 
J  See  Biemann's  Ges.  Werke,  p.  242. 
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For  values  of  x  not  of  the  above  form,  f(x)  is  continuous.  "The  number  of  points  of  A"  is  the 
number  of  irreducible  proper  fractions  having  even  denominators  2n,  such  that  ir*/8n*  >  k.  • 
The  set  of  all  the  points  of  discontinuity  is  everywhere  dense  in  the  interval  (0,  1). 

3.  Let*  y  =  c,  for  all  rational  values  of  x ;  and  y  =  d,  for  all  irrational  values  of  x.    This 
function  is  totally  discontinuous. 

4.  Lett  f(x)  —  1,  for  all  values  of  x  in  the  interval  (0,  1),  except  x  =  — ,  (n=  1,  2,  3, ...), 

55 

for  which/(4?)  =  0.  At  each  of  the  points  (  —  J  there  is  a  saltus  equal  to  unity.  This  function 
is  point-wise  discontinuous,  and  the  content  of  K  is  zero,  for  every  value  of  k. 

5.  Int  the  interval  (-,  1  )  of  x,  let  /(#)  =  !  ;  in  the  interval  1^,  -),  let /(#)  =  -  ; 

\*     /  \2"  a/  a 

and  in  general,  in  the  interval  ( Q^+I ,  —  ) ,  let  /(#)  =  — .    In  this  case  the  point-wise  dis- 

\^  '        £     /  £ 

continuous  function  f(x)  is  such  that  the-  number  of  points  at  which  the  saltus  is  >  k  is 
finite  for  every  value  of  k  >  0. 

6.  TheJ  points  of  a  continuous  interval  (0, 1)  may  be  put  into  correspondence  with  the 
points  of  a  non-dense  set  of  points,  dense  in  itself,  contained  in  an  interval  (a,  b),  in  such  a 
manner  that  the  relative  order  of  two  points  of  the  interval  (0,  1)  is  the  same  as  that  of 
the  corresponding  points  in  (a,  b).    Such  a  correspondence  is  defined  by  a  point-wise  dis- 
continuous monotone  function  y=f(x). 

7.§    Let  the  numbers  of  the  interval  (0,  1)  be  expressed  as  finite  or  infinite  decimals 

x—'OiC^az  ...  an  ...,  and  let/ (a-)  =  (  —: }  +  (T£.}  +....    The  function  f(x}  is  monotone,  and 

\iu/       \1<JW 

is  discontinuous  for  every  value  of  x  represented  by  a  finite  decimal.  The  set  of  points  K, 
for  a  given  value  of  k,  is  finite.  The  function  f(x)  defined  by/(^)  =  '0a10a20a3  ...  has  similar 
properties. 

8. 1 1  Let  the  points  of  the  interval  (0,  1)  be  represented  by  decimals,  and  consider  the 
set  G0  of  those  points  for  which  only  the  digits  0  and  I  occur  in  the  decimal  representation, 
excluding  those  points  for  which  all  the  figures  are  0,  from  and  after  some  fixed  place. 
The  set  G0  is  non-dense  in  the  interval  (0,  1),  and  has  the  cardinal  number  c.  Any  point 
#0,  of  GQ,  is  represented  by  •«1a2a3...  an...,  where  a,,  is  0  or  1.  Let  £  be  a  fixed  point 
•&!&2  •••  bn  ...  of  6*0,  and  let  x^  denote  a;0  +  2|=  'C1c2  ...  cn  ... ;  so  that  cn  =  an  +  '2,bn.  With  £ 
fixed,  let  the  set  of  all  points  x^  be  denoted  by  <?£  ;  the  points  of  G*  are  all  different  from 
those  of  6*0,  and  for  two  values  £,  £'  of  £,  the  sets  G^  G^  have  no  point  in  common.  For 
the  two  numbers  ~Ci c2 . . .  cn  . . .,  'Ci'c2' ...  cn' ...  are  identical  only  when  cn  =  cn',  which  holds  only 
when  an  —  an',  and  bn=bn'.  If  we  read  off  in  the  dyad  scale  the  decimal  representation  of  f, 
we  obtain,  by  giving  |  all  the  values  in  (?o,  every  point  in  (0,  1)  except  the  point  0,  and 
these  once  only ;  let  the  point  which,  by  thus  using  the  dyad  scale,  corresponds  to  £,  be 
denoted  by  (£).  Now  let  /(#)  be  defined  by  the  rales /(#>)  =  (£),  /(#o)==0>  and  f(x)  =  Q  for 
all  other  values  of  x.  The  point-wise  discontinuous  function  /  (x}  so  defined  for  the  interval 
(0, 1)  is  such  that,  at  all  the  points  of  the  enumerable  set  (?>,  the  saltus  is  (£) ;  the  set  of 
all  the  points  of  discontinuity  is  non-dense  in  (0,  1) ;  and  f(x)  is  constant,  and  =0,  in  an 
everywhere  dense  set  of  linear  intervals. 

*  Dirichlet's  Werke,  p.  132.  t  Hankel,  Math.  Annalen,  vol.  xx. 

£  Harnack,  Math.  Annalen,  vol.  xxm.  §  Peano,  Riv.  di  Mat.,  vol.  i. 

||  Schoenflies,  Gottinger  Nachrichten,  1899. 
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9.*  Let  the  points  .r,  of  the  interval  (0, 1),  be  expressed  as  radix-fractions  in  the  scale 
of  3.  Let  GO  be  the  set  of  points  for  which  all  the  figures  of  the  radix-fraction  are  0  and  1, 
excepting  those  points  for  which  all  the  figures  are  0,  after  some  fixed  place.  Let  Gn  consist 
of  all  the  points  which  contain  the  digit  2  in  at  most  the  first  n  places,  but  are  also  such 
that  the  nth  figure  is  2  ;  then  On  is  non-dense,  and  of  cardinal  number  c.  There  are  left 
only  those  points  for  which  the  radix-fractions  contain  the  digit  2  an  infinite  number  of 
times ;  and  these  points  belong  to  a  set  If,  for  which  the  radix-fractions  contain  an  infinite 
number  of  digits  other  than  2,  or  to  a  set  (?,  for  each  point  of  which  every  digit  is  2,  from 
and  after  some  fixed  one.  Each  point  of  G  can  be  represented  by  a  terminating  radix-fraction 
which  contains  only  a  finite  number  of  2's,  and  can  be  added  to  a  Gn.  Let  GQ,  Gt,  <?2,  ..., 
when  so  increased,  become  Cr0,  Glt  G2, ...  ;  and  take  a  sequence  of  decreasing  numbers 
#01  9\t  9ii  ••••  Let  the  function/(.r)  be  defined  by  the  rules  f(x}=gn,  if  x  is  a  point  of  Gn, 
and/(#)=0  for  all  points  of  H.  The  point-wise  discontinuous  function  /(#)  is  continuous 
at  all  the  points  of  H,  and  the  points  of  discontinuity  are  everywhere  dense  in  (0,  1),  and 
of  cardinal  number  c. 


DEFINITION    OF   POINT-WISE   DISCONTINUOUS    FUNCTIONS   BY   EXTENSION. 

241.  Let  us  suppose  a  function  f(x)  to  be  defined  for  a  domain  which, 
consists  of  a  set  of  points  which  is  dense  in  itself,  but  not  closed ;  and  further 
lek  us  assume  that  f(x)  is  continuous  in  this  domain.  The  new  domain 
obtained  by  adding  to  the  original  domain  those  of  its  limiting  points  which 
do  not  belong  to  it  may  be  spoken  of  as  the  extended  domain.  It  has  been 
pointed  out  in  §  222  that,  at  a  point  a,  of  the  extended  domain,  which  does  not 
belong  to-  the  original  domain,  there  is  an  aggregate  of  functional  limits 
which  is  certainly  a  closed  set,  and  may  consist  of  a  finite,  or  an  infinite,  set 
of  numbers. 

Let  us  now  define  a  function  <£(#),  for  the  extended  domain,  in  the 
following  manner: — At  each  point  of  the  original  domain,  which  may  be 
called  a  primary  point,  let  </>  (x}  =f(x) ',  at  each  point  a,  which  may  be  called 
a  secondary  point,  and  which  does  not  belong  to  the  original  domain,  attribute 
to  <£(#)  the  values  contained  in  the  aggregate  of  functional  limits  of  f(x) 
at  a ;  this  function  $  (x}  may  then  be  multiple-valued  at  any  secondary 
point.  The  new  function  </>(«),  defined  for  the  extended  domain,  may  be 
spoken  of  as  the  function  obtained  by  extension  of  f(x);  and  those  points  for 
which  4>(x)  is  multiple- valued  are  regarded  as  points  of  discontinuity  at 
which  the  measure  of  discontinuity  is  the  excess  of  the  greatest  over  the 
least  value  of  the  function  at  that  point. 

It  will  be  shewn  that  the  extended  function  <f>  (x)  is  point-wise  discontinuous 
in  the  extended  domain,  unless  it  be  continuous.  This  gives  rise  to  a  method  of 
constructing  point-wise  discontinuous  functions  which  has  been  employed  by 
Broden  in  various  special  cases.  Since  we  may  so  choose  the  original  domain 

*  Schoenflies,  Gbttinger  Nachrichten,  1899. 
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that  it  shall  consist  of  an  enumerable  set  of  points,  the  method  includes 
one  for  the  construction  of  a  point-wise  discontinuous  function  from  an 
enumerable  set  of  specifications. 

To  prove  that  the  extended  function  <£(#)  is  at  most  point- wise  dis- 
continuous, it  is  sufficient  to  shew  that  </>  (as)  is  continuous  at  all  points  of  the 
original  domain  G,  which  is  a  set  that  is  everywhere  dense  in  the  extended 
domain  G'. 

Let  x  be  a  point  of  G,  and  let  it  be  the  limiting  point  of  a  convergent 
sequence  (#/,  #2',  x3',  ...),  of  which  all  the  points  belong  to  G'.  Consider  the 
aggregate  [<f>  (#/),  ^>(x^), ...},  where  <f>(x1/),<j)(x,i'),  ...  have  any  of  the  values  which 

belong  to  the  points  #/,  #/,  x$, Now  a  point  xn,  of  G,  can  be  found  such 

that  )  xn'  —  xn  |  <f]n,  and  I  </>  (#«')  —f(xn)  |  <  en,  where  r)n,  en  are  independent 
arbitrarily  small  numbers.  If  we  take  a  sequence  of  values  of  tj,  such  that 
r)i>ij2>  r}3,  ... ,  with  zero  as  its  limit,  and  also  a  similar  sequence  of  the 
e  numbers,  then  the  sequence  (#1}  #2,  ars, ...)  has  the  same  limit  x  as  the 
sequence  (#/,  x2',  #3',  ...);  and  the  aggregate  {<£(#/),  (f>(x2')}  ...}  has  the  same 
limit  as  the  convergent  aggregate  {f(%i), /(%•>)>  •••},  viz.  f(x)  or  <£(#);  and 
thus  the  theorem  is  established. 

It  will  be  observed  that  the  values  of  <p  (x)  at  all  the  secondary  points  in 
an  arbitrarily  small  neighbourhood  of  a  secondary  point  a  depend  only  on 
the  values  of  f(x)  in  that  same  neighbourhood ;  it  follows  therefore  that  a  is 
a  point  of  continuity  or  of  discontinuity  of  <jb  (x),  according  as  the  aggregate 
of  functional  limits  of  f(x)  at  a  consists  of  one  number  or  of  more.  In 
the  latter  case  the  measure  of  discontinuity  of  <£  (x)  at  «  is  the  excess  of 
the  greatest  over  the  least  of  the  numbers  belonging  to  the  values  of  </>  (as)  at 
the  point. 

It  has  been  shewn  (§  240)  that  a  point- wise  discontinuous  function  can  be 
so  constructed  that,  at  a  given  secondary  point,  the  values  of  the  function 
may  be  an  arbitrarily  assigned  closed  set. 

242.  Although  a  class  of  point-wise  discontinuous  functions  may  be  ob- 
.tained  by  extension  of  a  continuous  function  defined  for  a  primary  domain, 
dense  in  itself  but  unclosed,  yet  not  every  point-wise  discontinuous  function 
can  be  generated  in  this  manner. 

Let  f(x)  be  a  point-wise  discontinuous  function  in  a  domain  which  is 
either  a  continuum  or  a  perfect  set  of  points. 

Consider  the  function  <j)  (x\  obtained  by  taking  the  values  of  f(x)  as  given 
only  at  its  points  of  continuity,  and  extending  this  function  to  the  complete 
domain,  in  the  manner  explained  above. 

At  each  point  of  discontinuity  of  f(as)  there  is  a  saltus  Ay,  and  at  that 
point  the  function  <f>  (x),  obtained  by  extending  the  set  of  values  of  f(x)  at 

H.  20 
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its  point  of  continuity,  has  a  measure  of  discontinuity  &$,  which  will  be  zero 
in  case  $  (x)  be  continuous  at  the  point;  but  in  any  case  the  condition^  &  kf 
is  satisfied,  since,  within  any  neighbourhood  of  the  point,  the  fluctuation  of 
<f>  (&•)  cannot  be  greater  than  that  of /(#). 

If  k$  =  0  at  any  point  of  discontinuity  of  f(x\  that  point  may  be  said  to 
be  a  point  of  unessential  discontinuity  of  the  function  f(x)',  and  if  k^  >0, 
the  point  is  one  of  essential  discontinuity. 

Let  now  a  function  %(#)  be  denned  for  the  whole  domain  as  follows: — 
At  every  point  of  continuity  of  f(x\  and  at  every  point  of  discontinuity 
at  which  kf  =  k^,  let  %(^)  =  0;  at  each  point  at  which  kf>k^,  let 


The  function  ^  (x)  is  not  necessarily  continuous  at  every  point  at  which  it 
is  zero.  At  a  point  xl  at  which  </>(#)  is  continuous,  the  measure  of  dis- 
continuity of  ^  (x)  is  kf,  or  ^  (x:) ;  but  this  is  not  necessarily  the  case  if  <£  (&•) 
be  not  continuous  at  a;l.  This  function  %(#)  may  be  called  a  point-wise 
discontin uous  null-function. 

By  subtracting  from  f(x)  a  function  fa  (x)  which  never  exceeds,  at  any 
point  x,  in  absolute  value,  the  value  of  ^  (x),  we  obtain  a  function  fa  (x),  of 
which  the  measure  of  discontinuity  is  everywhere  =  k$  (x}. 

The  function*  fa(x)  may  be  spoken  of  as  the  most  nearly  continuous 
function  associated  with  f(x). 

It  thus  appears  that  a  point-wise  discontinuous  function  can  always  be 
expressed  as  the  sum  of  a  point-wise  discontinuous  null-function  and  the 
most  nearly  continuous  function  associated  with  the  given  function. 

The  latter  function  fa  (x)  has  only  those  discontinuities  which  necessarily 
arise  from  the  values  of  the  given  function  at  its  points  of  continuity, 
and  is  independent  of  the  parts  of  the  discontinuities  which  arise  out 
of  the  functional  values  of  f(x)  at  the  points  of  discontinuity.  The  null- 
function  depends  upon  the  unessential  parts  of  the  discontinuity  of  f(x). 

EXAMPLES. 

l.t    Let  f(x)=0,  for  #=0,  -,  ^-  ,  5—,...;  and  for  all  other  positive  and  negative 

7T        2iTT        O7T 

values  of  x,  let  f(x)  =  cos  -  .    The  function  <£  (x),  associated  with  /(*•),  agrees  with  cos  -  at 
every   point   except  x  =  Q,   where  <f)  (x)  is  represented  by  (  — 1,    +1).     The  measure  of 

discontinuity  kf  is  zero  except  at  -,   — ,  5— ,  ...,  where  £/=!,  and  at  x=0,  where  &/=2; 

TT     ZTT     OTT 

*  This  definition  is  not  in  complete  agreement  with  that  of  Schoenflies,  see  Bericht,  p.  134,  to 
whom  the  term  is  due.  Some  erroneous  statements  of  Schoenflies,  in  this  connection,  were  pointed 
out  and  corrected  by  Hahn;  see  Monatshefte  f.  Math.,  vol.  xvi,  p.  312. 

f  See  Hahn,  loc.  cit. 
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the  measure  k+  vanishes  everywhere  except  at  a;  =  0,  where  £$  =  2.    The  function  x  (x) 

vanishes  except  at  -  ,  —  , ... ,  where  it  is  1 ;  it  vanishes  at  #=0,  but  is  discontinuous  at 
TT     2ir 

that  point. 

2.*  Let/(#)  vanish  except  at  the  points  #=„,  ;,,  j,  ...  -,t..,  where /(#)  =  !.  The 
function  <£  (x)  is  everywhere  zero,  and  thus  k$  is  everywhere  zero.  The  function  <£t  (#),  is 

everywhere  zero,  and  ^y  is  zero  except  at  0,  =  ,  -,   ...  -,...,   where  kf=\.    In   this  case 

SB     3  ?fc 

3.*  A  point-wise  discontinuous  function  f(x]  can  be  constructed  t  such  that  the 
function  <£  (x)  may  have,  at  a  point  x0,  the  values  belonging  to  a  prescribed  closed  set  G, 
in  accordance  with  Bettazzi's  theorem  (§  240).  If  G  be  unenumerable,  choose  an  enumer- 
able set  GI,  dense  in  G,  and  let  fflt  g^ ,  g3, ...  be  the  points  of  GI.  Take  a  set  of  intervals 

{8n},   where   &„  is   (^o+s^,  #0  +  5^1) »  an(i  define /(a;)  as  follows:— in  81?  83,  S5,  S7,  ... 


)=gi',  in  §2>  ^e>  ^ioj  •••  \etf(x)=gr<2 ;  in  84,  812,  §20?  •••  let/(-:';)—  ^3 5  ingeneral/(^)=5r», 
in  the  first  free  interval,  and  in  every  second  of  the  following  free  intervals ;  further  let 
f(x)=gi,  for  x £x0.  The  function  f(x)  is  point- wise  discontinuous,  the  points  of  dis- 
continuity being  #0  and  the  points  x0+  — .  The  function  (f>  (x)  has  two  values  at  the  points 

JO 

XQ  +  —  ;  and  at  #0  it  has  all  the  values  of  Gl ,  and  therefore  all  those  of  G. 

ifi 


FUNCTIONS    OF    BOUNDED    VARIATION. 

243.    Let  us  suppose  that  the  interval  (a,  b),  for  which  a  function  f(x)  is 
defined,  is  divided  into  a  number  of  parts 

(a?0,  a?!>  (a?!  ,  xz)  ,  .  .  (ocr^,  xr)  .  .  .  (arw_1  ,  xn\  where  XQ  =  a,  xn  =  6  ; 
these  parts  forming  a  net  with  closed  meshes. 

Consider  the  sum 


or  i  I  /<>,•)  - 

r=l 

r=w 

//"  the  function  be  such  that  the  sum   S   \f(xr}  —/(xr-^    is  less  than  some 

r=l 

fixed  positive  number,  for  all  possible  nets,  the  function  f(x)  is  said  to  be  of 
bounded^  variation  (a  variation  bornee)  in  the  interval  (a,  b). 

The  function  /(a;)  being  of  bounded  variation,  there  must  exist  a  number 
which  may  be  denoted  by  Vabf(x),  such  that,  for  every  net, 

T 

r  =  l 

*  See  Hahn,  loc.  cit. 

f  This  is  contrary  to  a  statement  of  Schoenflies,  see  Bericht,  vol.  i,  p.  135. 

£  See  Jordan's  Coum  d'  Analyse,  vol.  i,  p.  55. 

20  _  9 
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and  such  that,  if  e  be  an  arbitrarily  chosen  positive  number,  a  net 

\XQ,  #>i/,  \XD  &-2/J  •••  \*&n — 1>  *^n)> 

where  XQ  =  a,  xn  =  b,  can  be  so  determined  that 

T  \f(xr)  -/(*,_,)  \  >   Va»f(x}  -  6. 

The  number  Vabf(x)  which  satisfies  these  conditions  is  defined  to  be  the 
total  variation  off(x)  in  the  interval  (a,  b). 

In  case  the  function  is  not  of  bounded  variation  in  (a,  b),  Vab  f(x)  may  be 
regarded  as  infinite,  and  it  will  be  possible  to  determine  a  net,  for  which 

T  f(xr}-f(xr_l}\>N, 
where  N  is  an  arbitrarily  chosen  positive  number. 

A  function  of  bounded  variation  has  only  ordinary  points  of  discontinuity. 

Accordingly  the  set  of  points  of  discontinuity  of  such  a  function  is 
enumerable. 

To  see  that  this  statement  is  correct,  let  us  suppose  that,  at  a  point  £,  in 
(a,  b),  the  upper  and  lower  limits  of  the  function  on  one  side,  say  the  right, 
have  different  values  ;  thus  let  /(£  +  0)  be  >/(f?  +  0).  If  77  be  an  arbitrarily 
chosen  positive  number,  in  the  neighbourhood  of  f  there  is  an  indefinitely 
great  number  of  points  at  which  f(oc)  >f(£  +  0)  —  77,  and  also  such  a  set  at 
which  f(x)  </(£  +  0)  +  77.  It  follows  that  an  indefinitely  great  number  of 
non-overlapping  intervals  exists,  in  each  of  which  the  absolute  value  of  .the 
difference  of  the  functional  values  at  the  end-points  is 


which  is  a  positive  number  K,  provided  77  be  chosen  to  be  less  than 


A  set  of  m  intervals  can  be  so  chosen  that,  if  their  end-points  belong  to 

r*=n 

those  of  a  net,  their  contribution  to    S   \f(xr}—  /(#,._,)  \  is>m/c.    Since  m 

r=l 

may  be  taken  as  large  as  we  please,  it  is  clear  that  the  function  cannot  have 
bounded  variation. 

r=n 

It  is  obvious  that  f(b)  —/(a)  =  2   \f(sKr)  —f(x,r_^j\  ;  now  let  the  terms  in 

r=\ 

the  sum  on  the  right-hand  side  be  divided  into  two  parts,  those  for  which 
f(xr}—f(xr^}  is  positive,  and  those  for  which  it  is  negative.    Thus 

/(&)  -/(a)  =  2X  \f(xr}  -f(xr^}  +  22  {/(*,)  -f(xr^}} 

where,  in  21;  those  values  of  r  are  taken  for  which  /(#r)-/(X--i)  is  positive, 
and  in  22,  those  terms  for  which  it  is  negative. 
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If  the  function  is  of  bounded  variation,  the  value  of 

Si  {fM  -/(«r-i)}  -  **  IfM  -/(av-i)l 

cannot  exceed  Vabf(x).    Thus 


Si 


and  -  22  {/(av)  -/(«^,)}  ^  i  {  F.*/(*) 

It  follows  that  the  two  sums 

Si  {/(Or)  -/(aV-i)},     2*  f/0*r)  - 

are  both  bounded,  for  all  nets  fitted  on  to  (a,  6). 
Choosing  a  net  fitted  on  to  (a,  6),  for  which 


we  see  that     21  {/(av)  -/0«V-i)}  >  i  {  Fa*/(a?)  +/(6)  -/(a)  -  e}, 
-  2 


It  follows  that  the  numbers 

1  {  W(*)  +/(6)  -/(a)}, 
are  the  upper  boundaries  of 

^  {/.W  -/(^V-Ol,    -  22  {/  W  -/(arr-i)} 
for  all  possible  nets  fitted  on  to  the  interval  (a,  6). 

Denoting  these  upper  boundaries  by  Pab  f(x),  Nabf(x)  respectively,  we  have 


-/(a)  .  PB»/(ar)  -  W/O*)- 

The  numbers  Pabf(x),  —  Ncbf(x)  may  be  called  the  total  positive  variation, 
and  the  total  negative  variation,  of  /(a?)  in  (a,  6). 

The  following  'definition*  of  a  function  of  bounded  variation  is  equivalent 
to  the  one  given  above  : 

If  any  set  of  non-overlapping  intervals,  finite,  or  indefinitely  great,  in 
number,  be  defined  in  (a,  b),  and  the  sum.  or  limiting  sum,  of  f(ft)  —f(ot)  \,for 
all  the  intervals  (a,  /3)  of  the  set,  be  less  than  some  fixed  number,  independent  of 
the  particular  set  of  intervals,  the  function  f  (x)  is  said  to  have  bounded  varia- 
tion in  (a,  b). 

In  the  first  place,  we  see  that  a  function  which  satisfies  this  definition 
must  satisfy  the  definition  employed  above.  For  we  may  take  the  set  of 
intervals  to  be  those  of  any  net  fitted  on  to  (a,  b).  Conversely,  if  the  first 
definition  is  satisfied  by  f  (,*•),  we  may  arrange  any  infinite  set  of  non-over- 
lapping intervals  in  order  of  descending  magnitude  of  |/(/3)  —/(a)  and  any 
finite  part  of  the  set  containing  the  first  m  of  the  intervals  is  part  of  a  net  in 
(a,  6);  and  thus  2  f(/3)  —/(a)  !,  for  these  m  intervals,  is  less  than  Vabf(x}.  As 
this  holds  whatever  value  m  may  have,  we  see  that  the  second  definition  is 
satisfied  by  f(x). 

*  See  W.  H.  Young,  Quarterly  Journal,  vol.  XLII,  p.  57. 
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FUNCTION    OF   BOUNDED   VAEIATION    EXPRESSED   AS   THE    DIFFERENCE 
OF   TWO   MONOTONE    FUNCTIONS. 

244.  It  is  clear  that,  if  x  be  any  point  in  (a,  b),  a  function  /(#)  which 
has  bounded  variation  in  (a,  6)  has  also  bounded  variation  in  (a,  x)  ;  and 
consequently  the  positive,  and  the  negative,  variations  of  the  function  in 
(a,  x)  are  also  bounded. 

The  three  numbers  which  represent  these  total  variations  may  be  denoted 

by  Vaxf(®),  Paxf(x),  —  Naxf(x};  and  they  may  be  regarded  as  functions  of  x. 

i 
They  satisfy  the  relations 

Vaxf(x)  =  Paxf(x}  +  Naxf(x), 
f(x)  -/(a)  =  Paxf(x)  -  Naxf(x\ 

It  is  clear  that,  if  x'  >  x,  we  have  Pax'f(x)  ^  Paxf(x),  Nax'  f(x)  ^  Naxf(x); 
and  thus  the  two  functions  Paxf(x),  N  axf(x]  are  monotone  non-diminishing 
functions,  in  (a,  b).  For  simplicity,  we  denote  these  functions  by  P  (x),  N(x) 
respectively. 

Since  f(x)  =  [P  (x)  +f(a)  +  k}  -  {N(x)  +  k} 

and  f(x)  =  {kf  -  N  (x)}  -{k'-P  (x)  -f(a)}, 

where  k,  k'  are  arbitrarily  chosen  numbers,  we  see  that  : 

Every  function  of  bounded  variation  can  be  expressed,  in  an  indefinitely 
great  number  of  ways,  as  the  difference  of  two  monotone  functions,  both  of  which 
are  non-diminishing,  or  both  of  which  are  non-increasing. 

This  theorem  expresses  the  cardinal  property  of  functions  of  bounded 
variation.  It  is  of  great  value  in  Analysis,  as,  by  means  of  it,  properties  of 
monotone  functions  can  be  immediately  extended  to  the  wide  class  of  functions 
of  bounded  variation. 

The  converse  theorem  holds  that  : 

The  difference  of  two  bounded  monotone  functions,  both  non-diminishing,  or 
both  non-increasing,  is  a  function  of  bounded  variation. 

For,  consider  such  a  function  f  (x)  •=  P  (x)  —  N  (x\  where  P  (x),  N(x)  are 
non-diminishing  ;  then,  in  any  interval  (a,  /3),  we  have 

/(/9)  -/(a)  |  £  {P  (0)  -  P  (a)}  +  {N(ft  -  N  (a)}. 

•  Hence,  in  any  net  fitted  on  to  (a,  b),  the  sum  of  the  absolute  values  of  the 
differences  of  the  values  off(x)  at  the  ends  of  the  meshes  (a,  /3)  is 


£  P  (b)  -  P  (a)  +  N(b)-  N  (a)  ; 
and  thus  the  theorem  is  established. 
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It  has  now  been  shewn  that  it  is  necessary  and  sufficient,  in  order  that  a 
function  may  be  of  bounded  variation  in  the  interval  for  which  it  is  defined, 
that  it  can  be  expressed  as  the  difference  of  two  bounded  monotone  non 
diminishing  (or  non-increasing)  functions. 

The  sum,  difference,  or  product,  of  two  functions  of  bounded  variation  is 
also  of  bounded  variation. 

If  /j  (#)  =  P1  (a)  -  JVj  (#),  /2  (x)  =  P2  (#)  -  ^2  (#),  the   theorem  follows  at 
once  from  the  relations 


,  (a,)  =  (A  (a)  +  P2  (*)}  -  {N,  (x)  +  N2  (x)}, 
,  (*)  -/,  (*)  =  {P,  (x)  +  N2  (x)}  -  [P,  (x)  4-  N,  (x)}, 


FUNCTIONS   OF   BOUNDED   TOTAL   FLUCTUATION. 
245.    If,  in  the  net  (#„,  xj,  (x1  ,x2)...  (xn^  ,  xn},  where  x0  =  a,  xn  =  &,  we  take 

r=n 

the  sum  of  the  fluctuations  in  the  closed  meshes  of  the  net,  viz.   2   (Ur  —  Lr), 

r=l 

where  Ur,  Lr  are  the  upper  and  lower  boundaries  of  /(«)  in  the  closed  mesh 
(#r-  u  ^/)>  then  if  this  sum  does  not  exceed  some  fixed  positive  number,  inde- 
pendent of  the  particular  net,  the  function  f(x)  is  said  to  have  bounded  total 
fluctuation  in  the  interval  (a,  b}.  If  Fabf(x}  be  the  number  such  that,  for 
every  net, 

T(Z7;-  A)  *>.»/(«), 

r=\ 

and  such  that  a  net  can  be  fitted  on  to  (a,  b)  for  which 

r£(Ur-Lr)>F*f(x)-e, 

r=l 

where  e  is  an  arbitrarily  chosen  positive  number,  then  Fabf(x)  is  termed* 
the  total  fluctuation.  of/(#)  in  (a,  b)  ;  and  it  is  finite  for  a  function  of  bounded 
total  fluctuation. 

It  will  appear  that  the  class  of  functions  of  bounded  total  fluctuation  is 
identical  with  that  of  functions  of  bounded  variation. 

In  order  that  a  function  may  have  bounded  total  fluctuation  it  is  sufficient 
that  2  (  Uf  —  Lr),  when  taken  over  the  successive  nets  of  a  given  system  of  nets, 
should  have  a  definite  limit.  Moreover  the  limit  for  the  system  of  nets  is 

£F*f(x\  and  MW04 

This  limit  may  be  called  the  total  fluctuation  of  f(x)  for  the  particular 
system  of  nets. 

*  See  Study,  Math.  Annalen,  vol.  XLVII,  p.  55.    It  was  shewn  by  Study  that  the  definitions  of 
functions  of  bounded  total  fluctuation,  and  of  bounded  variation,  are  equivalent  to  one  another. 
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It  is  clear  that,  if  an  interval  (a,  /3)  be  divided  into  two  parts  (a,  7)  (7,  ft}, 
the  sum  of  the  fluctuations  in  (a,  7),  (7,  /3)  cannot  be  less  than  the  fluctuation 
in  (a,  /3).  Hence,  if  we  consider  the  successive  nets  D1}  D2,  ...,  of  a  system  of 
nets,  the  sum  of  the  fluctuations  in  the  meshes  of  Dm  is  ^  the  sum  in  the 
meshes  of  Dm+1.  Thus,  as  we  proceed  through  the  successive  nets  of  the 
system,  the  sums  of  the  fluctuations  in  the  meshes  of  a  net  form  a  sequence 
of  non-diminishing  numbers  which  have  a  finite  upper  limit,  unless  they 
increase  indefinitely.  Assume  the  former  to  be  the  case.  Let  us  now  consider  a 
second  system  of  nets  Z)/,  D2',  . . .  Dm',  —  Suppose  77  to  be  a  number  less  than 
the  breadth  of  all  the  meshes  of  Dm' ;  and  let  the  two  nets  Dr,  Dm'  be  super- 
imposed to  form  a  net  (Dr,  Dm').  We  may  suppose  r  so  large  that  Dr  contains 
no  mesh  of  breadth  >?;.  A  mesh  dr,  of  Dr,  with  fluctuation  Fr,  will  contain 
not  more  than  two  meshes  of  the  net  (Dr,  Dm'),  with  fluctuations  Ffl,  Frz\ 
we  have  then  Fr  £  Ffi  +  FTz  ^  %Fr.  Now  every  mesh  of  Dm'  is  made  up  of 
meshes  of  Dr  and  of  meshes  of  (Dr,  Dm'}.  Therefore  the  sum  of  the  fluctua- 
tions in  the  meshes  of  Dm'  is  ^  the  sum  of  the  fluctuations  in  the  undivided 
meshes  of  Dr  together  with  the  sum  of  the  fluctuations  such  as  FTl,  Frz.  It 
follows  that  the  sum  of  the  fluctuations  in  the  meshes  of  Dm'  cannot,  exceed 
twice  the  sum  of  the  fluctuations  in  the  meshes  of  Dr.  Since  a  number  r  can 
be  determined  so  as  to  correspond  to  each  number  m,  we  see  that  the  limit  of 
the  sum  of  the  fluctuations  in  Dm',  as  m  ~  oc  ,  cannot  exceed  twice  the  limit  of 
the  sum  of  the  fluctuations  in  Dr,  as  r  ~  oc  .  If  X'  be  the  former  limit,  and  X 
the  latter,  we  have  X'  ^  ZX.  Thus,  if  X  is  finite,  for  any  system  of  nets,  it  is 
finite  for  any  other  system  of  nets. 

Since  a  net  can  be  so  chosen  that  the  sum  of  the  fluctuations  in  its  meshes 
is  less  than  Fab  f(x)  by  less  than  an  arbitrarily  chosen  number,  a  system  of 
nets  can  be  so  chosen  that  the  limit  of  the  sums  for  the  nets  is  Fabf(x).  It 
thus  follows  that,  for  any  other  system  of  nets,  the  limit  of  the  sum  of  the 
fluctuations  is  ^  ^  Fab  /(#),  and  ^  Fttb  f(x). 

246.  Since  the  absolute  value  of  the  difference  of  the  functional  values 
at  the  end-points  of  an  interval  (a,  /3)  cannot  exceed  the  fluctuation  in  the 

r  =  n 

closed  interval,  it  is  clear  that  the  sum  ^  \f(xr}  —  f(xr-\) ''  cannot  exceed  the 

?•=! 

sum  of  the  fluctuations  in  the  intervals.    It  follows  that,  if  a  function  is  of 
bounded  total  fluctuation,  it  is  also  of  bounded  variation. 

If  a  function  have  at  no  point  an  external  saltas,  the  total  fluctuation,  and 
the  total  variation,  have  one  and  the  same  value,  ivhen  estimated  as  the  limiting 
values  over  a  system  of  nets ;  and  that  value  is  independent  of  the  particular 
system  of  nets. 

In  accordance  with  this  theorem,  V^f(x)  and  Fabf(x)  are  equal  to  one 
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another,  and  are  the  limits  of  the  sums  2 1  /(#,)  -/(#r-i)  ,  2  (l/r  -  Lr),  over  a 
net  Dn,  of  a  system  of  nets,  as  n  ~  oo  . 

To  prove  the  theorem,  let  Fw(r),  FJr)  denote  |/(a?r)-/(«r-i)  ,  (Ur-Lr} 
for  a  mesh  dr,  of  a  net  A..  We  have  clearly  Fmw  ^  -Fm(r).  It  will  be  shewn  that 
for  the  net  Dr+g,  where  s  is  sufficiently  large,  2  F(r+8)  ^^F^,  the  summation 
referring  to  all  the  meshes  of  Dr+s  that  are  in  the  mesh  dr,  of  the  net  Dr.  In 
the  case  in  which  Ur,  Lr  are  the  functional  values  at  the  ends  of  dr,  we  have 
Ur  —  Lr  =  f(xr}—f(xr_^)\.  When  this  is  not  the  case,  one  at  least  of  f(xr\ 
/(tf>-i)  is  between  Ur  and  Lr.  If  e  be  an  arbitrarily  chosen  positive  number, 
there  are  two  points  of  dr,  say  £,  f?a,  for  one  of  which  the  functional  value  is 
>Ur  —  e,  arid  for  the  other  <  Lr  +  e.  There  must  exist  a  neighbourhood  of  £ , 
on  one  side  at  least,  at  every  point  of  which  the  functional  value  is  >  Ur  —  2e; 
for  otherwise  the  functional  limit  at  £  would  be  <  Ur  —  e,  on  both  sides,  and 
this  cannot  be  the  case  if  there  is  no  external  saltus  at  ft.  It  follows  that, 
if  s  be  sufficiently  large,  there  must  be  an  end-point  of  a  mesh  of  Dr+s  at 
which  the  functional  value  is  >  Ur  —  2e.  Similarly  we  see  that,  if  s  be  suffi- 
ciently large,  there  is  an  end-point  of  a  mesh  of  Dr+g,  at  which  the  functional 
value  is  <  Lr  +  2e ;  and  one  at  least  of  these  end-points  is  not  an  end-point  of 
the  mesh  dr.  The  number  s  can  accordingly  be  so  chosen  that,  for  all  the 
meshes  of  the  net  Dr+s  which  are  in  the  mesh  dr  of  Dr,  the  sum  of  the  absolute 
values  of  the  differences  of  the  functional  values  is  >  Ur  —  Lr  -  4e.  Thus  the 
number  s  can  be  so  chosen  that  the  sum  of  the  absolute  values  of  the 
differences  of  the  functions  for  those  meshes  of  Dr+s  that  are  in  the  mesh  dr  is 
>  Ur  —  Lr  —  4e  +  W ;  where  W  is  the  absolute  difference  between  the  functional 
value  at  one  of  the  end-points  of  dr  and  the  functional  value  at  one  of  those 
end-points  of  a  mesh  of  Dr+s  at  which  it  is  >  Ur  -  2e  or  <  Lr  +  2e.  If  e  be  so 
chosen  that  W  >  4e,  (which  can  be  done,  because  the  functional  value  at  the 
end-point  of  dr  in  question  is  not  equal  to  Ur  or  Lr,)  we  see  that  the  sum  of 
the  absolute  differences  of  the  functional  values  in  those  meshes  of  Dr+8  that 
are  in  dr  is  >  Ur  —  Lr.  Moreover  s  can  be  so  chosen  that  this  holds  good  for 
each  mesh  dr>  of  the  net  Dr;  and  thus,  for  such  value  of  s,  we  have  V(r+f]  >  F{r]. 
This  inequality  combined  with  the  inequality  VM  ^  F(r}  shews  that 

lim  F"->  =  lim  V(r+s}  ^  lim  F{r^ ;  Km  V^  ^  lim  F(r\ 

r~oo  ?-~oo  r~»  r~oo  r~*> 

and  thus  that  the  two  limits  lim  V(r\  lim  F(r)  have  the  same  value,  finite  or 

r~x  r  ~  oo 

infinite. 

In  order  to  complete  the  proof  of  the  theorem  it  must  be  shewn  that  the 
limit  of  the  sum  of  the  fluctuations  in  the  meshes  of  a  net  belonging  to  a 
system  of  nets  is  the  same  for  all  systems  of  nets ;  there  being  as  before  no 
external  saltus  at  any  point. 

If  possible,  let  the  limit  for  a  particular  system  of  nets  have  a  value  G, 
less  than  Fabf(x).  A.  net  can  be  defined,  for  which  the  sum  of  the  fluctuations 
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in  the  m  meshes  is  >  G',  say  G  +  a.  Let  this  net  D'  be  superimposed  on  the 
net  Dr,  of  the  system  of  nets  ;  we  may  suppose  r  so  great  that  not  more 
than  one  of  the  end-points  of  the  meshes  of  D'  is  in  any  one  of  the  meshes  of 
Dr.  Let  us  suppose  that  one  of  these  end-points  is  x,  and  that  it  falls  in  the 
mesh  8,  of  Dr,  dividing  it  into  two  parts  B'  and  B"  ;  let  F  (B)  be  the  fluctua- 
tion in  8.  Since  there  is  no  external  saltus,  if  B,  and  consequently  B'  and  B", 
are  sufficiently  small,  we  have 


where  6rj  is  less  than  the  arbitrarily  chosen  number  77.    Moreover,  B'  and  B" 
being  sufficiently  small, 

F(B')  =  \  f(x}-f(x  -  0)  i  +  G2,  where  G2  <  r,, 
F  (B")  =  1  f(x}  -  f(x  +  0)  1  +  £3)  where  0,  <  17. 

Now  r  may  be  chosen  so  great,  and  consequently  all  the  meshes  of  Dr  so 
small,  that  these  conditions  are  satisfied  for  all  the  meshes  of  D'.  We  have 
then  F(&)  +  F  (B")  -  F(B)  =  G,  +  Gs  -  Gl}  since  f(x)  is  between  f(x+  0)  and 
f(x-  0).  Hence  the  sum  2  F  (B')  +  2F  (B")  -  2  F  (B)  <  2  (m  -  1)  77,  the  sum- 
mation being  taken  for  all  those  meshes  which  contain  one  of  the  m  —  1 
points  x;  moreover  2  F  (B')  +  2F  (8")  -  2  -F  (8)  ^  0.  Therefore  the  sum  of  the 
fluctuations  in  the  net  obtained  by  superimposing  D'  and  Dr  is  <  G  +  2  (m  —  1)  77. 
But  the  sum  of  these  fluctuations  is  certainly  ^  G  +  a,  and  since  77  is  arbitrarily 
small  and  independent  of  m,  these  two  conditions  are  incompatible  with  one 
another;  thus  a  must  be  zero.  It  follows  that  G  cannot  be  less  then  Fabf(x). 
Thus,  in  every  system  of  nets,  the  limit  of  the  sum  of  the  fluctuations  in  the 
meshes  of  a  net  is  Fabf(x).  It  then  follows,  from  the  first  part  of  the  theorem, 
that  the  limit  of  the  sum  of  the  absolute  differences  of  the  functional  values 
for  the  meshes  of  any  net  is  Vabf(x),  or  Fabf(x). 

In  case  there  be  points  at  which  there  is  an  external  saltus,  the  preceding 
proof  can  still  be  applied  to  shew  that  the  limit  of  the  sums  of  the  fluctuations 
in  two  systems  of  nets  has  the  same  value  for  the  two  systems,  provided  no 
point  at  which  there  is  an  external  saltus  be  an  end-point  of  a  mesh  in  either 
system.  Moreover,  in  case  there  be  an  external  saltus  at  the  point  x,  the  value 
of  F(B)  is  either  \f(x}-f(x  +  0)  i  +  G1  or  \f(x)-f(x-0)  +  Glt  and  we 
see  that  F(B')  +  F(8")  -  F  (B)  =  Ga  +  G3-  G,  +  s(x),  where  s(x)  denotes  the 
external  saltus  at  x,  and  is  equal  to  the  smaller  of  the  two  numbers 

!/(*)-/(*  +  o)|,  |/(*)-/(*-o)|. 

It  is  clear  that,  for  a  function  with  bounded  total  fluctuation,  the  sums  of 
the  saltus  on  the  right,  and  on  the  left,  2i/(#+0)—  f(x}\,  ^\f(x—0)—f(x)\, 
for  all  the  points  of  discontinuity,  are  finite,  and  there  can  therefore  be  only 
a  finite  number  of  points  at  which  the  external  saltus  exceeds  an  arbitrarily 
chosen  number  ft.  If  we  take  these  points,  say  n  —  I  in  number,  to  be  the 
points  x  above,  of  D',  we  see  that  the  sum  of  the  fluctuations  in  the  meshes  of 
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the  net  obtained  by  superimposing  D'  and  Dr  is  2  F(&)  +  Sp  +  7,  where  Sp  is 
the  sum  of  those  external  saltuses,  all  of  which  are  greater  than  @ ;  and  7  is 
the  sum  of  the  n-  1  numbers  (r2+  Cr3-  (ra,  and  is  therefore  arbitrarily  small. 
It  thus  appears  that  the  total  fluctuation  for  a  system  of  nets  such  that  no 
end-point  of  any  of  the  nets  is  a  point  at  which  there  is  an  external  saltus  is 
the  number  //,  (^  £  Fabf(x)),  the  lower  boundary  of  the  limiting  sum  for  all 
systems  of  nets.  If  those  points  at  which  the  external  saltus  is  >  0  be  end- 
points  of  meshes  of  nets  of  the  system,  the  limit  of  the  sum  of  the  fluctuations 
in  a  net  of  the  system  is  p  +  Sp .  If  a  sequence  of  descending  values  be  given 
to  ft,  the  sum  Sp  converges  to  a  fixed  number,  which  is  the  sum,  or  the 
limiting  sum,  of  all  the  external  saltuses.  Thus  we  have  the  following  theorem: 

If  a  function  with  bounded  total  fluctuation  be  such  that  there  are  points  at 
which  the  function  has  an  external  saltus,  then  the  difference  Fabf(x)  —  /j,  between 
the  upper  and  lower  boundaries  of  the  limiting  sum  of  the  fluctuations  in  the 
nets  of  a  system  is  equal  to  the  sum  of  the  external  saltuses.  If  a  system  of 
nets  be  such  that  no  point  at  which  there  is  an  external  saltus  is  an  end-point 
of  any  mesh  of  any  of  the  nets,  for  such  a  system  the  limiting  sum  of  the 
fluctuations  in  the  meshes  of  a  net  is  p.  If  however  every  point  at  which  there 
is  an  external  saltus  is  an  end-point  of  a  mesh  of  nets  of  the  system,  the  limiting 
sum  is  then  Fcbf(x). 

247.  It  has  now  been  shewn  that,  for  a  function  without  points  at 
which  there  is  an  external  saltus,  the  total  variation  and  the  total  fluctuation 
over  any  system  of  nets  are  identical,  being  both  finite,  or  both  infinite,  and 
that  they  are  independent  of  the  particular  system  of  nets  employed. 

If  f(x)  have  an  external  saltus  at  each  point  of  some  set  S,  the  total 
variation  of  f(x],  when  extended  over  a  system  of  nets  such  that  no  point  of 
S  is  an  end-point  of  any  mesh,  is  identical  with  the  total  fluctuation  of  that 
function  0  (x)  which  differs  fromf(x)  only  in  having  the  functional  values  at 
the  points  of  S  so  altered  that  the  external  saltus  is  at  every  point  of  8 
removed.  The  total  fluctuation  of  </>  (x)  is  obtained  by  removing  from  yu,  the  total 
fluctuation  of  /(#)  over  such  a  system  of  nets,  that  is  the  number  Fabf(x)  —  /n 
which  is  the  sum  of  the  external  saltuses.  Thus  the  total  fluctuation  of  (f>  (x) 
is  2/A  —  Fab  /(#•).  If,  on  the  other  hand,  we  employ  a  system  of  nets  such  that 
every  point  of  S  is  an  end-point  of  meshes  of  the  nets,  the  total  variation  and 
the  total  fluctuation  off(x)  have  the 'common  value  Fabf(x). 

It  thus  appears  that,ybr  a  function  of  bounded  total  fluctuation,  which  has 
points  with  an  external  saltus,  the  total  variation  over  a  system  of  nets  is 
Fabf(x),  or  2/z  —  Fabf(x),  or  has  some  value  between  these  two  numbers, 
according  to  the  particular  system  of  nets  employed. 

The  necessary  and  sufficient  conditions  that  a  function  f(x]  may  have 
bounded  variation  and  bounded  fluctuation  in  (a,  6)  are  that 
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(1)  The  points  of  discontinuity  must  all  be  of  the  first  species,  i.e. 
f(x  +  0),f(x  —  0)  must  everywhere  exist. 

(2)  The  sum  of  the  absolute  values  of  the  external  saltuses  must  be 
finite. 

(3)  A  system  of  nets  must  exist  for  which  the  sum  of  the  absolute 
values  of  the  differences  of  the  functional  values  at  the  end-points  of  the 
meshes  of  a  net  must  be  less  than  some  fixed  number,  for  all  the  nets  of  the 
system. 

248.    The  following  theorem  will  now  be  established: 

The  necessary  and  sufficient  condition  that  a  function  f(x)  is  of  bounded 
total  fluctuation  is  that  it  can  be  expressed  as  the  difference  of  two  bounded 
monotone  functions,  either  of  which  can  be  non-increasing  or  non-diminishing. 

A  comparison  of  this  theorem  with  that  in  §  244,  for  functions  of  bounded 
variation,  gives  a  second  proof  that  the  two  classes  of  functions  are  identical. 

To  prove  the  theorem  ;  let  Fax  be  the  upper  boundary  of  the  total  fluctua- 
tion in  the  interval  (a,  x).  We  then  see  that/(a?+/t)  -f(x)  ^  Fxx+h  ^  Fax+h-Fax; 
it  follows  that  Fax  —f(x)  is  a  monotone  non-diminishing  function.  The  function 
Fa*  +/0)  has  the  same  property,  since  f(x  +  h)  -f(x)  ^  -  Fx*+h  ^  Fax  -  Fax+h. 
Therefore,  if  fa  (x}  =  |  [Fa*  +/(*)},  fa(x)  =  i  {Fa*  -/(*)},  the  function  /(a?) 
can  be  expressed  as  the  difference  of  the  two  non-diminishing  monotone 
functions  fa  (x),  fa  (x). 

Conversely,  let  f(x)  =  fa  (x)  -  fa  (x);  where  fa  (x),  fa2  (x)  are  bounded  non- 
diminishing  monotone  functions.  In  any  interval,  the  fluctuation  of  f(x) 
cannot  exceed  the  sum  of  the  fluctuations  of  fa  (x)  and  fa  (x).  It  then  follows 
that  the  total  fluctuation  of  f(x}  in  (a,  6)  cannot  exceed 

[fa(b)-fa(a)]  +  [fa(b)-fa(a)]. 

EXAMPLES. 

1.*    The  function  denned  by  /(#)=#  sin  -,/(0)=0,  is  not  of  bounded  total  fluctuation 

OG 

in  the  interval  (0,  I/TT),  although  it  is  continuous  in  the  interval.     For,  in  the  interval 

[=  —  ),  sin  -  attains  the  value  (-1)1',  and  thus  the  fluctuation  in  this  interval 

Win-'    fir/' 

is  at  least  equal  to  !/(?•  +  |)TT.    The  total  fluctuation  in  the  interval  f  —  ,  —  j  is  at  least 


increases  without  limit  when  s  is  indefinitely  increased  ;  therefore  the  total  fluctuation  in 
(0,  I/TT)  is  not  finite. 

*  Lebesgue,  Lemons  sur  V  integration,  p.  -56. 
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2.*   The  function  defined  by/(#)  =  #2sin  -g,/(0)  =  0,  is  continuous  in  any  interval  con- 
taining .r=0,  and  is  everywhere  differentiable,  but  is  not  of  bounded  total  fluctuation. 
3.t    The  function  denned  by  /(#)=#2sin  (#-$),/(0)=0,  is  of  bounded  total  fluctuation 

in  the  interval  (0,  1/rr*).    In  the  interval  (  -         —  ,    -  — =)  ,  the  function  has  a  single 

\(r+l»)t       (rir}*/ 

maximum,  or  else  a  single  minimum,  and  the  absolute  value  of  the  function  at  this  point 

3  3  °°  1 

is  at  most  l/(r7r)7.     The  total  fluctuation  in  (0,  1/n-4")  cannot  exceed  22 -,  which  is 

finite. 

RESOLUTION  OF  A  FUNCTION  OF  BOUNDED  VARIATION. 

249.  If  f(x)  have  bounded  variation  in  the  interval  (a,  6),  it  can  be  ex- 
pressed by  P(x)  —N(x\  where  P(x),  —N(x)  are  the  positive,  and  the  negative, 
variations  respectively  of  f(x)  in  the  interval  (a,  x).  Let  &,&,...  j?», ...  be  the 
points  of  discontinuity  of  P  (*•),  and  let  s  (x)  denote  the  sum 


where  the  summation  is  taken  for  those  points  £  that  are  in  the  closed  interval 
(a,  x).  Thus  s  (x}  is  a  bounded  non-diminishing  monotone  function.  Since  the 
sum  2  (P  (£  +  0)  —  P  (£—0)}  is  convergent,  the  sum  can  be  divided  into  two 

b 

parts,  the  one  a  finite  sum 


2  {P(£  +  0)-P(£  -<))},  and  the  other    2    {P(£+  0)-  P(£-  0)}, 

b,m  b,m+l 

where  m  is  chosen  so  great  as  to  include  the  indices  of  all  those  points  of  the 
finite  set  at  which  P  (£  -I-  0)  —  P  (£  —  0)  =  e  and  is  also  so  great  that  the  second 
sum  is  <  e;  where  e  is  an  arbitrarily  chosen  positive  number. 

We  also  have  s(#)  =  2  {P(£  +  0)-P(f-0)}  +rj(x);  where,  in  the  first 


x,m 


sum,  only  those  of  the  set  £,  %2,  ...  £m  occur  which  are  in  the  closed  interval 
(a,  x},  and  where  77  (a?)  ^  77  (b)  <  e.  Let  us  consider  the  function  P  (x)  -  s  (as),  or 
P  (ar)  -  2  {P  (£  +  0)  -  P  (f  -  0)}  -  77  (#).  At  a  point  x,  at  which  P  (a)  has  a 

x,  m 

saltus  <  e,  the  saltus  of  P  (x}  —  s  (x}  is  <  2e  ;  for  the  saltus  of  77  (x)  cannot 
exceed  e,  and  the  function  2  {P  (£  +  0)  -  P  (£  -  0)}  is  continuous.  At  a  point 

x,m 

of  the  finite  set  at  which  P  (x}  has  a  saltus  ^  e,  P  (#)  -  2  [P  ((?  +  0)  -  P  (£  -  0)] 

r,  a; 

is  continuous.  Therefore  the  saltus  of  P(x}—  s  (x)  is  everywhere  <  2e.  Since 
e  is  arbitrary,  it  follows  that  P  (x)  —  s  (x)  is  continuous  in  the  closed  interval 
(a,  6).  A  similar  result  holds  for  the  function  N  (x).  Therefore  f(x)  is  the  sum 
of  a  continuous  function  <f>  (x),  of  bounded  variation  in  (a,  b),  and  of  a  function 

*  Lebesgue,  Annali  di  Mat.  (3  A),  vol.  vn,  p.  270. 
t  Lebesgue,  Lemons  sur  I'integration,  p.  56. 
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s  (x)  —  s  (x),  which  may  be  denoted  by  ^  (x).  The  function  %  (x)  is  of  bounded 
variation,  and  has  the  same  points  of  discontinuity  as  f(x).  We  have 
f  (x)  =  (f>  (x)  +  %  (x},  where  ^  (x)  denotes 


The  result  obtained  may  be  stated  as  follows: 

A  function  of  bounded  variation  in  the  interval  (a,  b)  is  the  sum  of  a  con- 
tinuous function  of  bounded  variation  and  of  a  function  of  bounded  variation 
with  the  same  discontinuities  as  the  given  function,  of  which  the  total 
variation  is  the  sum  of  the  saltuses  of  the  given  function  at  its  points  of  dis- 
continuity. 

KECTIFIABLE  CURVES. 

250.  Let  t  be  a  variable  denned  for  all  values  in  a  continuous  interval 
(t0,  ti),  and  let  fi(t),  f2(t)  be  two  single-valued  and  bounded  functions  of  t, 
denned  in  the  interval  (tQ,  ^);  the  equations  #=/i  (t),  y=fa(t)  may  be  said  to 
define  an  arc  of  a  plane  curve,  in  a  generalized  sense  of  the  term.  Let  a  net 
TO,  TI}  T2,  ...  Tn,  where  TO  =  t0,  rn  =  t1}  be  fitted  on  to  the  interval  (t0,  tj),  then 
the  points  on  the  curve  corresponding  to  the  end-points  of  the  meshes  of  the 
net  may  be  denoted  by  P0,  PI,  •••  Pwi  and  we  consider  the  unclosed  polygon 
of  which  the  sides  are  P0P1,P1P2,  .  .  .  PH_jPn,  inscribed  in  the  arc  of  the  curve. 
The  length  of  the  polygonal  line  P0P1P2  ...  Pn,  measured  by 

}•=-»  i 

S  {(xr  -  x^)*  +  (yr  -  yr-i)°}~, 

r  =  l 

depends  upon  the  particular  net  that  has  been  fitted  on  to  (t0,  ^),  and  may  be 
described  as  the  length  of  a  polygonal  arc  inscribed  in  the  arc  of  curve. 

If  the  arc  be  such  that  the  lengths  of  all  the  inscribed  polygonal  arcs  have  a 
finite  upper  boundary,  the  curve  is  said  to  be  rectifiable,  and  the  length  of  the  arc 
is  defined  to  be  the  value  of  that  upper  boundary.  Otherwise  the  length  of  the  arc 
is  regarded  as  infinite. 

This  general  definition  is  due  to  Peano  *.  An  earlier  definition  given  by 
Jordan^,  and  in  the  case  of  an  arc  defined  by  an  equation  y=f(x),  by  Scheeffer  J, 
is  of  a  less  general  character,  as  it  requires  that  the  lengths  of  any  sequence  of 
polygonal  arcs  should  converge  to  a  fixed  limit,  independent  of  the  particular 
sequence,  provided  the  polygonal  arcs  correspond  to  a  system  of  nets  fitted  on 
to  (t0,  £j).  It  will  be  shewn  that  when  the  arc  is  continuous,  the  definition  or 
Peano  is  equivalent  to  that  of  Jordan  and  Scheeffer. 

*  Rend.  Lincei  (4),  vol.  vi,  1,  p.  54.     See  also  Lebesgue,  Ann.  di  Mat.  (3  A),  vol.  vn,  p.  282, 
where  a  more  general  definition  is  introduced. 

t  Complex  Rendus,  vol.  xcn  (1881),  p.  228;  and  Cours  d'  Analyse,  vol.  in,  p.  594. 
J  Acta  Math.,  vol.  v,  p.  49.    See  also  Study,  Math.  Annalen,  vol.  XLVII,  p.  298. 
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The  following  theorem  will  be  established: 

The  necessary  and  sufficient  condition  that  the  arc  defined  by  x  =/,  (t), 
y  =/2  (t),  for  tQ  S.  t  £  ti ,  may  be  rectifiable  is  that  the  functions  f^  (t),  f2  (t)  should 
be  of  bounded  variation  in  the  interval  (t0,  ^). 

We  see  that 

•j=  [\xr  -  ov-i  |  + 1  yr  -  yr-i  1}  ^  [Or  -  av-i)8  +  (yr  -  yr-iY$ 

£  [\Xr-Xr_l\  +    yr-yr_1\}- 

by  taking  the  sums  of  all  the  expressions  for  r  =  1,  2,  3,  . . .  n,  it  is  clear  that  the 
necessary  and  sufficient  condition  that  the  perimeter  of  the  unclosed  polygon 
should  be  less  than  some  fixed  positive  number  is  that  this  should  also  hold  as 

r—n  r-n 

regards  S  |  xr  —  xr^  ,  and  2  |  yr  —  yr_a  ;  and  this  is  equivalent  to  the  condition 

r-l  r-l 

stated  in  the  theorem. 

From  the  known  character  of  functions  of  bounded  variation,  it  is  seen  that 
a  curve  that  is  rectifiable  in  the  interval  (t0,  t^  is  also  rectifiable  in  any  interval 
contained  in  (t0,  ^). 

251.  In  case  the  functions  /i  (t),  /2  (t)  are  both  continuous  in  the  interval 
(t0,  <i),  it  will  be  shewn  that  the  lengths  of  the  polygonal  arcs  of  any  sequence, 
such  that  the  greatest  side  of  a  polygon  converges  to  zero,  converge  to  the 
length  I  of  the  curve,  as  above  defined,  whenever  the  curve  is  rectifiable.  To 
establish  this  result  the  following  theorem  will  be  proved: 

If  the  continuous  arc  defined  by  x  =  /i  (t),  y  =  /2  (t),  t0£t  £tly  be  rectifiable, 
and  of  length  I,  then,  corresponding  to  an  arbitrarily  chosen  positive  number  e, 
a  positive  number  d  can  be  determined,  so  that  the  perimeter  of  any  unclosed 
polygon  inscribed  in  the  arc  is  >l  —  e,  provided  all  the  sides  of  the  polygon  are  <  d. 

The  assumption  that  the  arc  is  rectifiable,  and  of  length  I,  implies  that  a 
net  D0  can  be  fitted  on  to  the  interval  (t0>  ^),  such  that  the  length  of  the  corre- 
sponding polygonal  arc  inscribed  in  the  curve  is  >  I  —  ^e.  Let  D  be  any  other 
net  fitted  on  to  the  interval  (£0,  ^).  The  net  (D0,  D)  obtained  by  superimposing 
the  two  nets  D0,D,  is  such  that  the  corresponding  polygonal  arc  has  a  length 
>  I  —  |e.  Since  the  functions  /i  (t),  f2  (t)  are  continuous,  if  T?  be  a  prescribed 
positive  number,  a  number  8  can  be  determined  so  that,  in  any  interval  of  t  less 
than  8,  the  fluctuations  of  /3  (t),  f2  (t)  are  both  less  than  77.  We  shall  suppose 
D  so  chosen  that  all  its  meshes  are  of  breadth  less  than  8.  Let  s  be  the  number 
of  meshes  in  D0;  we  consider  the  excess  of  the  length  of  the  polygonal  arc 
corresponding  to  (D0,  D)  over  that  which  corresponds  to  D.  Consider  an  end- 
point  tu  of  the  meshes  of  D0,  that  is  contained  in  the  interior  of  a  mesh  of  D; 
we  can  suppose  the  meshes  of  D  so  small  that  only  one  such  point  can  be  con- 
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tained  in  a  mesh  of  D,  and  we  also  suppose  all  the  meshes  of  D  to  be  less 
than  S.  If  tr,  tr+l  be  that  mesh  of  D,  we  see  that  the  excess  of  the  sum  of  the 
chords  joining  tr,  tu  and  (tu,  tr+i)  over  that  joining  tr,  tr+1  is 

i-  -    'l      I  '        ---= 

and  this  is  less  than  477. 

We  now  see  that  the  perimeter  of  the  polygonal  line  which  corresponds 
to  D  is  >  I  —  Je  —  4^77. 

If  77  is  so  chosen  that  4^77  <  £e,  the  length  of  the  polygonal  arc  that  corre- 
sponds to  D  is  >  I  —  e.  In  order  that  this  may  be  the  case,  D  has  only  to  satisfy 
the  condition  that  the  maximum  breadth  of  its  meshes  is  less  than  some  fixed 
number  8,  and  this  condition  will  be  satisfied  if  the  greatest  side  of  the  corre- 
sponding polygonal  line  is  less  than  some  fixed  number  d. 

If  s  denote  the  length  of  the  arc  corresponding  to  (t0,  t),  where  t0<t  £tlt 
s  is  a  continuous  monotone  function  of  t',  and  it  is  clear  that  x  and  y  may  be 
regarded  as  functions  of  s,  say  #  =  </>  (s),  y  =  ^r  (s).  Since  A#/As,  A?// As  cannot 
be  numerically  greater  than  1,  all  the  derivatives  of  <j>  (s),  i/r  (s)  are  bounded,  and 
in  the  interval  (—1, 1).  It  will  be  shewn  in  §  298  that,  for  all  values  of  s  in  the 
linear  interval  (0, 1),  with  the  possible  exception  of  those  belonging  to  a  set  of 

(l  IT    (I  ?/ 

measure  zero,  the  differential  coefficients  -?->  7   exist,  and  are  finite.   The  above 

ds  as 

theory  is  applicable,  without  essential  change,  to  the  case  of  a  "curve"  in  three- 
dimensional  space,  defined  by  x  =/a  (t),  y  =/2  (t),  z  =/3  (t). 

The  case  of  an  arc  of  a  curve  y  =/(&•),  defined  for  the  interval  (a,  6),  of  x, 
may  be  considered  as  the  particular  case  of  the  above  which  arises  when  t  is 
identical  with  x.  We  see  then  that,  the  necessary  and  sufficient  condition  that 
the  arc  defined  by  y  =  f(x],  a^x^b,  may  be  rectifiable  is  that  the  function 
f(x)  is  of  bounded  variation  in  the  interval  (a,  6). 

EXAMPLES. 

1  2r  +  l 

1.  In*  the  interval  (0,  1),  let  f(%}  =  wni  when  x  is  any  of  the  points  -    — ,  and  let 

f(x}=0,  at  all  other  points.    In  accordance  with  Peano's  definition,  the  length  of  the  arc  is 

1  +  22    -^f-  =  2.    According  to  the  definition  of  Jordan  and  Scheeffer,  the  arc  is  not  rectifi' 
M=i    2-n 

able;  for  a  net  can  be  fitted  on  to  the  interval  (0,  1),  such  that  all  its  end-points  are 
represented  by  irrational  values  of  x,  except  the  points  0,  1 ;  the  length  of  the  correspond- 
ing polygonal  arc  is  1 . 

2.  An  arc  of  the  curve  y  =  x  sin  -  which  contains  the  origin  is  not  rectifiable ;  but  a  similar 
arc  of  the  curve  y  =  x~  sin  (x~%]  is  rectifiable. 

*  See  Schoenflies,  Bericht,  vol.  n,  p.  243. 
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THE  VARIATION  OF  A  FUNCTION  OF  BOUNDED  VARIATION 
OVER  A  LINEAR  SET  OF  POINTS. 

252.  Let  (f>  (x)  be  a  bounded  monotone  non-diminishing  function,  defined 
for  the  interval  (a,  b),  of  a?.  If  £  =  <£  (x),  the  segment  (a,  b)  has  a  functional  image 
on  a  segment  of  which  the  end-points  are  $  (a),  </>  (6)  respectively,  on  which  the 
points  £  are  represented.  If  <f>  (x)  be  continuous,  the  functional  image  consists 
of  all  the  points  of  the  interval  (</>  (a),  <£  (6)),  of  £;  but  if  (f>  (x)  be  discontinuous 
at  a  point  x,  there  are  no  points  of  the  functional  image  in  the  closed  interval 
(<£  (x  —  0),  <f>  (x'+  0))  except  the  single  point  <f>  (x').  We  may  now  suppose  that, 
to  each  point  x,  at  which  <£  (x}  is  continuous,  there  corresponds  the  point  %=<$>(x) 
on  the  ^-segment,  but  that,  to  each  point  x  ',  at  which  </>  (x}  is  discontinuous, 
there  corresponds  the  closed  interval  (<£  (x  —  0),  $  (x  -f  0))  on  the  ^-segment. 
If  0(x)  denote  any  set  of  points  in  the  ^-segment  (a,  b),  there  is  a  corresponding 
set  G^  in  the  ^-segment;  to  any  point  of  G(a;),  at  which  <£  (x)  is  discontinuous, 
there  corresponds  a  component  of  G^\  consisting  of  all  the  points  of  a  closed 
interval.  If  G(x]  consists  of  all  the  points  of  a  closed  interval,  6r(f)  consists  of  all 
the  points  of  a  closed  interval.  Similarly,  if  G(x]  consists  of  the  points  of  an  open 
interval,  G&  consists  also  of  the  points  of  an  open  interval.  It  now  follows  that, 
if  G(x}  be  measurable  (B),  on  the  ^-segment,  G^  is  measurable  (B),  on  the 
^-segment.  If  G(x)  be  any  measurable  set,  it  is  contained  in  a  set  GJX\  and  it 
contains  a  set  G2(x},  such  that  G^  and  G£x]  are  measurable  (B),  and  that  the 
set  GJX)  —  G2(x)  has  the  measure  zero.  The  two  sets  Gfi\  G^  are  both  measur- 
able (B),  but  it  is  not  necessarily  the  case  that  G^  —  G^  has  measure  zero. 
Unless  G-i(x)  and  G2{x>  can  be  so  chosen  that  the  measure  of  G^  —  G^  is  zero,  the 
set  G&  is  not  necessarily  measurable.  Even  if  the  function  <£  (x)  be  continuous 
in  (a,  b),  it  is  not  necessarily  the  case  that  there  corresponds  a  measurable 
set  G^  to  any  measurable  set  G{x}.  Only  when  </>  (x)  is  such  that,  to  every  #-set 
of  measure  zero,  there  corresponds  a  £-set  of  measure  zero,  is  G®  necessarily 
measurable  when  Gw  is  measurable. 

In  general,  there  corresponds  to  a  measurable  set  G(x]  a  set  G^\  with  exterior 
and  interior  measures  me(G^), 


The  exterior  and  the  interior  measures  of  the  set  G®,  on  the  ^-segment,  which 
corresponds  to  the  set  G,  on  the  x-segment,  are  defined  to  be  the  upper  and  lower 
variations  respectively,  of  the  function  <j>  (x)  over  the  set  G  ;  these  may  be  denoted 
by  V(G)(f>(x),  V(0)(j)(x)  respectively.  In  case  V(G]  $  (x}  =  V  (G}  <j>  (x),  their 
common  value  V(G]  fy  (x)  is  said  to  be  the  variation  of  the  monotone  function  <f>  (x) 
over  the  set  of  points  G. 

It  is  clear  from  the  definition  that 

V  (fl>  <f>(x)+V  (f<«)  <£  (x)  =  <j)  (b)  -  <f>  (a). 


H. 
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If  G(x}  be  measurable  (B),  since  G®  is  also  measurable  (B),  it  follows  that 
(f>(x)  has  a  definite  variation  over  the  set  G(x).  The  variation  of  </>  (x)  over  a 
closed  interval  (a,  /3),  contained  in  (a,  b),  is  </>  (/3  +  0)  —  <£  (a  —  0);  and  the  varia- 
tion over  the  open  interval  (or,  /3)  is  </>  (/3  —  0)  —  </>  (a  +  0).  In  case  a  =  a,  we 
take  </>  (a  —  0)  =  </>  (a);  and  in  case  /3  =•  6,  we  take  <£  (6  +  0)  =  <£  (6). 

If  /(#)  be  a  function  of  bounded  variation,  defined  for  the  interval  (a,  b), 
f(x)  —f(a)  is  expressible  as  the  difference  of  two  bounded  monotone  non- 
diminishing  functions  fa  (x)  and  fa  (x},  where  fa  (x)  and  —  fa  (x)  are  the  positive 
and  the  negative  variations  of  f(x)  in  (a,  x},  (see  §  244). 

The  sum  of  the  variations  of  fa  (x\  fa  (x)  over  a  set  G  x),  for  which  these 
variations  exist,  may  be  defined  to  be  the  total  variation  of  the  function  f  (x),  of 
bounded  variation,  over  the  set  G(x\ 

FUNCTIONS    OF   TWO   VARIABLES   THAT   ARE   OF   BOUNDED    VARIATION. 

253.  The  definition  of  a  function  of  bounded  variation,  in  an  interval, 
can  be  extended  to  the  case  of  a  function  of  two  or  more  variables,  defined  in 
a  given  cell.  It  will  be  sufficient  to  consider  the  case  of  a  function  f(x(l),  #(2)), 
defined  in  a  closed  cell  (a(1),  a(2);  6(1),  6(2)).  Let  a  set  of  n+  1  points,  in  the 
cell,  two  of  which  are  the  points  (a(1),  a(2)),  (6(1),  6(2)),  be  considered;  they  may 
be  denoted  by  (>0(1>,  #0<2>),  (a?/1),  a?^)  ...  (xr(l\  x™)  ...  (xn(l\  xn^);  where 

x0{l)  =  a*1',  x™  =  tt(2),  xnw  =  6(1),  xn(»  =  b(2). 
Further,  let  the  points  be  such  that,  for  each  value  of  r(=  1,  2,  3,  ...  n), 

T   (1)    >   T         (I)     T  (2)   >    T         (2)  • 
t^/r        fs  U/r_j       ,  Jjr        t=  -LI—  I       > 

and  consider  the  sum 


r=l 

If  this  sum  does  not  exceed  some  fixed  positive  number,  whatever  value 
n  may  have,  and  however  the  points  are  chosen,  subject  to  the  conditions 
above  stated,  the  function  f(x(l\  x('2))  is  said*  to  have  bounded  variation  in 
the  cell  (a(1),  a(2);  b(l\  6(2>).  The  upper  boundary  of  the  above  sum  may  then 

be  denoted  by  Vffi^fW",  ^ 

A  function  $  (x(l\  x(2))  is  said  to  be  a  monotone  non-diminishing  function 
in  the  cell,  if  </>  (acw,  x(2))  ^(f>(x(l)',  x(2}')  for  every  pair  of  points  (x(l\  x{2)),  (xm>,  x{2)/) 
such  thata;'1'  =  xw',  x(2)  ~  x(2)l.  If  the  inequalities  be  all  reversed,  we  have  the 
definition  of  a  non-increasing  monotone  function. 

If  we  consider  separately  those  terms  of  the  sum 


See  Arzela,  Bologna  Rend.  vol.  ix  (1904-5). 
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which  are  positive  and  those  which  are  negative,  and  denote  the  two  parts 
respectively  by  Sj,  22,  we  have 


further 

(1>,  av«)-(*M»>,  av-iw)},  -2, 


are  such  that  their  sum  cannot  exceed  Fl-i/    w?/(a?( 
It  follows  that  Sj  and  —  22  have  upper  boundaries 


and 

which  may  be  denoted  respectively  by 

P(ft(1''  b('^  f(x"  x^}  and  AT(6<1)'  b™]  fix™  x^ 
r  (aW,  a  (2>)  /  v*    '  *   ;,  ai  a  iv  ^ofl)>  a(2)j  /  \x   ,  x   ). 

These  numbers  may  be  called  the  total  positive  variation  and  the  total  nega- 
tive variation  off(x^,  a;(2))  in  the  cell  («»',  a<2>;  6(1),  6(2>). 

If  we  take  any  point  (x(l\  x(2})  in  the  given  cell,  we  may  denote  by 

PO(1),  tf<2>),  ^(^(1),  a'21) 

the  positive  and  the  negative  total  variations  of  the  function  in  the  cell 
(a(1\  a(2);  x(l\  x™}.  We  have  then 

v  K),  «<?)  /  0*(1)>  ^<2))  =  p  (^(1)'  *(2))  +  ^  (^(1)'  ^(2))> 

/0(1>,  a;'2>)  -/(a(l),  a(2))  =  P(>(1),  ^(2))  -  ^(«(l),  ^(2)), 

and  therefore,  since  the  two  functions  P  (x(l\  #(2)),  N(x{l),  #(2))  are  clearly  mono- 
tone and  non-diminishing,  we  see  thaty(#(l),  #<2))  can  be  expressed  as  the 
difference  of  two  such  functions,  or  also  as  the  difference  of  two  monotone 
non-increasing  functions. 

It  is  seen  at  once  that  any  function  which  is  the  difference  of  two  bounded 
monotone  functions  is  of  bounded  variation  in  the  cell.  It  has  thus  been 
shewn  that: 

The  necessary  and  sufficient  condition  that  a  function  f(x(l\  #<2))  should 
be  of  bounded  variation  in  the  cell  (a,  b)  is  that  it  should  be  expressible  as  the 
difference  of  two  bounded  monotone  functions. 

If  Ur,  Lr  are  the  upper  and  lower  boundaries  of  /(#(1),  #(2))  in  the  cell 

r=n 

(#V_!(1),  #r-i<2);  ®r(l\  ®r'2})>  we  consider  the  sum  2  (Ur  —  Lr),  where  the  n+I  points 

r=l 

satisfy  the  same  conditions  as  before.  If  this  sum  is  less  than  a  'fixed  positive 
number,  however  such  a  system  of  points  be  chosen,  the  upper  boundary  of 
the  sum  is  called  the  total  fluctuation  of  the  function  in  the  cell  (a(1),  a(2);  6(l),  6<2)), 

and  it  may  be  denoted  by  F,aW'  a(2)\/(#(1V#(a))-  The  function  is  then  said  to  be 
of  bounded  total  fluctuation  in  the  cell. 

21—2 
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We  see  that 


1  >      jr,  (*<«,    *<2>)  _    TT  (J!  W  +  feO',   S^  +  ft®) 

•*'  (a'1',  a<2')  (a»),  a<2>) 

and  that  the  function  F^J^xJ^f  (x(l\  #(2))  is  monotone  and  non-diminishing. 

As  in  §  248,  it  thus  appears  that  the  necessary  and  sufficient  condition  that 
f(x(l\  #(2))  should  be  of  bounded  total  fluctuation  is  that  it  should  be  the  dif- 
ference of  two  bounded  monotone  functions.  Comparing  this  with  the  corre- 
sponding theorem  for  functions  of  bounded  variation,  we  see  that  the  classes  of 
functions  of  bounded  variation,  and  of  bounded  total  fluctuation,  are  identical. 

254.    A  definition  of  functions  of  bounded  variation,  differing  from  that 
given  in  §  253,  has  been  employed  by  Hardy*  and  by  Krausef. 

Let  us  consider  the  number 

A1X  X)  /<*(i)'  *(2))  =  /^(1)>  £(2)>  -./w  a<2)>  -/<«(1)'  £(2)>  +/(«(i)>  a(2))> 

related  to  the  cell  (a(1),  a(2);  /3(I),  /3(2)),  where  /3(1)  ^  a(1),  /3<2)  ^  a(2).    It  is  seen  at 
once  that,  if  a  net  be  fitted  on  to  the  cell  (a(1),  a<2);  6(]),  b('2)), 

A  (b(1).  5B))  /Y^>  ^2>)  -  2  A  (/3<1)'  ^'2))  /WVV 

2  '  ' 


where  the  summation  on  the  right-hand  side  is  taken  for  all  the  meshes 
(a(1),  a(s);  yS(1),  ^(2))  which  make  up  the  cell  (a(l),  a(2);  6(1),  6(2)). 

If  the  sum  £  j  A  ra)'  £;(J/(#(l),  ic<2))  is  less  than  some  fixed  positive  number, 

for  all  possible  nets  that  can  be  fitted  on  to  the  cell  (<x(l),  a(2);  b(1\  b('2)),  and  if 
further  the  function  /(#,  y)  is,  for  each  value  of  x,  a  function  of  bounded 
variation  with  respect  to  y,  and  for  each  value  of  y  a  function  of  bounded 
variation  with  respect  to  x,  then  /  (x,  y)  is  said  to  be  a  function  of  bounded 
variation  in  the  cell. 

It  has  been  pointed  outjbyW.  H.  Young  that  there  is  a  certain  redundancy 
in  the  last  two  conditions.  It  will  in  fact  be  shewn  that  it  is  sufficient  that 
f(x,  y)  should  be  of  bounded  variation  with  respect  to  y,  for  one  fixed  value 
of  x,  and  with  respect  to  x  for  one  fixed  value  of  y;  it  being  assumed  that  the 
first  condition  is  satisfied. 

If  the  first  condition  is  satisfied,  let  V(awa®\  /(^'(1).  *'(2))  be  the  upper 
boundary  of  the  sum  ^  \  A  jr  ft)'  P(J  f(x(l},  x{2>)  \  for  all  nets  that  can  be  fitted  on  to 

the  cell  (a(1),  a(2)  ;  b(l},  6(2)).  If  the  sum  X  be  divided  into  two  parts,  2^  and  22, 
where  2j  denotes  the  sum  of  those  terms  for  which  A  is  positive,  and  S2  denotes 

*  Quarterly  Journal  of  Math.,  vol.  xxxvn,  p.  57  et  seq. 

f  Leipziger  Ber.,  vol.  LV,  pp.  164,  239  ;  see  also  Vergerio,  Gior.  di  Mat.,  1911,  p.  181. 

I  Proc.  Loud.  Math.  Soc.  (2),  vol.  xi,  p.  142. 
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the  sura  of  those  terms  for  which  A  is  negative,  we  see  that  2j  A/(#(1>,  x{'2>)  and 
—  22  A/(aj(1),#(2))  have  finite  upper  boundaries 

P(a<l>,  a(2)  ;  /3(1),  /3<2)),  ^(a'1',  a<2>  ;  £<»,  /3<2'). 

The  two  functions  P  (<&»>,  tf<2>),  #(0(1>,  a?w)i  Defined  as  PK!'][2/(0W,*W), 

JV/^n'^J  f(x(l},  #(2))  respectively,  are  monotone  functions,  in  the  sense  that, 

if  a?<»  5  £<»,  a?w  S*<«,thenP(ar'lll  x®)  £  P  (v("  ,x®),  N  (x 

We  have 


Assuming  that  f(xm,  a(2))  is  a  function  of  xm  ,  of  bounded  variation  in  the 
interval  (a(1),  6(1)),  it  is  equal  to  p(x(^)  —  n(xw),  where  p(#(1)),  rc(#(1))  are 
monotone  functions  of  x(l].  Similarly,  assuming  that/(a(1),  x(z])  is  of  bounded 
variation  in  the  interval  (a(2),  6(2)),  of  #(2),  it  is  representable  as  the  difference 
j9'(a;(2))  -w'(a>(2))  of  two  monotone  functions  of  «(2). 

If  P  (a?ft>,  a'<-2>)  =  P  O(1),  a?(2>)  +p  («'»)  +  /  (a;®), 

2\T(>(1),  a:(2))  =  JV(«(I),  ar(2))  +  n  («(l))  +  n'  (.«(2)), 
we  have 

/(«<»,  «(2))  =  P(^(1),  «(2))-iV:(«(1),  0(a))  -/(«(1),  a(a)), 

where  P,  N  are  monotone  non-diminishing  functions.  It  has  now  been  shewn 
that  a  function  f(oc(l},  #(2))  of  bounded  variation,  in  accordance  with  the  definition 
here  given,  is  expressible  as  the  difference  of  two  bounded  monotone  functions. 

The  definition  here  given  of  a  function  of  bounded  variation  is  less  general 
than  that  of  Arzela,  given  in  §  253.  An  example  has  been  constructed*  by 
Kiistermann,  of  a  function  which  is  of  bounded  variation  in  accordance  with 
Arzela's  definition,  but  not  in  accordance  with  that  here  considered. 

The  variation  of  f(x(l\  k('2))  for  any  fixed  value  &(2  ,  of  y(2\  clearly  does  not 
exceed  the  sum  of  the  variations  of  f(x(l\-  a  2)),  of  P  (x(l\  &(2)),  and  of  N(x{1\  &(2)). 
It  is  therefore  bounded,  on  the  assumption  that  the  variation  of  f(x(l\  a(z})  is 
bounded.  Similarly,  on  the  assumption  that  f(a(l\  x(2  )  is  bounded,  that  of 
f(k(l\  x('i])  is  bounded,  for  any  fixed  value  k(l\  of  x(l]. 

QUASI-MONOTONE    FUNCTIONS. 

255.    A  function  f(x(l\  x('2]}  may  satisfy  the  condition 

A  I*  !".;  *3/^(i)>  *-(2))  =  °.  for  ^(i)  -  ^(i)'  *(2)  =  ^ 

and  yet/(a;(1),  ai(2))  need  not  be  monotone  and  non-diminishing  with  respect  to 
x(*\  for  each  constant  value  of  ,/;(l>  ;  nor  need  it  be  monotone  and  non-diminishing 
with  respect  to  x(1\  for  each  constant  value  of  x(-\  For  the  expression  may  be 
written  in  either  of  the  forms 

{/(X(l\  X®)  ~f(xM,  X(»)}  -  {f(x(l),  *(2))  ~f(x(l\  A'(2))}, 

[f(x(i\  x(»)-f(x(l\  x®)}  -  {f(xM,  x(*)-f(xw,  x(*}\. 
*  Math.  Annalen,  vol.  LXXVII,  p.  474. 


326  Functions  of  a  real  variable  [OH.  v 

The  expression  in  any  one  of  the  four  brackets  may  have  either  sign,  con- 
sistently with  the  condition  stated,  except  that  the  first  bracket  in  either  case 
cannot  be  negative  whilst  the  second  is  positive. 

We  may  consequently  consider  four  types  of  functions  f(x,  y]  which  satisfy 
the  condition  A  gj','  *(*j  f(x(l\  x(2))  ^  0,  for  xw  ^  x(l\  xw  ^  x(*\  viz. : 

(1)  Those  for  which  f(x(l\x(-})  is  monotone  and  non-diminishing  with  respect 
to  x(l\  for  every  constant  value  of  #(2);  and  is  also  monotone  and  non-diminishing 
with  respect  to  #(2),  for  every  constant  value  of  x(l}. 

(2)  Those  for  which  f(x(l\  #(2))  is  monotone  and  non-diminishing  with 
respect  to  #(1),for  every  constant  value  of  #(2),and  is  monotone  and  non-increasing 
with  respect  to  #(2),  for  every  constant  value  of  x(l). 

(3)  Those  for  which  f(x(l),  a;(2))  is  monotone  and  non-increasing  with  respect 
to  x(l\  and  is  monotone  and  non-diminishing  with  respect  to  #(2). 

(4)  Those  for  which  f(x(l\  #(2))  is  monotone  and  non-increasing  with  respect 
to  x(l\  and  is  monotone  and  non- increasing  with  respect  to  x(*\ 

A  function  f{x(l\  x(<i))  which  satisfies  the  condition  A/(#(l),  #(2))  ^  0,  for 
every  pair  of  points,  or  else  the  condition  A/(#(l),  x{2))  £  0,  for  every  pair  of 
points,  and  also  one  of  the  four  conditions  given  above,  everywhere  in  its 
domain,  may  be  said  to  be  a  quasi-monotone  function  of  one  of  the  four  types 
which  correspond  to  the  conditions  (1),  (2),  (3),  (4),  respectively. 

It  is  easy  to  see  that  this  definition  can  be  extended*  to  the  case  of 
functions  of  any  number  of  variables. 

Since/(a?(1),  x(*)  =  A/(^(1>,  x(*}  +/(a;(1  V2))  +/(a(1)>  «(2))  -/(«(l),  «(2))  it  is  seen 
that  every  quasi-monotone  function  is  the  sum  of  a  monotone  function  of  the 
two  variables,  and  of  monotone  functions  of  the  separate  variables,  such  that 
any  one  of  the  three  may  be  either  non-increasing  or  non-diminishing.  The 
converse  does  not  however  hold ;  in  fact  a  function  can  be  monotone  without 
necessarily  being  quasi-monotone. 

THE   MAXIMA,   MINIMA,   AND   LINES   OF   INVARIABILITY   OF   CONTINUOUS 

FUNCTIONS. 

256.  Consider  a  point  x1  within  the  interval  (a,  6),  in  which  a  continuous- 
function  is  defined;  it  may  happen  that  a  neighbourhood  (xl  —  8,  x^  +  8)  of  the 
point  x-i  can  be  found,  by  taking  8  sufficiently  small,  which  is  such  that  /(#)• 
has  the  same  value  at  all  points  in  the  neighbourhood ;  then  the  point  xl  is 
called  a  point  of  linear  invariability  of  the  function.  If  the  same  holds  for  a 
neighbourhood  of  xl  on  the  right  only,  or  on  the  left  only,  then  the  point  x1 
is  called  a  limiting  point  of  linear  invariability, 

*  See  W.  H.  Young,  Proc.  Land.  Math.  Soc.  (2),  vol.  xvi,  p.  277.  The  term  monotone  is  there; 
applied  to  a  function  described  above  as  quasi-monotone. 
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It  can  be  shewn  that,  if  a  point  OPI  of  linear  invariability  exist,  and  the 
function  be  not  constant  in  the  whole  interval  (a,  b),  then  there  exist  two 
limiting  points  of  linear  invariability,  one  of  which,  however,  may  be  at  one 
of  the  ends  of  the  interval  (a,  b).  Suppose  the  function  not  to  be  constant 
throughout  the  interval  (xlf  6) ;  the  points  x  of  this  interval  may  be  divided 
into  two  classes,  in  one  of  which  a  is  such  that,  in  the  interval  (x1}  x),  the 
function  has  the  constant  value  /(#i),  and  in  the  other  class  x  is  such  that 
(#i,  x}  contains  points  at  which  the  function  has  values  differing  from  /(#a) ; 
a  section  is  thus  made  of  the  interval  (xlt  b).  This  section  defines  a  point 
which  is  the  required  limiting  point  of  the  linear  invariability.  If  the  same 
argument  be  applied  to  the  interval  (a,  x^}  we  see  that  there  is  another 
limiting  point  in  this  interval,  unless  the  function  be  throughout  equal  to 

/to- 

In  the  interval  (a,  b)  there  may  be  a  finite  set,  or  an  indefinitely 
great,  but  enumerable,  set  of  lines  of  invariability ;  each  point  within  such  a 
line  is  a  point  of  invariability,  and  the  ends  of  such  lines  are  limiting  points 
of  invariability. 

If  the  point  x^  be  not  a  point  of  invariability,  it  may  happen  that  a  neigh- 
bourhood (#!  —  e,  x-i  +  e')  exists,  such  that,  for  every  point  in  the  interior  of  this 
neighbourhood,  not  identical  with  xl}  the  condition  f(x)  <f(xl)  is  satisfied;  in 
this  case  #j  is  said  to  be  a  point  at  which  the  function  has  a  proper  maximum. 
In  case  the  neighbourhood  be  such  that  at  every  point  x  within  it,  except  at 
xlt  the  condition  f  (x)  >f(x1}  is  satisfied,  the  point  x-±  is  said  to  be  a  point  at 
which  the  function  has  a  proper  minimum. 

It  may  happen  that,  when  x^  is  not  a  proper  maximum,  a  neighbourhood 
(#!  —  e,  x-i  +  e')  exists  which  is  such  that  at  no  point  within  it  the  condition 
f(x)  >/(#i)  is  satisfied,  whilst  not  at  every  point  is  the  condition  f(x)  <f(xl) 
satisfied.  If  this  is  the  case  for  every  neighbourhood  interior  to  (^  —  e,  x1  +  e') 
then  ^  is  said  to  be  a  point  at  which  there  is  an  improper  maximum  of  the 
function.  If  the  condition  f(x)  =/(#0  is  satisfied,  but  the  condition  f(x)  >/(#i) 
is  not  everywhere  satisfied,  then  x-^  is  said  to  be  a  point  at  which  there  is  an 
improper  minimum  of  the  function. 

A  line  of  invariability,  of  which  the  end-points  are  a,  ft,  these  being  both 
interior  to  (a,  b),  is  said  to  be  a  maximum  of  the  function,  if  both  a,  ft  be 
improper  maxima,  and  it  is  said  to  be  a  minimum,  if  both  a,  ft  be  improper 
minima. 

It  is  clear  that,  in  any  arbitrarily  small  neighbourhood  of  an  improper 
maximum  or  minimum,  there  is  an  indefinitely  great  number  of  points  at 
which  the  functional  value  is  equal  to  that  at  the  maximum  or  minimum. 

At  any  maximum  or  minimum  there  is  a  greatest  neighbourhood  (#j  —  B, 
x,  —  B')  at  every  interior  point  of  which  the  condition  f(x) <f(xl),  f(x)  ^f(x^), 
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or  f(x)  >f(xi),  f(x)  ^/(X)  is  satisfied.  At  end-points  of  such  greatest 
neighbourhood,  it  follows  from  the  condition  of  continuity  of  the  function, 
that  the  functional  value  is  equal  tof(xi),  unless  such  end-point  coincides  with 
a  or  with  b. 

It  has  been  shewn  in  §213  that  there  exists  either  one  point,  or  a  set  of 
points,  in  (a,  b),  such  that  the  functional  value  at  this  point,  or  at  all  the  points 
of  the  set,  is  greater  than  at  all  other  points  in  the  interval ;  and  it  is  to  be 
remarked  that  this  set  of  points  may  contain  lines  of  invariability.  Every  such 
point,  unless  it  be  an  end-point,  is  said  to  be  a  point  of  absolute  maximum  of 
the  function  in  the  interval  (a,  b),  and  may  be  either  a  proper  or  an  improper 
maximum.  A  similar  definition  applies  to  an  absolute  minimum. 

In  case  an  extreme  point  of  the  continuous  function  (see  §  209)  be  at  a, 
or  at  b,  such  point  is  spoken  of  as  an  upper  or  lower  extreme,  but  not  always 
as  a  maximum  or  minimum  of  the  function.  If  f(a)  and  f(b)  be  equal,  and 
the  function  be  not  constant  in  (a,  b),  then  there  is  at  least  one  maximum  or 
one  minimum  point,  or  one  line  of  invariability,  in  the  interior  of  (a,  b).  This 
is  also  true  when  f(a}  J=f(b),  unless  the  function  be  monotone. 

257.  If,  within  the  interval  (a,  b),  there  be  two  points,  or  two  lines  of  invari- 
ability, at  which  the  continuous  function  is  a  maximum,  proper  or  improper, 
then  there  is  between  them  at  least  one  point,  or  one  line  of  invariability,  at 
which  the  function  is  a  proper  or  improper  minimum;  thus  maxima  and 
minima  occur  alternately. 

Suppose  that  a,  /3  are  two  points  at  which  the  function  is  a  maximum, 
and  that  (a,  /3)  is  not  entirely  a  line  of  invariability ;  also  that  no  maximum 
occurs  between  a.  and  /3.  We  know  that,  between  a  and  /3,  there  is  a  point,  or 
a  set  of  points,  at  which  the  function  is  less  than  at  all  other  points  in  the 
sub-interval ;  and  since  a  and  ft  cannot  belong  to  such  set,  there  is  therefore 
a  minimum  at  a  point,  or  at  points  on  a  line  of  invariability,  between  a  and 
@,  and  this  minimum  is  less  than  either  of  the  maxima  at  a.  and  ft. 

Between  a  maximum  and  the  next  minimum  of  a  continuous  function  the 
function  is  said  to  make  an  oscillation,  the  amplitude  of  which  is  the  excess  of 
the  maximum  over  the  minimum. 

If  x1  be  a  point  in  (a,  b),  it  may  be  possible  to  choose  e  so  small  that,  within 
the  interval  (&\,  xl  +  e),  no  maxima  or  minima  occur,  so  that  the  function  is 
monotone  in  this  interval.  It  may  however  be  the  case  that,  however  small 
c  is  taken,  there  still  occur  maxima  and  minima  in  (xl}  xl  +  e).  In  this  case 
the  number  of  oscillations  of  the  function  must  be  indefinitely  great,  however 
small  e  may  be  chosen ;  for  if  there  were  a  finite  number  only,  a  number  ej 
could  be  found  such  that  all  the  maxima  and  minima  were  in  the  interval 
(xl  +  €1,  Xi  4-  e),  and  thus  in  (xl}  Xi  +  e^  the  function  would  be  monotone,  which 
is  contrary  to  the  hypothesis  made. 
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It  thus  appears  that,  in  the  neighbourhood  of  a  particular  point,  a  con- 
tinuous function  may  have  an  indefinitely  great  number  of  oscillations.  An 
improper  maximum  or  minimum,  not  in  a  line  of  invariability,  is  certainly 
such  a  point. 

The  proper  maxima  and  minima  of  a  continuous  function  form  an  enumer- 
able, or  a  finite,  set  of  points. 

Consider  (#,  —  e,  #j  +  17),  the  greatest  neighbourhood  of  a  point  of  proper 
maximum  xlt  which  is  such  that,  for  all  other  points  x  within  the  neighbour- 
hood, f(x)  <f(al).  There  can,  in  a  finite  interval,  be  only  a  finite  number  of 
such  points  x^  for  which  e  >oc,  rj>  a,  where  a  is  a  fixed  positive  number;  for 
if  there  were  an  infinite  number  of  such  points,  they  would  have  a  limiting 
point  £,  and  we  could  choose  two  points  #/,  a,\"  of  the  set,  such  that  the  dis- 
tance of  each  from  £  is  less  than  |a;  then  each  of  these  points  would  lie 
within  the  neighbourhood  belonging  to  the  other,  and  thus  we  should  have 
f(®i)>f(x"),  and  also/(.T1")  >/(#/),  which  is  impossible;  thus  the  set  must 
be  finite.  Now  choose  a  sequence  of  descending  values  of  a  which  converges 
to  zero,  say  a1}  a2,  ...  aM,  ...  ;  the  number  mn  of  maxima  ^  such  that  for  each 
e  >  an,  i)  >  an  being  finite,  we  have  m-^  m2,  ...  mn,  ...  all  finite  ;  and  hence  the 
whole  set  of  maxima  forms  an  enumerable  set. 


If  x1  be  an  improper  maximum  point,  and  /(#i)  =  A,  then  a  neighbourhood 
(#j  —  €,#!  +  77)  can  be  found  which  contains  an  infinite  set  of  points  GA  such 
that  f(x)  =  A  ,  for  each  point  of  the  set.  If  x'  be  an  isolated  point  of  the  set 
GA  ,  then  x  is  clearly  a  proper  maximum  of  the  function  ;  and  if  x"  be  a  point 
of  GA,  which  is  a  limiting  point  of  the  set,  a?"  is  an  improper  maximum.  The 
points  #!  —  e,  xl  +  v  need  not  be  maxima,  even  though  they  be  limiting  points 
of  GA  .  The  condition  of  continuity  of  the  function  ensures  that  the  set  GA  is 
a  closed  one  ;  for,  at  any  limiting  point  of  the  set,  the  functional  value  is  the 
limit  of  a  sequence,  each  member  of  which  is  A,  and  this  value  is  therefore 
itself  A. 

Corresponding,  to  a  given  A,  there  may  be  a  finite,  or  an  infinite,  set  of 
detached  intervals  such  as  (x^  —  e,  xl  +  ij),  each  one  of  which  contains  a  closed 
set  such  that  each  isolated  poini  of  it  is  a  proper  maximum,  and  each  limiting 
point  (except  an  end-point)  is  an  improper  maximum.  The  set  GA  may  con- 
tain perfect  components,  and  thus  the  improper  maxima  at  which  A  is  the 
functional  value  may  form  a  set  of  the  cardinal  number  of  the  continuum.  A 
similar  result  holds  for  minima. 

It  can  further  be  shewn  that  the  values  of  a  continuous  function  at  all 
its  maxima  and  minima  form  a  set  which  is  either  finite  or  enumerably 
infinite. 
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258.  If,  in  the  interval  (a,  6),  the  function  have  only  a  finite  number  of 
maxima  and  minima,  counting  any  line  of  invariability  which  is  a  maximum 
or  minimum  as  one  maximum  or  minimum,  the  interval  can  be  divided  into  a 
finite  number  of  parts  in  each  of  which  the  function  is  monotone ;  the  function 
is  then  said  to  be*  in  general  monotone  (abtheilungsweise  monoton). 

If  the  function  have  an  indefinitely  great  number  of  maxima  and  minima, 
which  occur  either  at  points  or  at  lines  of  invariability,  the  function  then  makes 
an  infinite  number  of  oscillations ;  and  these  may  occur  in  the  neighbourhoods 
either  of  a  finite  number  of  points,  or  of  an  infinite  number  of  points. 

It  can  be  shewn  that,  in  the  case  of  a  continuous  function,  although  there 
may  be  an  infinite  number  of  oscillations  of  the  function,  there  can  be  only 
a  finite  number  of  which  the  amplitude  exceeds  an  arbitrarily  small  fixed 
number  a-. 

For  it  has  been  shewn  in  §  217  that  a  number  e  can  be  determined,  such 
that,  in  any  sub-interval  of  length  e,  the  fluctuation  of  the  function  does  not 
exceed  a  ;  therefore  in  each  of  the  sub-intervals 

(a,  a  4-  e),     (a.  4-  e,  a  +  2e),  ...  (a  +  ne,  b), 

the  fluctuation  of  the  function  is  not  greater  than  <r.  It  follows  that  no  oscillation 
of  the  function  which  is  greater  than  a  can  be  completed  in  one  of  these  sub- 
intervals,  and  that  such  an  oscillation  must  require  two  at  least  of  these  sub- 
intervals  for  its  completion ;  hence  the  number  of  such  oscillations  in  (a,  b) 
cannot  exceed  the  finite  number  n.  As  the  number  tr  is  diminished  indefinitely, 
it  may  happen  that  the  number  of  oscillations  of  which  the  amplitude  exceeds 
a  is  increased  indefinitely. 

If  the  point  a.  is  an  isolated  point  of  discontinuity  of  a  function  /(#),  and 
if  /(a  +  0),/(a  +  0)  be  unequal,  there  is  an  infinite  number  of  oscillations  in 
the  interval  (a,  a  +  e).  This  may  also  be  the  case  if  /(a  +  0)  have  a  unique  value. 


THE  DERIVATIVES  OF  FUNCTIONS. 

259.    If  a  function  f(x)  be  defined  for  all  points  in  the  interval  (a,  b),  then, 

f(x)  —  fix) 
for  a  point  x^  in  this  interval,  we  may  regard  the  function  J—  -  as  a 

function  F  (x),  of  x,  which  is  defined  for  all  values  of  x  in  (a,  b),  except  for  the 
point  X-L.  This  function  F  (x),  although  undefined  at  the  point  x1}  has  finite 
or  infinite  functional  limits  at  that  point,  in  accordance  with  the  definitions 
in  §  220. 

*  This  term  is  due  to  C.  Neumann ;  see  his  work  Ueber  die  nach  Kreis-  Kugel-  und  Cylinder- 
functionen  fortschreitendoi  Reihen. 
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If  the  limits  F  (xl  -f  0),  F(xl  —  0)  both  exist,  and  have  the  same  value,  either 
finite,  or  +  oo  ,  or  —  oo  ,  this  value  is  called  the  differential  coefficient  at  xl 
of  the  function  /(#).  At  the  point  a,  if  F  (a  +  0)  exists,  it  is  frequently  said 
to  be  the  differential  coefficient  of  f(x)  at  a;  and  at  the  point  b,  if  F(b  —  0) 
exists,  it  is  said  to  be  the  differential  coefficient  of  f(x)  at  6. 

The  condition  that/(#)  may  possess  a  finite  differential  coefficient  at  xi  is 
that,  corresponding  to  each  arbitrarily  chosen  positive  number  e,  a  neighbour- 
hood (x1  —  B,  #!  +  B)  can  be  found,  such  that 


for  every  pair  of  points  £,  f  which  lie  within  this  neighbourhood,  or  within 
such  part  of  it  as  is  interior  to  (a,  b). 

In  other  words,  the  condition  is  that  a  neighbourhood  of  &\  can  be  found 

f(x)  —  f(x  ) 
such  that  the  fluctuation  of  the  function  -^  —  .          within  it,  or  within  such 

x  —  a\ 

part  of  it  as  lies  in  (a,  b),  may  be  as  small  as  we  please. 

If,  corresponding  to  an  arbitrarily  large  positive  number  A,  a  neighbourhood 

f(x)  —  f(x) 
(x1  —  B,  xl  +  8)  can  be  found,  such  that  -     —  y         >  A,  for  every  point  x  (=£  x^ 

3u  ~~  CC  j 

in  that  neighbourhood,  f(x)  has  the  differential  coefficient  +  oo  at  the  point  xlt 

When  a  differential  coefficient  of  f(x)  exists  at  the  point  #1;  the  function 
is  said  to  be  differentiable  at  xl  ;  and  the  differential  coefficient  at  that  point 
may  be  denoted  by/'  (x^). 

In  geometrical  language,  when  the  function  f(x)  has  a  differential  co- 
efficient at  xlt  the  curve  y=f(x)  is  said  to  have  a  tangent  at  the  point 
(xl,f(x1)).  In  case  the  differential  coefficient  is  -f  oo  ,  or  —  x  ,  the  tangent  is 
parallel  to  the  y-axis,  and  the  point  is  a  point  of  inflexion  of  the  curve. 

That  a  function  f(x)  may  have  a  finite  differential  coefficient  at  xl,  it  is 
necessary,  but  not  sufficient,  that  xl  should  be  a  point  of  continuity  of  the  function. 

At  a  point  of  discontinuity  OKI,  off(x),  there  always  exists  a  positive  number 
<r,  such  that  in  any  neighbourhood  of  xl}  however  small,  points  £  exist  such  that 
i  /(£)  ~f(xi)  \  >  a  j  hence  if  A  be  any  arbitrarily  great  positive  number,  in  the 
interval  (^  —  B,  xl  +  8),  where  B  <  <r/A,  there  exist  points  £  such  that 


—  > 


f(x\  —  f(x  } 
and  it  is  thus  impossible  that </—  -  should  have  a  definite  finite  limit 

at  #!.    On  the  other  hand,  the  condition  for  the  existence  of  a  finite  differential 
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f(x)  —  f(x ) 
coefficient,  viz.  that  —     — J      }  should  have  an  arbitrarily  small  fluctuation 

tJu  ~~"  OC-\ 

within  a  sufficiently  small  neighbourhood  of  xly  is  not  necessarily  satisfied 
when  the  condition  of  continuity,  viz.  that  f(x)  should  have  an  arbitrarily  small 
fluctuation  within  a  sufficiently  small  neighbourhood  of  xlt  is  satisfied. 

If,  at  a  point  x,  the  function  f(x)  have  a  finite  differential  coefficient  f  (x), 
or  have  an  infinite  differential  coefficient  of  fixed  sign,  an  interval  (as  —  8e,  x  4-  8e) 

f(x  \  _  f(x  \ 
can  be  so  determined*  that  the  ratio  —      — ^    -  is  between  /'  (x)  +  €  and 

f  (x)  —  e,  where  xl  is  any  point  in  the  interval  (x  —  Be,  x),  and  x2  is  any  point 
in  the  interval  (x,  x  +  8e) ;  or  that  the  same  ratio  is  >  N,  an  assigned  positive 
number,  if  /'  (x)  =  +  oo  ,  or  <  -  N,  if  /'  (x)  =  -  oo  . 

A  continuous  function  f(x),  defined  for  the  interval  (a,  &),  which  has  a 
differential  coefficient  at  every  point  of  the  interval,  is  said  to  be  differentiate 
in  its  domain.  Continuous  functions  exist,  which  at  no  point  in  their  domain 
possess  a  differential  coefficient.  The  first  example  of  such  a  function  was  given 
by  Weierstrass;  the  construction  of  such  functions  will  be  considered  in  Vol.  II. 

That  a  continuous  function  possesses  a  differential  coefficient  was  formerly 
regarded  as  obvious  from  geometrical  intuition,  it  being  supposed  that  such 
functions  were  necessarily  representable  by  curves  possessing  definite  tangents 
at  every  point.  The  first  attempt  to  prove  the  existence  of  a  differential 
coefficient  of  a  continuous  function  was  that  of  Ampere  f ;  this  proof  was, 
however,  insufficient,  even  in  the  case  of  those  continuous  functions  which  make 
only  a  finite  number  of  oscillations  in  the  intervals  for  which  they  are  defined. 
It  is  now  fully  recognized  that  the  class  of  continuous  functions  is  much  wider 
than  that  of  functions  capable  of  an  approximate  graphical  representation  ;  and 
that  the  conditions  for  the  existence  of  definite  differential  coefficients  are  of 
a  much  more  stringent  character  than  would  be  the  case  if  they  were  included 
under  the  bare  condition  of  continuity  of  the  function. 

260.    It  may  happen  that,  at  a  point  xl ,  the  function  —     — - —  -  possesses 

finite,  or  even  indefinitely  great,  limits  on  the  right  and  on  the  left  at  #, ,  which 
differ  from  one  another ;  the  function  is  then  said  to  have  derivatives,  on  the 
right,  and  on  the  left,  at  <cl.  These  are  frequently  spoken  of  as  the  progressive 
and  regressive  differential  coefficients,  or  derivatives,  respectively.  A  function 
may  possess  a  progressive  derivative  and  no  regressive  derivative,  or  the 
reverse. 

When,  at  the  point  xly  the  function  f(x)  has  no  differential  coefficient,  and 

f(x\  —  /(X) 
neither  progressive  nor  regressive  derivatives,  the  function  —  -  has,  i 


in 

x  — 


*  See  Bromwich's  Theory  of  Infinite  Series,  p.  490. 
t  Joitrn.  Ecol.  polyt.,  vol.  vi,  1806,  p.  148. 
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accordance  with  §220,  four  extreme  limits,  an  upper,  and  a  lower  one, 
on  the  right,  and  on  the  left.  Any  one  of  these  four  may  be  either  finite  or 
infinite. 

There  is  in  general  an  aggregate  of  functional  limits,  on  the  right,  of 

J\x)~J\xv  ^  an(j  a|so  an  aggregate  of  functional  limits  on  the  left.    The  upper 

x  —  x-i 
and  lower  limits,  on  the  right,  which  are  the  upper  and  lower  boundaries  of 

the  aggregate  of  functional  limits  of  —    — J        at  x1}  on  the  right,  are  defined 

OC  ^  &'i 

to  be  the  upper  and  lower  extreme  derivatives  of  f(x),  at  x1}  on  the  right,  or 
simply,  the  upper  and  lower  derivatives  on  the  right,  and  these  are  in  accordance 
with  the  notation  of  Scheeffer*,  denoted  by  D+f(x),  D+f(x)  respectively. 
Similarly,  the  upper  and  lower  derivatives  of  f(x)  at  xlt  on  the  left,  are  the 
upper  and  lower  boundaries  of  the  aggregate  of  functional  limits  on  the  left, 
and  are  denoted  by  D~f(x),  D_f(x}  respectively. 

As  x  has  the  values  of  a  sequence  converging  to  x1  on  one  side,  the  values 

f(x\  —  f  (x  } 
of  •/_:     — J-±  U  mav  converge  either  to  the  upper  or  to  the  lower  derivative  of 

OC  ^~  ct/j 

f(x)  at  #! ,  on  that  side,  or  they  may  converge  to  some  number  lying  between  these 
extreme  derivatives.  In  the  last  case  they  are  said  to  converge  to  a  median 
derivative  on  that  side.  The  upper  and  lower  derivatives  on  one  side,  together 
with  any  median  derivatives  on  that  side  which  may  exist,  make  up  the  aggre- 

f(x)  —  f(x ) 
gate  of  functional  limits  of  —     — —   -  at  x1}  on  the  side  under  consideration. 

SC    "™~     I// J 

In  case  the  function  f(x)  is  continuous  in  an  open  interval  on  the  right  of 
xl}  the  median  derivatives  have  every  value  between  D+f(x)  and  D+f(x)  (see 
§  229),  but  when  f(x)  is  not  continuous  in  such  an  open  interval  on  the  right  of 
x1 ,  the  aggregate  of  derivatives  on  the  right  may  be  a  closed  discontinuous  set 
having  D+f(x)  and  D+f(x)  for  its  upper  and  lower  extremes.  A  similar  remark 
applies  to  the  case  of  the  derivatives  on  the  left. 

It  is  frequently  convenient  in  this  general  case  to  speak  of  the  derivatives 
of  f(x)  on  the  right,  and  on  the  left,  as  existent,  but  indefinite  in  value  :  and  in 
this  case  D+f(xl),  D+f(x^)  are  regarded  as  the  limits  of  indeterminancy  of  the 
derivative  on  the  right,  and  D~f(xl),  D_f(x1)  as  those  of  the  derivative  on 
the  left. 

The  definitions  which  have  been  given  for  the  case  in  which  the  domain 
of  the  function  is  continuous  are  applicable,  without  essential  change,  to  the 
case  in  which  the  domain  is  any  perfect  set  of  points.  At  a  point  of  the  set 
which  is  a  limiting  point  on  both  sides,  there  exist  in  general  the  four  extreme 

*  Acta  Mathematica,  vol.  v.  The  same  limits  were  considered  by  Du  Bois  Keymond,  Programm, 
Freiburg,  1870,  also  Miluch.  Abh.,  vol.  xn,  p.  125,  under  the  name  Unbestimmtheitsgrenzen. 
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derivatives  D+f(x),  D+f(x),  D~f(x),  D_f(x),  two  or  more  of  which  may  have 
equal  values  ;  and  at  a  point  of  the  perfect  set,  which  is  a  limiting  point  on  one 
side  only,  there  exist  of  course  only  the  two  derivatives  on  that  side.  If  the 
domain  be  any  closed  set,  the  derivatives  exist  only  at  those  points  which  are 
limiting  points  of  the  set. 

A  function  defined  for  a  perfect  set  may,  by  the  method  of  correspondence, 
be  correlated  with  a  function  defined  for  a  continuous  interval,  the  relative 
order  of  two  points  in  the  continuous  interval  being  the  same  as  that  of  the 
corresponding  points  in  the  perfect  set  (see  §163);  and  thus  all  properties  of 
derivatives  of  functions  defined  for  a  continuous  interval  have  their  analogues 
in  the  case  in  which  the  domain  is  any  perfect  set. 

EXAMPLES. 

1.    If/  (x)  =  x  sin  -  ,  /(O)  =  0  ;  we  have  ~~-,  —  -  =  sin  y  ,  and  for  arbitrarily  small  values 

of  h,  this  oscillates  between  1  and  -1.  The  function  /(#),  although  continuous  at  #  =  0, 
possesses  no  differential  coefficient  at  that  point  ;  in  fact 


2.  If/(#)  =  #2sin  -,  /(0)=0,  the  differential  coefficient  /'  (x)  exists  for  every  value  of 
x,  and  is  finite.    At  the  point  x  =  0,f  (#)  is  zero,  but  has  a  discontinuity  of  the  second  kind. 

3.  Let*  f(x)  =  ^x  (  1  +x  sin  -)  ,  for  x>0  ;  f(x)  -  -  \!^~x  (  1  +x  sin  -J  ,  for  x  <  0  ;  and 

\  XJ  \  x  ) 

/(0)=0.  In  this  case/"  (x)  everywhere  exists;  its  value  at  x  =  Q  is  +QO,  and  although  it 
has  a  finite  value  at  every  point  except  at  x  =  0,  it  oscillates  in  the  neighbourhood  of  that 
point  between  indefinitely  great  positive  and  negative  values. 

.4.t    The  function  defined  by  /  (a?)  *=#{!+£  sin  (log  #2)},  and/(0)  =  0,  is  everywhere  con- 
tinuous, and  is  monotone,  but  has  no  differential  coefficient  at  ,v  =  0. 

_^_ 
5.J    Let/(./J?)  =  e   #2sin-,  /(0)  =  0;    this  function  has  at  every  point  a   differential 

X 

coefficient,  and  this  is  continuous  at  #  =  0.  The  differential  coefficient  vanishes  at  #=0, 
and  at  an  infinite  number  of-  points  in  the  neighbourhood  of  x  =  0.  The  function  /'  (x), 
like  /(a?),  has  an  infinite  number  of  oscillations  in  a  neighbourhood  of  #  =  0. 


THE  DIFFERENTIAL  COEFFICIENTS  OF  CONTINUOUS  FUNCTIONS. 

261.  Let  us  suppose  that  a  continuous  function,  defined  for  a  continuous 
closed  interval,  is  such  that,  at  every  point  interior  to  an  interval  (a,  /3),  there 
exists  a  differential  coefficient ;  this  differential  coefficient  may  at  any  point 
have  a  finite  value,  which  may  be  zero,  or  it  may  have  an  infinite  value,  of  which, 

*  Diui,  Grundlagen,  p.  112. 

t  Pringsheim,  Encyklopadie  dcr  Math.  Wissensch.,  n  A.  i,  p.  22. 

J  Dini,  Grundlagen,  p.  313. 
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however,  the  sign  is  definite.  It  will  be  observed  that  f(x)  is  assumed  to  be 
continuous  at  the  points  a,  /3,  but  it  is  not  assumed  that  definite  derivatives 
exist  at  those  points.  It  will  be  shewn  that,  unless  the  function  be  constant 
throughout  (a,  ft),  there  exists  at  least  one  point  in  the  interior  of  (a,  ft)  at  which 
the  differential  coefficient  has  a  definite  value,  different  from  zero. 

Suppose  /(a),  /(/?)  to  be  unequal.  If  they  be  not  unequal,  and  the 
function  be  not  constant  throughout  (a,  ft),  we  can  replace  the  interval  (a,  ft), 
by  another  one  contained  in  it,  for  which  the  functional  values  at  the  ends 
are  unequal.  Let  us  consider  the  function 


F  (x)  =/(*)  -/(a)  -  |^  {/(£)  -/(«)}. 

F(a)  and  F(ft)  vanish,  and  F '(x)  is  continuous  in  (a,  ft),  and  has  a  differential 
coefficient  in  the  ordinary  sense  at  each  point,  with  the  possible  exception  of 
a  and  ft;  therefore  it  follows,  by  the  theorem  of  §  213,  that  there  is  at  least  one 
point  #!,  in  the  interior  of  (a,  ft),  at  which  F  (x)  is  a  maximum  or  minimum : 
this  is  the  case  even  if  F(x)  be  everywhere  zero  in  the  interval.  A  number  e 
can-  therefore  be  found  such  that  F '(x1  +  B)  —  F(xl),  F(xl  —  8)  —  F(x1)  have  the 
same  sign,  or  else  vanish,  provided  B  <  e  ;  and  consequently  the  derivatives  at 
#1}  on  the  right  and  left,  must  have  opposite  signs,  unless  both  of  them  be  zero; 
therefore  the  differential  coefficient  at  ac1}  which  must  exist,  must  be  zero.  It 

follows  that  /'  (x^  —  — -~ — ^--  =  0,  and  thus  the  point  xl  is  the  point  of 

p  —  a 

which  the  existence  was  to  be  proved.  From  this  theorem  we  deduce  the 
following  general  theorem : 

If  f(x)  be  continuous  in  the  closed  interval  (a,  b),  and  be  such  that  it  has  a 
differential  coefficient  at  every  point  in  the  interior  of  the  interval,  and  if  there 
be  in  (a,  b)  no  lines  of  invariability,  then  there  exists  in  (a,  b)  an  everywhere 
dense  set  of  points  at  which  the  differential  coefficient  has  finite  values  differing 
from  zero. 

This  is  proved  at  once  by  applying  the  foregoing  theorem  to  any  interval 
contained  in  (a,  b).  There  may  be,  in  (a,  b),  infinite  sets  of  points  at  which  the 
differential  coefficient  is  either  zero  or  infinite. 

262.  The  following  theorem,  known  as  the  mean  value  theorem  of  the 
Differential  Calculus,  has  been  established  by  the  proof  in  §  261 : 

If  the  function  f(x)  be  continuous  in  the  closed  interval  (a,  b),  and  if  the 
differential  coefficient  f  (x)  exist  at  every  point  interior  to  the  interval  (a,  b), 
being  either  finite,  or  infinite  with  fixed  sign,  then  a  point  a  +  0  (b  —  a)  exists, 
where  6  is  some  number  such  that  0  <  6  <  1,  for  which 


f(b)  =/(a)  +  (b  -  a)f  (a  +  6b-  a). 
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If  we  change  the  notation,  so  that  (x,  x  +  h)  is  the  given  interval,  the 
result  may  be  written 

f(x  +  h)  =f(x)  +  hf  (x  +  6h}. 

It  will  be  observed  that  no  assumption  is  made  as  to  the  existence  of  a 
derivative  on  the  right  at  the  point  a,  or  x,  or  as  to  the  existence  of  a 
derivative  on  the  left  at  b,  or  x  +  h.  It  has  only  been  assumed  that  the 
function  is  continuous  on  the  right,  and  on  the  left,  at  these  points  respectively. 
In  case  however  it  is  known  that  the  limit  of  /'  (x),  as  x  ~  a,  has  a  definite 
value  X,  then  f(x)  has  a  definite  derivative  on  the  right  at  the  point  a,  and  its 
value  is  X. 


For,  since  —     —  =^—  -  =f(a  +  Ox  —  a),  and  the  expression  on  the  right- 

CO  ~*™  Ot 

hand  side  converges  to  X,  as  x  -  a,  it  follows  that  lim   —  —  —         =  X. 


A  corollary  from  the  mean  value  theorem  is  that,  if  f(x  +  h)  =f(x),  then 
f  (x)  must  be  zero  at  one  point  at  least  in  the  interior  of  the  interval  (x,  x+h). 

The  following  theorem  may  be  proved  by  means  of  the  mean  value  theorem: 

If  f(x)  have  a  continuous  differential  coefficient  f  (x),  at  every  point  of  an 
open  interval  (a,  b),  then  f(x  +  h)  —f(x)  =  h{f'(x)  +  p  (x,  h)},  for  every  pair  of 
points  x,  x  +  h  of  the  open  interval  ;  where  p  (x,  h)  converges  to  zero,  as  h~Q, 
uniformly  for  all  points  x  of  any  closed  interval  (a,  0)  contained  in  (a,  b). 

Since  f(x  +  h)  —f(x)  =  hf  (x  4-  6h),  where  0  <  6  <  1,  for  any  pair  of  points 
x,x+h,  within  (a,i),and  since/'(#)  is  continuous  in  the  closed  interval  (a  —  hlt 
/3  +  Aj),  where  (a,  /9)  is  a  closed  interval  contained  in  (a,  b),  and  h^  is  so  small  that 
the  closed  interval  (a  —  h1}  j3  +  h^)  is  also  in  (a,  b),  we  have,  for  every  value  of  x 
in  (a,  /3),  /'  (x  +  6h)  —  f  (x)  \  <  e,  if  h  <  hl}  provided  h{  is  taken  sufficiently 
small.  This  follows  from  the  uniform  continuity  off  (x)  in  the  closed  interval 
(a  —  hl}  ft  +  hj.  We  may  therefore  take  /'  (x  +  6h)  -f'(x)  +  p  (x,  h);  where 
p  (x,  h)  <  e,  provided  h  <  h},  and  x  is  any  point  in  (or,  ft).  Since  e  is 
arbitrary,  p  (x,  h)  converges  uniformly  to  zero,  as  h  ~  0,  for  all  points  x  in  (a,  /3). 

263.    An  important  extension  of  the  mean  value  theorem  is  the  following: 

Iff(x)  be  continuous  in  the  closed  interval  (x,  x  +  h),  and  have  a  differential 
coefficient  at  every  point  of  the  interval,  with  the  possible  exception  of  the 
end-points;  and  if  F  (x)  be  another  function  which  is  also  continuous  in  the 
same  interval,  and  at  every  interior  point  has  a  differential  coefficient,  whilst  at 
the  end-points  there  may  be  no  definite  derivatives,  or  they  may  be  zero,  or 
infinite,  then,  provided  f'(x)  and  F'  (x)  have  no  common  zeros  or  common 
infinities  in  the  open  interval, 

j\x  +  h)-f(x)  =  f_(^±0h) 
F(x  +  h)-F(x)     F'(x+6h) 

for  some  value  of  6  such  that  0  <  6  <  1. 
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To  prove  the  theorem,  let 


and  let  it  be  assumed  that  F(x  +  h)  —  F(x)  is  not  zero.  Since  <£>  (x)  =  (f)(x  +  h), 
and  </>  (£)  satisfies  the  conditions  of  the  mean  value  theorem,  <£'  (£)  must  vanish 
for  some  value  x  +  0A,  of  f,  interior  to  the  interval  (#,  #  +  A)  ;  <£'  (£)  exists, 
since  f'(j;),F'(J;)  cannot  both  be  infinite.  We  have  then 


from  which  the  theorem  follows,  since  F'(x  +  0h)  and  f'(x+  Oh)  cannot  be 
both  infinite,  or  both  zero.    In  the  case  in  which  /(#  +  h)  =f(x),  we  have 


for  some  suitable  value  of  0;  and  then,  since  F(x  +  h)—F(x),  F'  (x  +  Oh)  are 
not  zero,  the  theorem  still  holds. 

264.  The  last  theorem  may  be  applied  to  obtain  a  proof  of  the  legitimacy, 
under  certain  conditions,  of  a  well-known  method  of  evaluating  limits  which 

0   oo 
appear  in  the  so-called  indeterminate  forms  ^,  —  . 

0   oo 

Let  the  two  functions  f  (x),  F(x)  be  both  continuous  at  all  points  interior  to 
the  interval  (a,  a  +  /3),  and  let  the  limits/  (a.  +  0),  F  (a  +  0)  both  exist,  and  be  zero; 
if  finite  differential  coefficients  f'(x),  F'  (x)  exist  at  every  interior  point  of 
(a,  a  +  #),  and  ifh(>0)  can  be  so  determined  that,  within  (a,  a.  +  h),f'(x)  and 
F  '  (x)  are  not  both  zero,  and  not  both  infinite  together  at  any  point,  then  if  the 
limit 


exist  as  a  definite  number,  or  be  infinite  with  a  fixed  sign,  the  limit 

" 


also  exists,  and  the  two  have  the  same  value. 

Moreover,  if  the  first  limit  has  no  unique  value,  the  two  limits  have  the  same 
limits  of  indeterminancy. 

The  two  functional  values  /(a),  F  (o.)  may  both  be  defined  to  be  zero,  and 
thus  the  functions  f(x\  F  (x)  are  continuous  in  any  interval  (a,  a  +  h),  where 
h  <  /3.  We  have  then,  from  the  extension  of  the  mean  value  theorem, 

f(a+h)_f(a+0h) 


where  0  is  some  number  between  0  and  1.     Since  Oh  converges  to  zero  when 
h  does  so,  the  theorem  follows  at  once  from  this  equality. 

H.  22 
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If  f  (x),  F  (x)  have  both  differential  coefficients  f  (x),  F'  (x)  at  all  points 
of  the  open  interval  (or,  a  +  /3),  and  if  /(a  +  0),  F(a  +  0),  both  exist,  and 
are  infinite  with  a  fixed  sign;  then,  provided  h(>0,  £  /3)  can  be  so  determined 
that  f  (x),  F'  (x)  are  not  both  zero*,  or  both  infinity,  at  any  point  in  the  open 

interval  (a,  a  +  h),  if  lim  J~-  -  =-  exists,  as  a  finite  number,  or  is  infinite  with 
T,~O  f  (a+n) 

a  fixed  sign,  also    lim  "^  --  r^  exists,  and  the  two  have  the  same   value. 
h~o  Jf  (a+h) 

Moreover,  if  the  first  limit  has  no  unique  value,  the  limits  of  indeterminancy  of 
the  two  limits  are  the  same. 

Consider  the  interval  (a  +  8lt  a  +  82)  interior  to  (a,  a  +  h);  we  have  then 

/  (a  +-8.J  -/(«  +  &)  =/'(«  +  S.) 

f  '  (a  +  82)  -  F(a  +  8,)     F'  (a  +  8,)  ' 

where  83  is  a  number  which  lies  between  ^  and  82;  it  being  assumed  that 
F  (a  +  S2)  =|=  F  (a  +  Sj)  4=  0.     This  equation  may  be  written  in  the  form 

, 


Taking  S2  as  fixed,  if  e  be  an  arbitrarily  small  positive  number,  a  positive 
number  8'  (<  S2)  can  be  so  determined  that  for  every  value  of  S1(>0)  which 
is  <&',  the  inequalities 


are  both  satisfied;  this  follows  from  the  fact  that  ^(a  +  0)  is  infinite  with 
fixed  sign. 

We  have  now 

.j^^,  +  (i  -a  /•(«+&) 

^(a  +  80     V^  >}F'(a+§sy 

where  j  r)   <e,  and    f  |  <e,  for  all  values  of  81  such  that  0  <  Bl  <  8';  where  82  is 
fixed,  and  B'  (<  82)  can  then  be  determined. 

If  we  assign  to  82  the  values  in  a  sequence  that  converges  to  zero,  8'  will 
have  the  values  in  a  similar  sequence. 

If  U,  L  denote  the  upper  and  the  lower  limits  of  -£777  -  TT  ,  as  h  ~  0,  we  see 

F  (a  +  h) 

that  4ir    +  <?\  lies  between  U  +  e  and  L  -  e,  provided  that  3.2  is  sufficiently 
f   (a  -I-  o3) 

small. 

*  This  generalization  of  the  conditions  to  he  satisfied  by  the  two  functions  in  these  theorems 
is  due  to  W.  H.  Young;  see  Proc.  Lond.  Math.  Soc.  (2),  vol.  vni,  p.  51.  For  a  history  of  these 
theorems,  seePringsheim,  Encycl.  d.  Math.  Wissensch.,  n  A.  i,  p.  26.  A  detailed  investigation  of  the 
theorems  is  given  hy  Stolz,  Grundzilge,  vol.  i,  p.  77  et  seq.  ;  Stolz  assumes  that  F  (x)  is  monotone 
in  the  interval. 
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Then  -  1   is  between  e  +  (1  +  e)(tf  +  e)  and  -  e  +  (1  -e)(Z  -e);  and 


therefore,  since  e  is  arbitrary,  any  value  of  lira  ~  --  .  -  is  in  the  interval 

k^o  J)  (a.  +  fi) 

(L,  U). 


If  U  =  L.  then  lim  ^-7  -  r\  has  the  unique  value  Km  4^  - 

' 


265.  Let  f(x),  F  (x)  be  both  zero  at  the  point  oc  =  xl,  and  let  us  suppose 
that  /'  (#1),  F1  (xj)  both  exist,  and  are  finite,  the  -latter  not  being  zero.  It  is 
unnecessary  to  assume  the  existence  of/'  (x},  F'  (x)  for  x  =|=  #j. 

From  the  definition  of/'  (a^),  Fr  (x^,  we  have 


F(xl  +  h)=h{F'(xl)+P'(h}\, 
where  p  (h),  p  (k)  converge  to  zero,  as  h  «*  0.    Thence  we  have 


Ffa+h) 
from  which  it  follows  that 


This  includes  the  case  in  which  /'(«i)  =  0,  ^'(i»i)  4=  0,  whe^n  the  limit  is  zero. 

Next,  let  it  be  assumed  that/(X),/'  («i),  •••f(n~1}  M  (see  §  270)  all  exist, 
and  have  the  value  0;  and  that  F(xl},  F'  (x^,  ...  .P^'-1)  (^)  all  exist,  and  have 
the  value  0.  Also  let  it  be  assumed  that  f(n)  (x^,  F(n']  (a?j)  exist,  and  are 
finite,  neither  of  them  being  zero.  It  will  also  be  assumed  that,  in  some 
neighbourhood  of  the  point  #1}  the  first  n  —  1  differential  coefficients  o 
and  the  first  n'  —I  differential  coefficients  of  F(x},  all  exist. 


We  have  then, 


I    /.\ 

=f(n]  (a?i);  since  f(n~*  (x,}  =  0. 


fc~0  « 

It  then  follows,  by  the  employment  of  the  theorem  of  §  263,  that 
,im/(+A)  _        fto  +  U)  f'(Xl  +  k) 


_]•  _ 

—  mil  j——  —  --  - 

^~o         nl  h  n\ 


oo ••> 
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Similarly,  we  have  lira  —    *  , — -  = ~-^;  and  thus  we  see  that,  since 

1^,0         h  n  \ 


V  l  n       -f 

lim  iv7~  rr(  =  °'  lf  n  <  n 

h~0  JF  (0,'j  +  fl) 

=  ±  00  ,    if  ft/  >  71 

=  /n)(>i)AF(n)<>i)>  if  w'  =  w- 
The  following  theorem  has  been  established: 

If  f(x},  F(x)  are  zero  when  x  =  xl,  and  have,  in  a  neighbourhood  of  that 
point,  differential  coefficients  of  the  first  n  —  I,  and  n'  —  1  orders  respectively, 
which  are  all  zero  at  the  point  x  =  a;1;  and  if,  moreover,  the  differential  co- 
efficients of  f(x),  F(x),  of  orders  n,  n,  respectively,  exist  and  are  finite,  at  the 

f  (x) 
point   x  =  xl,   both   of  them,  being  different  from  zero,  then  lim  ~L  ,     has  a 


x  ~  x, 


Unique  value  0,  "LjA  -£  ,  ±  oo  ;  according  as  n  <  n,  n  =  n,  or  n  >n;  the  sign  of 
*    \xi) 

±  oo  being  that  offn  (x1)/Fn'(xl). 

If,  for  every  value  of  x^  in  an  open  interval  (a,  jB),f'(x},  F'  (x)  are  both 
continuous,  and  F'  (x)  ^  0,  we  see,  as  in  §  262,  that 


where  p  (x,  h),  p  (x,  h)  converge  to  0,  as  h  ~  0,  uniformly  for  all  points  x  in  a 
closed  interval  (a,  6)  contained  in  (a,  /3).    We  then  see  that 

*      f(x+h}-f(x}  f(x) 

fh  --  TV  —  TT?-\  converges  to  -fjrri 
F(x+h)-F(x)  F  (x} 

uniformly  in  the  interval  (a,  b),  provided  j  F'  (x)  \  exceed  some  fixed  positive 
number,  for  all  points  in  the  interval. 

266.  If  the  function  f(x)  have  a  discontinuity  of  the  second  kind  at  the 
point  a,  at  least  on  the  side  which  is  towards  the  interval  (a,  a  +  A),  but  the 
function  have  ajinite  differential  coefficient  at  every  point  of  the  interval  (a,  a.  +  h), 
except  at  the  point  a,  then  the  differential  coefficient  has,  in  any  arbitrarily  small 
neighbourhood  of  a,  every  finite  value. 

Let  X  be  any  fixed  number,  positive,  negative,  or  zero.  The  function 
f(x)  —  \x  has  a  finite  differential  coefficient  at  all  interior  points  of  (a,  a  +  h), 
and  it  has  a  discontinuity  of  the  second  kind  at  the  point  a.  If  8  (<  h)  be 
arbitrarily  small,  in  the  interior  of  the  neighbourhood  (a,  a  +  £),/(#)  —\x  has 
maxima  and  minima,  for  it  cannot  be  monotone  in  that  interval  ;  and  at  such 
points  /'  (x)  —  \  =  0.  Since  X  is  arbitrary,  it  thus  appears  that  in  the  neigh- 
bourhood of  a,  f  (x)  has  every  finite  value. 
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The  mean  value  theorem  /(a  +  h)  -/(a)  =  hf'(a  +  Oh},  where  0  <  0  <  1, 
affords,  information  as  to  the  existence  and  value  of  the  derivative  at  a,  on  the 
right,  provided  f(x)  satisfies,  in  a  neighbourhood  of  a  on  the  right,  the  con- 
ditions under  which  the  theorem  holds.  By  considering  both  sides  of  a, 
information  may  be  obtained  as  to  the  existence  of  a  differential  coefficient 
at  a. 

(1)    If  the  function  f  (x)  have  a  functional  limit  at  a  on  the  right,  then 


h 

has  a  definite  limit  for  h  ~  0,  either  finite,  or  infinite  with  fixed  sign,  and  this 
is  equal  to  that  of/'  (x).  It  follows  that,  in  this  case,  a  derivative  at  a  on  the 
right  exists,  and  is  either  finite,  or  infinite  with  fixed  sign. 

(2)  If  the  function  f  (x)  have  no  limit  at  a  on  the  right,  it  may  still 
happen  that  /'  (a  +  0h)  has  a  definite  limit  at  a  on  the  right,  because  a  +  Oh 
is  not  necessarily  capable  of  having  all  values  within  a  neighbourhood  of  a. 
In  this  case,  either  (i)  the  derivative  at  a  on  the  right  may  be  definite,  and 
lie  between  the  upper  and  lower  limits  of  /'  (x)  at  a  on  the  right,  or  it  may 
be  equal  to  one  or  other  of  those  limits  ;  or  (ii)  there  may  be  no  definite  de- 
rivative at  a  on  the  right,  but  D+f(a),  D+f(a)  may  have  different  values,  and 
these  are  certainly  both  finite  in  case  the  upper  and  lower  functional  limits 
of  f  (x)  at  a  are  both  finite. 

(3)  The  derivative  on  the  right  at  a  can  only  exist,  and  be  infinite,  (i)  if 
f  (x)  have  an  infinite  limit  on  the  right  at  a,  or  (ii)  if  it  have  an  infinite  upper 

limit  on  the  right  at  a.  In  either  of  the  cases,  (i)  and  (ii),  f  (x)  may  be  every- 
where finite  within  a  neighbourhood  of  a  on  the  right,  or  it  may  be  infinite  at 
some  points  in  such  a  neighbourhood. 

(4)  If  the  derivative  at  a  on  the  right  exist,  and  be  finite,  then  either 
(i)  f'(x)  has  a  definite  limit  at  a  on  the  right,  equal  to  the  derivative  at  a,  or 
(ii)  f  (x)  has  no  definite  limit  at  a  on  the  right,  but  a  sequence  of  points  can 
be  determined,  of  which  a  is  the  limiting  point,  such  that  the  values  of  /'  (x) 
for  points  of  that  sequence  converge  to  the  value  of  the  derivative  at  a.    At 
points  which  do  not  belong  to  the  sequence,  the  values  off  (x)  may  be  either 
finite  or  infinite. 

267.  Iff(x)  be  continuous  in  a  given  closed  interval,  and  have  at  every  point, 
with  the  possible  exception  of  an  enumerable  set  G,  a  differential  coefficient  of 
value  zero,  the  function  is  constant  throughout  the  ivhole  interval. 

At  the  points  of  G  we  may  suppose  it  to  be  unknown  whether  a  differential 
•coefficient  exists,  or,  if  one  does  exist,  what  values  it  has. 
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A  more  general  form  of  this  theorem  is  obtained  by  considering  not  the 
differential  coefficient,  but  any  one  of  the  four  derivatives,  thus : 

If  f(x)  be  continuous  in  a  given  closed  interval,  and  one  of  the  four  de- 
rivatives D+f(x),  D+f(x),  D~f(x),  D^f(x\  be  such  that  it  is  zero  at  every  point 
of  the  interval,  with  the  exception  of  points  belonging  to  an  enumerable  set  G,  at 
luhich  nothing  is  known  as  to  its  value,  then  the  function  is  constant  throughout 
the  interval. 

To  prove  the  generalized  theorem  for  the  case  of  the  function  D+f(x}> 
suppose  that,  if  possible,  f(x)  —/(or)  has  at  some  point  x^  a  value  different  from 
zero,  say  the  positive  value  p ;  and  let  <£  (x,  k)  denote  f(x)  —/"(a)  —  k(x  —  a). 
Then  </>  (a,  &)  =  0,  (f>(x1,  k)  =  p  —  k  (x^  —  a).  Choose  any  fixed  positive  number 

79  —  Q 

q  <  p,  then  <f>  (#: ,  k)  >  q,  provided  k  <  —     —  ,  or  say  k<K.    Since  <f>  (x,  k)  is 

$?j  ~~~  Ct 

continuous  in  (a,  6),  and  <f>(a,  k)  is  zero,  whilst  </>  (x1)  k)  >  q,  there  exists  an 
upper  limit  of  those  values  of  x  between  0  and  x1}  for  which  <£  (x,  k)  £  q, 
and  this  upper  limit  is  attained  for  some  value  £,  of  x,  which  is  such  that 
£  <  #1}  and  </>  (£,  k)  =  q.  Since  <£  (£  +  h,  k)  >  q,  provided  0  <  h  ^  x1  —  %,  we  see 

that,  since  —  — y — —      -  is  positive,  D+<j)  (£,  k)  is  positive,  if  it  be  not 

zero.  Now  if  ff  were  a  point  not  belonging  to  G,  the  value  of  D+<£  (f,  h)  would 
reduce  to  —  k ;  and  therefore  f  must  belong  to  G. 

The  number  q  being  fixed,  £  depends  only  on  k;  and,  corresponding  to  a 
given  value  of  £,  there  is  only  one  value  of  k ;  for 

*(&*)- *(&*")  "•(#-*)  <*-«). 

which  cannot  vanish  unless  k  =  k',  since  (f>  (a,  k)  is  zero,  and  therefore  <  q.  For 
a  given  value  of  k,  the  corresponding  number  of  values  of  f,  all  of  which 
necessarily  belong  to  G,  must  be  either  finite,  or  enumerably  infinite,  since 
every  part  of  an  enumerable  aggregate  is  either  finite  or  enumerable.  There- 
fore, to  each  value  of  k,  in  the  continuous  interval  (a,  K  —  (3),  there  corresponds. 
a  finite,  or  enumerable,  set  of  values  of  £,  and  it  would  hence  follow  that  the 
continuum  (a,  K  —  j3)  is  itself  enumerable,  which  we  know  is  not  the  case. 
It  has  thus  been  shewn  that,  for  no  point  can  f(x)  —f(a)  have  a  positive 
value  ;  and  similarly,  by  considering  f(x)  —f(a)  +  k  (x  —  a),  it  can  be  shewn 
that  f(x)—f(a)  can  nowhere  have  a  negative  value;  hence  f(x)=f(a} 
throughout  the  whole  interval  (a,  6).  The  case  in  which  one  of  the  other  three 
derivatives  vanishes,  except  at  points  of  G,  can  be  treated  in  a  similar  manner. 

A  continuous  function  can  exist,  which  is  constant  in  each  interval  con- 
tiguous to  a  non-dense  closed  set  G,  and  is  not  everywhere  constant,  provided 
G  is  unenumerable ;  but  if  G  be  enumerable,  the  function  must  everywhere 
have  the  same  constant  value. 
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The  following  theorem*  which  is  of  importance  in  the  theory  of  Integration 
will  now  be  established  : 

If  two  functions  be  each  continuous  in  a  given  closed  interval,  and  if,  of  one 
of  the  four  derivatives  it  be  known  that,  for  the  two  functions,  this  derivative  has 
equal  finite  values  at  each  point  of  the  interval,  with  the  exception  of  an  enu- 
merable set  of  points  at  which  nothing  is  known  as  regards  the  two  derivatives, 
then  the  two  functions  differ  from  one  another  only  by  a  constant,  which  must  be 
the  same  for  the  whole  interval. 

It  must  first  be  observed  that  the  proof  of  the  preceding  theorem  suffices 
to  shew  that,  if  D+f(x)  £  0,  at  every  point  of  (a,  b)  not  belonging  to  the  set  G, 
then  f(x)  —f(a)^  0,  for  every  point  x  of  the  interval.  Similarly,  if  D+f(x)  ^  0, 
everywhere  in  the  interval,  except  at  the  points  of  G,  then  f(x)  —f(d)  =  0,  at 
every  point  of  the  interval. 

If  now  fi(x),  /2  (x)  be  two  continuous  functions  such  that 

D+f1(x)  =  D+fs(x) 

at  every  point  of  (a,  b)  not  belonging  to  G,  let/(#)  =f(x1)  —f(x2).  If  e  be  an 
arbitrarily  small  positive  number,  then  for  any  point  x  not  belonging  to  G,  the 
condition 


IV 

is  satisfied  for  a  set  of  positive  values  of  h  which  are  arbitrarily  small.    Also 
we  have,  for  all  sufficiently  small  values  of  h, 


hence,  since  D+f,  (x)  =  D*f,  (x),  we  see  that  /(^  +  *)  -/(«)  >  _  ^  for  all 

ft 

values  of  h  belonging  to  some  set.  It  follows  that  D+f(x)  >  —  2e,  and  thence  f 
that  D+f(x)  =  0,  since  e  is  arbitrary.  By  interchanging  f±  (x)  andy*2  (x),  we  see 
that  D+  {—  f(x)}  =  0.  From  these  two  results  we  deduce  that  f(x)  —f(a)  =  0, 
and  that  /(a)  —  /(a?)  =  0,  throughout  the  interval  (a,  &);  therefore  /(#)  is 
everywhere  equal  to  f(a),  and  thus  the  theorem  is  established. 

268.    At  a  point  x,  at  which  the  continuous  function  f(x)  is  a  maximum, 
since,  for  a  sufficiently  small  neighbourhood  of  such  a  point  x,  the  differences 

f(x  +  h}-f(x},    f(x-h}-f(x} 

are  both  negative  or  zero,  for  all  points  x  ±  h  in  the  neighbourhood,  it  is  clear 
that  each  of  the  derivatives  D+f(x),  D+f(x}  is  either  negative  or  zero,  and 
that  each  of  the  derivatives  D~f(x),  D_f(x)  is  either  positive  or  zero.  In 
case  the  function  possess  definite  derivatives  on  the  right  and  on  the  left  at 

*  Scheeffer,  Acta  Math.,  vol.  v,  p.  283. 

t  It  is  erroneously  stated  by  Dini,  that  D+/(«)=0.     See  Grundlayen,  p.  275. 
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the  point  oc,  the  first  of  these  is  zero  or  negative,  or  possibly  —  oo ,  whilst  the 
second  is  zero  or  positive,  or  possibly  +  oo  . 

If,  at  the  point  x,  a  definite  differential  coefficient  exist,  it  must  consequently 
be  zero.  In  the  case  of  a  minimum  the  corresponding  statements  hold,  where 
the  positive  sign  takes  the  place  of  the  negative  one,  and  the  reverse.  The 
following  theorem  has  now  been  established : 

If  a  continuous  function  possess  a  differential  coefficient  at  a  point  x  at  which 
the  function  is  a  maximum  or  minimum,  then  the  differential  coefficient  at  x 
must  be  zero, 

If  a  function  f(x)  have,  at  ao1}  a  differential  coefficient  not  equal  to  zero, 
there  is  a  neighbourhood  of  xl  in  which  f(x)  is  monotone.  This  may  be  stated 
in  the  form  that,  corresponding  to  a  suitable  value  of  h,  a  number  k  can  be 
determined,  so  that,  to  each  value  of  y  in  the  interval  (yl  —  k,  y-^  +  k),  where 
2/j  ^f^j),  there  corresponds  one,  and  only  one,  value  of  x  in  the  interval 
(#!  —  h,  xl  +  fi ). 

FUNCTIONS  WITH  LINES  OF  INVARIABILITY. 

269.  A  continuous  function  may  be  such  that,  in  the  interval  (a,  b),  there 
exists  an  everywhere  dense  set  of  non-overlapping  intervals,  each  one  of  which 
is  a  line  of  invariability  of  the  function.  Within  each  interval  of  the  set,  the 
function  has  its  differential  coefficient  equal  to  zero  ;  it  therefore  follows  from 
the  theorem  in  §  267,  that  the  closed  set  of  points,  of  which  the  given  set  of 
intervals  is  the  complementary  set,  cannot  be  an  enumerable  set,  otherwise 
the  function  would  be  constant  in  the  whole  interval  (a,  b).  It  is  further  clear 
that  no  two  of  the  intervals  can  abut  on  one  another  ;  for  the  condition  of  con- 
tinuity of  the  function  at  their  common  end-point  would  ensure  that  the  values 
of  the  function  in  the  two  intervals  were  the  same,  and  thus  the  two  intervals 
would  really  belong  to  the  same  line  of  invariability.  It  follows  that  the  end- 
points  and  external  points  of  an  everywhere  dense  set  of  lines  of  invariability 
of  a  continuous  function  must  form  a  perfect  non-dense  set  of  points. 

That  a  continuous  function  with  an  everywhere  dense  set  of  lines  of  in- 
variability can  actually  exist  can  be  easily  shewn  as  follows : — Make  the  points 
of  a  non-dense  perfect  set  correspond  in  order  to  the  points  of  a  continuous 
interval  (a,  b),  then,  as  has  been  shewn  in  §  163,  the  correspondence  may  be 
such  that  the  whole  of  a  complementary  interval  of  the  perfect  set  corresponds 
to  one  point  of  the  continuous  interval.  If  a  continuous  function  be  defined 
for  the  continuous  interval,  we  may  define  a  new  function  which  has  at  each 
point  of  the  perfect  set  the  same  value  as  the  original  function  has  at  the 
corresponding  point  of  the  continuous  interval ;  and  since  all  the  points  of  a 
complementary  interval  of  the  perfect  set  correspond  to  the  same  point  of  the 
continuous  interval,  the  new  function  is  such  that  it  has  an  everywhere  dense 
set  of  lines  of  invariability. 
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EXAMPLES. 

1.   Take*  the  non-dense  perfect  set  defined  in  Ex.  1,  §  83,  by 

a?"a+jp"K""hF"K"' 

where  every  cn  is  either  0  or  2.    A  complementary  interval  has  as  its  end-points 

r  «g  T  •  •  •  +  o»-l  """  o^  >         "o  "I"  02  "r  •  •  •  "I  o  ,i  - 1    '    oil  > 
o  o  o  o       o  o  o 

which  may  be  denoted  by  («„,  &„).   Let  the  function /(#)  be  defined  as  follows : — Fora  point 
x  of  the  interval  (0,  1)  belonging  to  the  perfect  set,  let 


ft*\         1    f°l  A.°2  J-          -LC»-L         \    • 

•f(*)==2\2      2*  2s        V5 


when  x  is  in  the  interval  (a,,,  &„),  let  f  (^}=f  (av)=f  (bv}.  The  function  /(#)  so  defined  is 
continuous,  and  varies  from  0  to  1,  and  is  constant  in  each  of  the  intervals  (av,  &„)  com- 
plementary to  the  non-dense  perfect  set. 

2.  f  Let  the  numbers  in  the  interval  (0,  1)  be  expressed  in  a  scale  %=2m-l,  of  odd 
degree;  thus  x=  —  +  -i \  +  ...,  where  Q:£.ar<n,  and  the  number  of  digits  ar  is  finite  or 
infinite.  For  any  number  x  represented  in  this  manner,  for  which  all  the  a's  are  even 

integers,  let  f(x)  equal  -  (  —  H — 1-f ...);    In  case  any  of  the  &'s  are  odd,  let  ak  be  the 

'2  \wi      tn          J 

first  one  which  is  odd,  and  let  f(.v)  then  equal  ~  (-1  +  "|+...  +^]  +  \  (^f}  .    This 

2  \m      m?  mk~ij      2  \  mk  ) 

function  f  (x)  is  continuous,  and  varies  from  0  to  1 ;  for  an  infinite  set  of  points  it  has  no 
differential  coefficient,  and  for  all  other  values  of  x,  f  (.^)  =  0. 


THE  SUCCESSIVE  DIFFERENTIAL  COEFFICIENTS  OF  A  CONTINUOUS  FUNCTION. 

270.  If  a  continuous  function  f(x),  defined  for  the  closed  interval  (a,  b), 
have  at  every  point  a  differential  coefficient  f  (x),  which  is  itself  continuous 
throughout  the  interval,  the  function  f  (x)  ,may  itself  have  a  differential 
coefficient  f"  (so),  which  is  called  the  second  differential  coefficient,  or  the 
second  derivative,  of  f(x). 

The  second  differential  coefficient/'  (a^), of /(&•),  at  a  point  xl,  at  which/'  (x^) 

f'(xl  -f  h)  —  f  (x) 

exists  as  a  definite  number,  may  be  defined  as  lim  J      1 ( — - — ^— ^- ,  when  this 

h~o  h 

limit  exists,  as  a  finite  number,  or  is  infinite  with  a  fixed  sign.  It  is  not 
necessary  for  the  existence  of  the  second  differential  coefficient  f"  (x^),  that 
/'  (xl  +  h)  should  have  a  definite  value,  for  all  points  xl  +  h  in  a  neighbour- 
hood of  xl . 

*  Cantor,  Acta  Math.,  vol.  iv,  p.  386.    See  also  Scheeffer,  Acta  Math.,  vol.  v,  p.  289. 
t  Grave,  Comptes  Rendus,  vol.  cxxvn,  p.  1005. 
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It  may  happen  that  the  four  ratios 

\  {D+ffa  +  h)  -f  fa)},    \  {D+ffa  +  h)  -f  fa)}, 

\  {D-ffa  +  h)  -/fa)},     \  [D_ffa  +  h)  -ffa)}, 

all  have  one  and  the  same  limit,  finite,  or  infinite,  when  h  ~  0;  and  that  this 
should  be  the  case  is  sufficient  for  the  existence  of/"  fa),  as  here  defined. 

The  second  differential  coefficient/"^)  is  thus  defined  as  the  repeated  limit 
lim  lim  /(^  +  h  +  k)  -ffa  +  h)  -ffa  +  k)  +ffa)  ^ 

A~0  k~0  hk 

when  this  repeated  limit  exists;  it  being  assumed  that  lim  ^  ^  +  f  ~^  ^ 

fc~0  & 

has  a  definite  value. 

A  more  restricted  definition  of  /"  fa)  is  frequently  employed,  in  which  it 
is  assumed  that  f'(x)  exists  as  a  definite  number,  nob  only  at  xl}  but  in  a 
neighbourhood  of  x^  In  accordance  with  this  more  restricted  definition,  it  is 
not  sufficient  for  the  existence  of  f"  fa)  that  the  repeated  limit 

lim  lim  /(*.  +  h  +  k)  -ffa  +  h)  -ffa  +  k)  +ffa) 

A~0   k~Q  htC 

should  have  a  definite,  finite  value,  or  be  infinite  with  fixed  sign. 

For  the  sake  of  generality  the  less  restricted  definition  of  /"  fa),  given 
above,  will  be  here  employed. 

It  is  easily  seen  how  the  definition  may  be  extended  to  the  case  of  a 
differential  coefficient  f(n)  (x)  of  any  order  n. 

271.  In  case  the  function  f(x)  possesses  a  finite  differential  coefficient, 
of  order  n,  at  the  point  x  =  x^,  and  in  which  there  is  a  neighbourhood  of  that 
point  in  which  f'(x),f"  (x),  ...,f{n~1}  (x)  all  exist,  let 

ffa  +  h)  -ffa)  -  hf  fa)  -  h~f"  fa)  -  ...  -  (-^y,/<w-1)  fa) 

be  denoted  by  (f>(h).    Since  <j>(h),  </>'  (h),  ...,</>  {n~1}  (h)  all  exist,  and  are  zero, 
when  h  =  0,  it  follows  from  the  theorem  of  §  263  that 


lim  =  1  0  <»>  (0)  =  -,/»>  fa); 

h^o    hn        n\^  n\J 

hn 
and  then  that  </>  (h)  =  —  [f{n>  fa)  +  p  (h)},  where  p  (h)  converges  to  0,  as  h  ~  0. 

We  have  thus  the  following  theorem: 

Iff(x)  possesses  an  nth  differential  coefficient  at  x  =  xl  ,  and  if  the  first  n  —  1 
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. 
differential  coefficients  exist  everywhere  in  some  neighbourhood  of  the  point 

#  =  #1,  then 
f(x,  +  h)  =/<«,)  +  hf  (*,)  +  £f  (a,)  +  ...  +  --, 


where  Km  p  (h)  =  0. 
;»~o 

We  find,  by  employing  this  theorem,  that  when  f"  (x^)  exists,  and  /'  (x) 
exists  in  a  neighbourhood  of  xl} 

f(x,  +  h  +  k)  -f(x,  +  h)  -f(x,  +  k)  +/(aQ  =^,,  ^  +  ^  ^  ^ 

llK 

where  the  double  limit  lira  cr  (h,  k}  exists  and  has  the  value  zero. 

h~0 
k~0 

It  follows  that  the  double  limit  of  the  expression  * 

f(x,  +  h  +  k)  -f(x,  +  h)  -f(x,  + 


hk 

exists  and  is  equal  to/"^).    The  converse  does  not  in  general  hold. 

In  particular  the  limits 

f(x1  +  2h)-2f(xl+h)+f(x1)  f(Xl  +  h}+f(xl-h} 


exist  and  have  the  value  /"  (ao^. 

The  converse  of  this  does  not  hold;  for  either  of  these  limits  may  exist,  and 
yet  f"  (X)  need  not  exist,  nor  even  /'  (x^).    An  illustration  of  this  is  the  case 

of  the  function  defined  by  /(O)  =0,  /  (x)  =  x  sin2-,  for  #2>0;  at  the  point 
x=Q,f'(Q)  does  not  exist,  and  yet  lim 


272.    The  following  theorem,  due  to  Schwarz*,  is  of  fundamental  importance 
in  the  theory  of  Fourier's  series. 

If,  in  an  interval  (a,  0),  in  which  f  (x)  is  continuous, 


converge,  for  each  value  of  x  in  (a,  /3),  to  the  limit  zero,  for  h  ~  0,  then  the  function 
f(x)  is  a  linear  function  in  the  whole  interval,  and  consequently  f  (x},  f"  (x) 
everywhere  exist,  and  the  latter  is  everywhere  zero. 
Let  us  consider  the  function 


$  (X)  =  ±   \f(x)  -/(«)- 
V. 
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where  A;  is  a  constant.    The  function  <£  (x\  whichever  sign  be  takeny  is  con- 
tinuous in  (a,  ft),  and  vanishes  at  a  and  ft.    We  find  at  once 


and  therefore,  for  each  value  of  x  in  (a,  ft),  a  positive  number  e  can  be  found, 
such  that  <f>  (x  +  h)  —  2<£  (x)  +  ^>  (x  —  h)  is  positive  and  greater  than  zero,  for  all 
values  of  h  which  are  numerically  less  than  e. 

If  </>  (x)  can  be  anywhere  positive  in  (a,  ft),  there  must  be  a  point  xl  at 
which  it  has  the  greatest  positive  value,  and  this  point  is  neither  a  nor  ft, 
since  <j>  (a),  <£  (ft)  both  vanish.  If  77  be  sufficiently  small, 

<f>  (x,  +  77)  -  <f>  (as,)  ^  0,    £  fa  -  ri)  -  (/>  (x,)  ^  0, 
hence  </>  fa  +  77)  -  20  fa)  +  <j>fa-  11) 

is,  for  all  sufficiently  small  values  of  77,  either  negative  or  zero,  which  is  con- 
trary to  what  was  shewn  above.  It  follows  that  0  (x)  is  everywhere  negative 
in  (a,  ft),  arid  cannot  be  zero  except  at  a  and  ft. 

This  holds  whichever  sign  be  taken  in  defining  </>  (x).  Now  k*  (x  —  a)(x  —  ft) 
is  always  negative,  except  at  a  and  ft,  and  may  be  taken  to  have  its  numerically 
greatest  value  as  small  as  we  please,  since  k  is  at  our  choice.  It  follows  that 

f(x)  -f(«)  -|~"[/(/3)  -/(«)] 

r-'          ^* 

can  nowhere  in  the  interval  be  different  from  zero;  for,  if  at  any  point  it  had 
a  value  p,  by  choosing  k  such  that  k2  (x  —  a)  (x  —  ft)  is  numerically  everywhere 
<  p  ,  the  function  0  (x)  could  be  made  positive  at  the  point,  by  proper  choice 
of  the  ambiguous  sign.  It  has  thus  been  shewn  that/(#;)  is  linear  in  (a,  ft). 

273.  Schwarz's  theorem  can  be  extended  to  the  case  in  which  there  is  an 
enumerable  set  of  points  in  the  interval  (a,  ft},  at  which  it  is  not  known  that 
the  limit  in  question  exists,  or  is  zero,  provided  a  certain  condition  be  satisfied 
at  each  point  of  the  enumerable  set.  The  following  theorem  will  be  estab- 
lished : 

If,  in  an  interval  (a,  ft),  in  ivhichf(x)  is  continuous,  the  expression 

-  A) 


converge,  for  each  value  of  x  in  (a,  ft),  to  the  limit  zero,  for  h~0,  except  that, 
for  an  enumerable  set  of  points  G,  this  is  not  known  to  be  the  case,  then,  provided 
that  at  each  point  x,  of  G,  the  expression 


h 

converge  to  the  limit  zero,  for  h~Q,  the  function  f(x)  is  a  linear  function  in  the 
whole  interval  (a,  ft). 
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It  should  be  observed  that  the  condition 


is  certainly  satisfied  at  any  point  x  at  which  the  differential  coefficient  f  (x) 
exists,  and  is  finite. 

To  prove  the  theorem,  let  it  be  assumed  that 


has  a  positive  value  |),  at  some  point  xl  interior  to  (a,  /3);  and  let 
*  (*,  k)  =f(x)  -/(a)  1  1^  {/OS)  -  /(«)}  +  *  (*  -  «)2, 
where  k  is  a  positive  number.    We  have 


</>  (a,  k)  =  0,  (/>  (/3,  &)  =  k(@-  a)2,  and  <£  (^,  A?)  =  p  -F  k  (x,  -  a)2; 
and  hence,  provided 


the  number  <^(xl,k)  is  greater  than  <f>(/3,k),  and  than  <f)  (a,  k).  We  shall 
suppose  &  to  be  so  chosen  that  this  condition  is  satisfied;  it  then  follows  that 
<f>(x,k)  has  a  maximum  between  a  and  /3.  The  absolute  maximum  value  of 
(f>  (x,  k)  may  be  attained  once,  or  a  finite  number  of  times,  or  an  infinite 
number  of  times,  in  the  interval  (a,  /3). 

The  points  x  at  which  this  maximum  is  attained  have  an  upper  extreme 
x  (<  /3),  which  must  itself  be  a  point  at  which  the  maximum  of  <f>  (x,  k)  is 
attained,  as  is  seen,  in  the  case  in  which  x  is  an  upper  limit,  from  the  condi- 
tion of  continuity  of  the  function.  We  have  therefore 

0  (x  +  h,  k)  -  </>  (x,  k)  £  0,   and    </>(«-  h,  k)  -  <f>  (x,  k)  £  0, 
if  h  be  sufficiently  small;  from  which  we  conclude  that,  in  case 


,  ,    )  +  <f>(x-h,  k} 

lim  exist, 

fc.~o  h2 

its  value  is  SO.  It  follows  that  x  must  belong  to  G;  because  the  value  of  this 
limit  is  2k,  and  therefore  >  0,  for  any  point  which  does  not  belong  to  G.  Since 
x  is  a  point  of  G,  we  have 

('<£  (x  +  h,  k)  —  <f)  (x,  k)      (f>  (x  —  h,  k)  —  <£  (x,  k)\  _ 
lim  •<  —  -,  -  H  r  —  ?•  —  U  , 

h~o  I  A  h  } 

and  since  the  two  fractions  have  the  same  sign,  it  follows  that 

<j>(x  +  li,k)-4>(x,k')  ,.     4>(x-h,k)-4>(x,k) 

lira  -  —j—  =0,    and    lim  —  —  j—  =  0. 

7t~0  *  7(~0  'l 
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From  this  result  we  deduce  that 

r     f(x-h)-f(x) 
_        J_±  -  /    j  \  /  _ 


--  _  - 

h~o  h  h~o  -A 

To  each  value  of  k  in  the  interval  (0,  K  ),  there  corresponds  one  value  of  a;, 
and  it  is  impossible  that  the  same  value  of  x  can  correspond  to  two  different 
values  k1}  k2  of  k.  For  if  this  were  the  case,  we  should  have 

&!  (x  —  a)  =  &2  (x  —  a), 

and  therefore  k1  =  k2,  since  «>  o.  Now  it  is  impossible  that  the  set  of  points 
k,  interior  to  the  interval  (0,  K),  can  be  such  that  to  each  such  point  there 
corresponds  a  distinct  point  x  belonging  to  the  enumerable  set  G.  We  con- 
clude that  it  is  impossible  that  • 

f(x)  -/(a)  - 


can  have  a  positive  value  p  at  any  point  oc^  of  the  interval  (a,  y8);  and  it  can 
be  shewn  in  a  similar  manner  that  there  can  be  no  negative  value  of  the 
same  function  in  the  interval.  It  follows  that  the  function  must  everywhere 
be  zero,  and  therefore  that  f(x}  is  linear  in  the  interval  (a,  /S). 

274.  Let  us  suppose  that  the  continuous  function  f(x)  is  linear  in  each 
interval  contiguous  to  an  enumerable  closed  set  G,  contained  in  (a,  /3),  and  that 
it  possesses  everywhere  in  the  interval  (a,  /9)  a  finite  differential  coefficient. 

In  this  case  both  the  conditions  of  the  theorem  of  §  273  are  satisfied,  and 
therefore  the  function  is  linear  in  the  whole  interval  (or,  /3)  ;  we  have  then  the 
theorem: 

lff(x)  be  a  continuous  function  possessing  everywhere  in  the  interval  (a,  /3) 
a  finite  differential  coefficient,  and  the  function  be  linear  in  each  one  of  an 
everywhere  dense  set  of  intervals  complementary  to  an  enumerable  closed  set  of 
points  G,  thenf(x)  is  a  linear  function  in  (a,  {3).  • 

If  the  closed  set  of  points  G  were  unenumerable,  the  preceding  reasoning 
would  no  longer  be  applicable,  except  that,  at  an  isolated  point  of  G,  it  would 
establish  that  the  linear  functions  in  the  two  intervals  which-  abut  on  one 
another  at  the  isolated  point  must  be  identical.  Confining  therefore  our  atten- 
tion to  the  case  in  which  G  is  a  perfect  set,  we  see  that  a  continuous  function 
possessing  everywhere  a  finite  differential  coefficient  may  exist,  which  is  linear 
in  each  sub-interval  complementary  to  a  non-dense  perfect  set  of  points  con- 
tained in  the  interval  for  which  the  function  is  defined,  and  yet  the  function 
need  not  be  linear  in  the  whole  interval. 

The  existence  of  such  functions  will  be  effectively  established  in  Chapter  vil, 
where  it  will  be  shewn  that  they  may  be  obtained  by  the  integration  of  con- 
tinuous functions  which  have  an  everywhere  dense  set  of  lines  of  invariability. 
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OSCILLATING   CONTINUOUS   FUNCTIONS. 

275.  Let  us  suppose  that  the  continuous  function  f(x)  has  no  lines  of 
invariability  in  the  interval  (a,  ft),  and  that  everywhere  in  this  interval  it  has 
a  finite  differential  coefficient.  If,  within  (a,  ft),  there  be  a  maximum  or  mini- 
mum of  f(x),  then  at  such  a  point  f'(x),  which  exists  and  is  finite,  must  be 
zero.  If  the  maxima  and  minima  in  (a,  ft)  be  everywhere  dense,  then  f'(x) 
vanishes  at  every  point  of  the  everywhere  dense  set;  and  if  f  (x)  were  con- 
tinuous throughout  (a,  (3)  it  would  follow  that  it  was  everywhere  zero,  which 
would  be  contrary  to  the  hypothesis  that  (a,  ft)  is  not  a  line  of  invariability. 

It  follows  from  this  that,  if  in  an  interval  (a,  ft),  which  contains  no  lines 
of  invariability  of  the  continuous  function  f(x),  the  differential  coefficient  f  (x) 
everywhere  exists,  and  is  continuous,  there  must  be  in  the  interval  an  everywhere 
dense  set  of  sub-intervals  in  each  of  which  the  function  is  monotone. 

We  have  further  the  following  theorem  : 

Iff(x)  be  continuous  in  (a,  ft),  and  have  no  lines  of  invariability,  but  have 
an  everywhere  dense  set  of  maxima  and  minima,  there  must  be  in  the  interval 
an  everywhere  dense  set  of  points  at  each  of  which  f  (x)  either  does  not  exist,  or 
does  exist  and  is  discontinuous. 

A  continuous  function  f(x)  which,  in  a  given  interval  (a,  ft),  has  no  lines 
of  invariability,  but  has  an  everywhere  dense  set  of  maxima  and  minima,  is 
said  to  be  a  continuous  function  which  is  everywhere  oscillating  in  the  interval 
(a,  ft).  Such  a  function  cannot  have  a  differential  coefficient  which  is  con- 
tinuous throughout  the  interval. 

The  continuous  functions  which  are  everywhere  oscillating  in  an  interval 
may  be  divided  into  two  classes. 

(1)  The  function  may  be  such  that,  if  the  constants  I,  m  be  properly 
chosen,  the  function  f(x)  +  Ix  +  m  is  monotone  in  the  interval.    In  this  case 
f(x)  is  expressible  as  the  difference  of  two  monotone  functions,  and  thus 
belongs  to  the  class  of  functions  with  bounded  total  fluctuation.    These  func- 
tions may  be  said  to  be  of  the  first  species,  or  to  be  functions  with  removable 
oscillations. 

(2)  Such  functions  as  do  not  belong  to  (1)  may  be  said  to  be  of  the  second 
species,  or  to  be  functions  with  irremovable  oscillations. 

In  order  to  bring  to  light  the  essential  distinction  between  the  two  classes 
of  functions,  as  exhibited  by  the  properties  of  their  derivatives,  we  first  of  all 
remark  that,  if  D+f(x)  have  a  positive  lower  boundary  c,  for  all  points  x  in 
the  interval  (a,  ft),  then  at  each  point  f(x  +  h)  —f(x)  is  essentially  positive 
for  all  positive  values  of  h  which  are  less  than  some  number  8,  dependent  on 
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x]  hence  the  function  is  monotone  in  the  interval.  The  function  would  also 
be  monotone  in  case  the  specified  condition  were  that  D+f(x)  has  a  negative 
upper  boundary  for  all  values  of  x  in  (a,  /3). 

Now  suppose  that  D+f(x)  has  a  definite  negative  lower  boundary  in  (a,  /3)  ; 
let  this  be  —  c,  and  consider  the  function  <f>  (x)  =f(x)  +  lx  +  m,  where  l>c; 
we  have  then  D+<f>(x)  =  l  +  D+f(x)  =  l  —  c;  hence  the  function  <f>(x)  is  mono- 
tone in  (a,  /3).  Thus  /  (x)  is  expressible  as  the  difference  of  the  two  monotone 
functions  (f>  (x}  and  Ix  +  m.  Similarly,  if  we  had  taken  the  condition  that 
D+f(x)  has  a  definite  positive  upper  limit  c,  the  function  f(x)  +  lx  +  m,  where 
I  <  —  c,  could  be  shewn  to  be  monotone. 

It  is  clear  that,  instead  of  the  linear  function  Ix  +  m,  we  might  have  used 
any  continuous  differentiable  function  whose  differential  coefficient  was  >  c, 
or  <  —  c,  throughout  the  interval,  in  the  two  cases. 

The  argument  would  have  been  unaltered  if  it  had  been  assumed  that 
there  were  a  finite  or  infinite  set  of  lines  of  invariability  in  (a,  /3). 

It  has  thus  been  shewn  that  : 

If  the  continuous  function  f(x)  be  such  that  either  D+f(x)  has  a  negative 
lower  boundary  for  all  values  of  x  in  (a,  /3),  or  that  D+f(x)  has  a  positive  upper 
boundary,  then  all  maxima  and  minima  of  the  oscillating  function  f(x)  are 
removed  by  adding  to  f(x)  a  properly  chosen  linear  function,  and  thus  the 
function  is  of  the  first  species,  and  is  of  bounded  total  fluctuation. 

In  particular,  the  conditions  of  the  theorem  are  satisfied  if  the  derivative, 
on  one  side,  without  necessarily  having  a  definite  value  at  any  point,  be  such 
that  for  the  whole  interval  it  is  numerically  less  than  some  fixed  positive 
number. 

A  function,  such  that  for  a  given  interval, 

D+f(x)  \,     \  D+f(x)  ,     \  D~f(x)  |,      D.f(x) 

are  all  less  than  some  fixed  number,  is  said  to  be  a  function  with  bounded 
derivatives.  Such  a  function  has  a  bounded  variation  in  the  interval,  and  if  it 
be  everywhere  oscillating,  it  is  of  the  first  species. 

If  all  the  four  derivatives  are,  at  every  point,  less  than  the  positive  number 
If,  in  which  case  the  function  is  necessarily  continuous,  a  neighbourhood  of 
any  point  x  may  be  so  determined  that,  if  |  be  any  point  in  that  neighbour- 
hood, [/(£)—  f(x}  i  =  M.  %  —  x\.  Hence  the  fluctuation  in  the  neighbourhood 


By  the  Heine-Borel  theorem  all  the  points  of  (a,  b)  are  interior  to  a  finite 
number  of  the  intervals  B.  It  follows  that  the  interval  (a,  b)  may  be  divided 
into  a  finite  number  of  parts,  each  of  length  less  than  a  prescribed  number  d, 
such  that  the  sum  of  the  fluctuations  in  those  parts  is  ^  2M  (b  —  a).  Since  d 
is  arbitrarily  small,  it  follows  that  the  total  fluctuation  of  f(x)  in  (a,  b}  cannot 
exceed  2M(b  —  a). 
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It  is  not  sufficient,  in  order  that  a  function  may  be  of  limited  total  fluctua- 
tion, that,  at  each  point,  all  four  derivatives  are  finite;  but  it  has  been  shewn 
to  be  sufficient  that  their  values  in  the  whole  interval  should  be  bounded. 

276.  Let  us  suppose  that,  for  a  set  of  points  G,  everywhere  dense  in  (a,  6), 
progressive  and  regressive  derivatives  at  a  point  of  G  exist,  and  are  infinite, 
but  of  opposite  signs.  At  any  point  #0>  of  G,  a  neighbourhood  can  be  found, 
containing  #0,  such  that,  for  any  point  x  in  it,  f(x)—f(x^)  is  of  fixed  sign  for 
the  whole  neighbourhood,  and  is  never  zero,  except  when  x  =  XQ\  it  follows  that 
#„  is  a  proper  maximum,  or  minimum,  of  the  function. 

It  will  be  shewn  that,  in  any  interval  (a,  /3)  contained  in  (a,  6),  there  is 
an  infinite  number  of  points  at  which  the  function  has  the  same  value.  Let 
|  be  a  maximum  point  of  f(x),  within  (a,  /9),  and  let  (£  -  77,  f  +  e)  be  the 
greatest  interval  enclosing  f  ,  within  which  /(#)  -/(£)  is  negative;  suppose  that 
the  absolute  minimum  of  the  function  for  this  interval  is  in  (£  —  ??>  £);  taking 
a  maximum  point  £  in  the  interval  (£,  £  +  e),  then  in  (£  —  77,  £)  there  is  a 
point  £/  at  which  /((•/)  =/(|i),  since  /(£)  lies  between  the  greatest  and  least 
values  of  the  continuous  function  in  (£  —  77,  £)• 

Now  there  is  a  maximum  interval  (£1  —  771,  ^i+O  for  the  point  £x,  and 
this  lies  within  (£,  £  +  e)  ;  and  in  this  interval  we  may  as  before  find  a  maxi- 
mum point  £2,  such  that  a  point  £2'  also  exists  within  the  interval,  for  which 
/(£2)  =/(£,').  There  is  also  a  point  fa"  in  (£  -  77,  £),  such  that 


We  may  proceed  in  this  manner,  until  we  find  n  points 


such  that  /(^_o  =/(r»-i)  =  •  •  •  =/(Ci1}>- 

Now  let  £„  be  a  limiting  point  of  £1}  ^2,  ^3,  .  .  .  £n,  .  .  .  ;  and  let  ^w'  be  a  limit- 
ing point  of  I/,  |2',  •••»  and  £w"  be  a  limiting  point  of  £2",  |3",  ...;  then 


Thus  the  points  fw,  £/,  U',  ...  form  an  infinite  set,  in  (a,  /3),  at  which  the 
functional  values  are  the  same. 

The  points  £„,,  fw',  f^,",  ...  have  a  limiting  point  ^0,  at  which  the  functional 
value  is  the  same  as  for  the  set  itself;  therefore 


hence,  at  £0)  either  the  derivative  is  determinate,  and  equal  to  zero,  or  else  it  is 
indeterminate,  with  zero  lying  between  its  upper  and  lower  limits.  Thus  it 
has  been  shewn  that*: 

*  Konig,  Monatshefte  f.  Math.  u.  Physik,  vol.  i.    The  above  proof  is  that  given  by  Schoenflies, 
Bericht,  vol.  i,  p.  1(50. 

H.  23 
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If  a  continuous  function  have  an  everywhere  dense  set  of  points  at  which 
there  are  progressive  and  regressive  derivatives  that  are  infinite  and  of  oppo- 
site signs,  there  is  an  everywhere  dense  set  of  points  at  each  of  which  the 
derivative  is  either  indeterminate  or  else  zero.  Thus  a  continuous  function  can- 
not at  all  points  have  infinite  progressive  and  regressive  derivatives  of  opposite 
signs. 

If  we  apply  the  above  theorem  to  the  function  f(x)  —  ex,  where  c  is  a  pre- 
scribed constant,  then,  since  f(oc)  —  ex  has  an  infinite  derivative  at  the  same 
points  as  those  for  which  f(x)  has  an  infinite  derivative,  we  obtain  the  follow- 
ing theorem : 

If  the  continuous  function,  f(x),  have  at  an  everywhere  dense  set  of  points 
infinite  progressive  and  regressive  derivatives  of  opposite  signs,  there  is  an  every- 
where dense  set  of  points  at  each  of  which  the  derivative  either  has  the  pre- 
scribed value  c,  or  is  indeterminate,  and  such  that  c  lies  between  its  upper  and 
lower  limits. 

In  geometrical  language,  a  point  x,  at  which  there  are  infinite  progressive 
and  regressive  derivatives,  of  opposite  signs,  is  a  point  at  which  the  curve 
y  =f(x)  has  a  cusp,  with  its  tangent  perpendicular  to  the  a>axis.  At  a  maxi- 
mum the  cusp  points  in  the  positive  direction  of  the  y-axis,  the  progressive 
derivative  being  —  oo ,  and  the  regressive  derivative  4-  oc .  At  a  minimum 
the  cusp  points  in  the  negative  direction  of  the  y-axis,  the  progressive  deriva- 
tive being  +  oo  ,  and  the  regressive  derivative  being  —  oc  . 

PROPERTIES   OF   INCREMENTARY    RATIOS. 

277.    If  Xi  be  a  point  of  the  interval  (a,  b),  in  which  f(x)  is  defined,  the 

f(x\  _  f(x  \ 
function  --  — — ,  for  points  x  such  that  x1<x  ^b,  may  be  called  the  incre- 

CO  ~~~  $/ 1 

mentary  ratio  at  x\  on  the  right ;  it  may  be  denoted  by  /  (x,  x^ ;  and  in 
case  y  (a;)  be  a  continuous  function,  this  incrementary  ratio  is  also  continuous 
at  every  point  of  its  domain.  This  incrementary  function  has  an  upper  and 
a  lower  boundary  for  its  whole  domain  (xl<x^b)\  these  upper  and  lower 
boundaries  may  be  denoted  by  U(x^),  L(ac1),  and  either  of  them  may  be  finite 
or  infinite;  however  U  (xj)  can  only  be  infinite  with  the  positive  sign,  and 
L(x^  only  with  the  negative  sign.  U(xl),  L(xl]  being  regarded  as  functions 
of  acl}  defined  for  every  point  of  (a,  b),  except  the  point  b,  the  function  U(x1)  has 
a  finite  or  infinite  upper  boundary  for  its  whole  domain,  which  we  may  denote 
by  U',  and  the  function  L  (x^  has  a  finite  or  infinite  lower  boundary  for  its 
whole  domain,  which  we  may  denote  by  L.  There  exist  therefore  two  numbers 
U,  L,  which  may  have  the  improper  values  +  oo  ,  —  oc  respectively,  such  that 
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for  every  pair  of  values  of  xly  xz,  where  xz>xlt  always  lies  between  them,  or 
is  equal  to  one  of  them. 

The  incrementary  ratio  on  the  left  of  a  point  can  be  defined  in  a  similar 
manner;  and  we  thus  define  two  functions  U'fa),  L'faJ,  at  xlt  as  the  upper 
and  lower  boundaries  of  these  incrementary  ratios. 

It  is  easily  seen  that  U',  the  upper  boundary  of  U'(x)  in  the  interval  (a,  b), 
is  identical  with  U,  and  that  L',  the  lower  boundary  of  L'  (x),  is  identical  with 
L.  Thus  U,  L  are  the  upper  and  lower  boundaries  of  I(x1}  xz)  for  every 
possible  pair  of  points  (xl}  #2)  m  the  interval  (a,  b). 

278.    When  the  function  f  (x)  is  continuous  (bounded  or  unbounded)  in  (a,  b), 
U(x)  is  a  lower  semi-continuous  function,  and  L  (x)  is  an  upper  semi-continuous 
function.    Accordingly,  U(x)  and  L(x)  are  point-wise  discontinuous,  when  they 
are  not  continuous. 

If  U(xl)  is  finite,  xz  can  be  so  determined  that  U(x^  —  I  (xl,  x2)  <  |e;  and 
also  a  neighbourhood  (xl  —  17,  xl  +  tj),  of  x1}  not  containing  xz,  can  be  so  deter- 
mined that,  if  Xi  be  any  point  in  this  neighbourhood, 


for  I(x,x2)  is  continuous  with  respect  to  x,  at  x±.  We  have,  in  this  neigh- 
bourhood, 

/  (#/,  #2)  >f(®i,  #2)  —  i  e  >  U(x^)  —  e. 

It  follows  that  U(x-[)  >  U(xi)  —  e;  or  U  (x)  is  lower  semi-continuous  at  #,.  If 
U  (K\)  =  +  oo  ,  xz  can  be  so  determined  that  I  ((cl}  a?2)  >  N  +  J  e,  where  N  is  an 
arbitrarily  chosen  positive  number.  If  f(x^)  is  finite,  the  neighbourhood 
(#!  —  77,  #!  +  77),  of  xl}  may  be  determined  as  before;  and  then  I(x1',a;.2)>  N, 
and  thus  U(x^)  >  N',  which  is  the  condition  that  U  (x)  should  be  lower  semi- 
continuous  at  x1  .  If  f  (X)  =  +  oo  ,  the  neighbourhood  can  be  so  determined  that 
I  (x\>  #2)  >  N,  and  thus  U(x^)>N\  hence  U  (x)  is  lower  semi-continuous  at 
#j.  That  L(x)  is  upper  semi-continuous  can  be  proved  in  a  similar  manner. 

The  incrementary  ratio  I(x,x'\  where  f(x)  is  continuous  (bounded  or  un- 
bounded) in  the  closed  interval  (a,  b),  assumes  every  value  between  its  upper  and 
lower  boundaries  U  and  L,  for  a  pair  of  points  x,  x  interior  to  the  interval 
(a,  b). 

Let  k  be  a  number  between  U  and  L,  thus  U>k  >  L.  The  function  U(x) 
being  a  lower  semi-continuous  function,  the  set  of  points  at  which  U(x)£kis 
a  closed  set  6^;  and  similarly,  since  L  (x)  is  an  upper  semi-continuous  func- 
tion, the  set  at  which  L  (x)  =  k  is  a  closed  set»6r2. 

If,  at  any  point  a,  U(a)  =  L(a)  =  k,  the  function  has  the  value 

/(o)  +  A(«-o), 

for  all  points  x  in  the  interval  (a,  b).  Assume  first  that  no  such  points  a  exist; 
then  since  L(x}<  U  (x),  the  two  sets  Gl}  G2  can  have  no  points  in  common. 

23—2 
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Since  the  continuum  (a,  b)  cannot  be  the  sum  of  the  two  closed  sets  G1}  G2,  it 
follows  that  there  exists  at  least  one  point  which  belongs  neither  to  Gl  nor  to 
GS,  and  thus  at  which  U(d)>k>L (a).  If  a  be  a  point  at  which  this  condi- 
tion is  satisfied,  in  a  certain  neighbourhood  of  a  the  conditions 

U(x)  >  U(a)  -e,  L(x)<L  (a)  +  e 

will  be  satisfied;  and,  if  e  be  sufficiently  small,  U(x)  >  k  >  L  (x),  at  all  points 
in  this  neighbourhood.  Thus  each  point  a,  at  which  U  (a)  >  k  >  L  (a),  is  an 
interior  point  of  a  set  of  points  at  all  of  which  U  (x)  >k>  L  (x).  In  case  there 
is  a  point  a  such  that  U(a)  =  L(a)  =  k,  and  if  this  does  not  hold  for  any 
smaller  value  of  x  than  a,  it  is  clear  that,  in  the  interval  (a,  a),  the  upper  and 
lower  boundaries  of  I  (x)  are  U  and  L.  The  above  proof  applied  to  (a,  a)  shews 
that  in  this  interval  there  exists  a  set  of  points  at  which  U(x)  >  k  >  L  (x). 

At  any  such  point  x  interior  to  (a,  b),  we  can  choose  positive  numbers 
h1}  /*2  such  that 

71  /y»     />»    L 1  j    /»      |   "*^    /»    **v     /    { IP     IT    -\—    It     l 
I  w/j  \AJ  ^^  '^l/          ***   "^  -*•    v     )  *^      2/j 

where  x  +  h^  <  b,  x  +  h2  <  b.  Since  I  (x,as  +  h)  is  continuous  in  the  interval  of 
h,  of  which  Aj,  A2  are  the  end-points,  a  value  of  h  between  h^  and  h2  exists  such 
that  1  (x,  x  +  h)  =  k,  which  establishes  the  theorem. 

If,  at  every  point  in  (a,  b),  f(x)  have  a  differential  coefficient  (finite,  or  in- 
finite with  fixed  sign),  then  f'(x)  has  every  value  betiueen  its  upper  and  lower 
boundaries  in  the  interval  (a,  b). 

For,  if  k  be  any  number  between  U  and  L,  since,  in  accordance  with  the 
above  theorem,  a  pair  of  interior  points  a,  /3  exists,  such  that  /  (a,  ft)  =  k,  it 
follows  from  the  theorem  of  §  261  that  there  exists  a  point  as,  in  (a,  ft),  at  which 
f'(x)  =  k.  Since  k  is  any  arbitrarily  chosen  number  between  U  and  L,f'(x) 
takes  all  values  between  £7"  and  L.  It  is  clear  that  £7  and  L  must  be  the  upper 
and  lower  boundaries  of  /'  (x),  for  if  /'  (x)  >  U,  x  +  h  could  be  so  determined 
that  I  (as,  x  +  Ji)>  U,  which  is  impossible. 

PROPERTIES   OF   THE    DERIVATIVES   OF    CONTINUOUS   FUNCTIONS. 

279.  Let  f(x)  be  continuous  in  the  interval  (a,  b),  and  let  U  and  L  be  the 
upper  and  lower  boundaries  of  the  incrementary  ratios  above  defined.  Take 
(a,  /3),  any  interval  in  (a,  b),  and  consider  the  function 

<£  (x)  =f(x)  -/(a)  -  |^  [/(£)  -/(a)]. 

Since  <£  (a)  =  0,  <j>  (/3)  =  0,  unless  6  (x)  be  constant  through  (a,  (3),  there  must 
be  within  (a,  /3)  a  maximum  or  minimum  of  (/>  (x);  and  thus  at  least  one  point 
xl  exists  within  (a,  (3),  such  that 

</>  (a?i  ±  h)  -  </>  O'O  ^  0, 
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for  all  sufficiently  small  values  of  h,  or  else 

<£  (&\  ±h)-<j>  W  =  0,     «^ 
for  all  sufficiently  small  values  of  h.    At  such  a  point 


and  /  (X  -  h,  x^  ^  /  (a,  ft), 

or  else  /  (^  +  h,  x^  ^  I  (a,  ft), 

and  /  (#1  -  /«,  #1)  =  /  (a,  /3). 

If  0  (#)  have  an  infinite  number  of  maxima  and  minima  in  (a,  ft),  there  is, 
in  (a,  ft),  an  infinite  number  of  points  at  which  the  first  of  the  conditions  for 
<f>  (x)  holds,  and  also  an  infinite  number  at  which  the  second  holds.  If  there 
be  only  a  finite  number  of  maxima  and  minima  of  <f)  (x)  in  (a,  ft),  then  this 
interval  can  be  divided  into  a  number  of  portions  in  each  of  which  the  func- 
tion (j>  (x)  is  monotone;  and  in  any  one  of  these  portions  either 

/  (x  ±  h,  x)^I  (a,  ft), 
at  all  points  within  the  sub-interval,  or  else 

/  (x  ±  h,x)^I  (a,  ft), 

for  every  x  within  the  portion,  and  for  sufficiently  small  values  of  h.  We  see 
then  that  /  (a,  ft)  and  I  (x  ±  h,  x)  lie  between  U  and  L. 

Thus,  in  every  interval  (a,  ft)  contained  in  (a,  b),  in  which  <£  (x)  has  an  in- 
finite number  of  maxima  and  minima,  there  are  (1)  an  infinity  of  points  x  at 
which  I(x  +  h,  x),  for  all  sufficiently  small  values  of  h,  lies  between  L  and 
I  (a,  ft)',  and  (2)  an  infinity  of  points  at  which  the  same  is  true  of  I  (x  —  h,  x) 
(3)  an  infinity  of  points  at  which  I(x  +  h,x),  for  all  sufficiently  small  values 
of  h,  lies  between  U  and  I  (a,  ft);  and  (4)  an  infinity  of  points  at  which  the 
same  is  true  of  I  (x  —  h,  x). 

CG  —  '   Ci 

In  case  f(x)  -/(a)  -  ^37-  [/(£)  —  /(*)]  nave  only  a  finite  number  of 

maxima  and  minima  in  (a,  ft),  there  are  in  (a,  ft)  finite  intervals  such  that  all 
the  points  in  one  of  them  belong  to  both  the  sets  (1)  and  (2),  and  also  finite 
intervals  in  which  all  the  points  belong  to  both  the  sets  (3)  and  (4);  each  of 
these  sets  of  intervals  is  finite,  and  an  interval  of  one  set  is  followed  by  one  of 
the  other  set. 

The  number  L  being  the  lower  limit  of  the  function  L  (x)  in  the  interval 
(a,  6),  there  exists  a  point  xl  such  that  L  is  the  lower  limit  of  the  values  of 
L(x)  in  any  arbitrarily  small  neighbourhood  of  xl;  and  it  follows  that  in  such 
neighbourhood  of  xl  there  are  points  f  such  that  I(%  +  h,  %),  for  an  infinity  of 
values  of  h,  differs  from  L  by  less  than  a  prescribed  positive  number  e.  There- 
fore there  is,  in  (a,  b),  an  infinity  of  pairs  of  points  (or,  ft),  one  of  which  is  arbi- 
trarily near  xlt  such  that  /(«,  ft)  differs  from  L  by  less  than  e. 
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Similarly,  it  may  be  shewn  that,  in  (a,  6),  there  is  an  infinity  of  pairs  of 
points  a,  ft,  such  that  I  (a,  ft)  differs  from  U  by  less  than  the  prescribed 
number  e. 

If  U  or  L  be  infinite,  there  exists  an  infinity  of  pairs  of  points  such  that 
I  (a,  ft)  is  arithmetically  greater  than  a  prescribed  number  c,  and  has  the 
same  sign  as  the  infinite  U  or  L. 

We  can  consequently  choose  the  interval  (a,  ft)  so  that 

/  (or,  ft)  =  L  +  TJ,   or  else  so  that   7  (a,  ft)  =  U  -  77, 

where  TJ  <  e,  provided  U  and  L  are  finite.  If  one,  or  both,  of  U,  L  be  infinite, 
(a,  ft)  can  be  so  chosen  that  /  (a,  ft)  has  the  same  sign  as  U,  or  as  L,  and  is 
arithmetically  greater  than  a  prescribed  positive  number  c. 

We  have  now  obtained  the  following  results  : 

If  f(x}  be  a  continuous  function,  and  (a,  b)  be  the  whole,  or  a  part,  of  its 
domain,  to  which  U  and  L  correspond,  then  (1)  if  L  be  finite,  there  exists  in 
(a,  b)  an  infinity  of  points  for  which  D+(x),  D±(x)  both  lie  between  L  and 
L  +  e,  where  e  is  an  arbitrarily  prescribed  positive  number;  and  at  these  points 
D+(x)}  D+(x)  are  either  equal,  in  which  case  a  derivative  on  the  right  exists,  or 
else  they  differ  from  one  another  by  less  than  e:  (2)  if  U  be  finite,  there  exists  in 
(a,  b)  an  infinity  of  points  for  which  D+(cc),  D+(x)  both  lie  between  U  and  U  -  e; 
and  at  these  points  there  exist  derivatives  on  the  right,  or  else  D+(x),  D+(x)  differ 
from  one  another  by  less  than  e ;  (3)  if  U  or  L  be  infinite,  there  exists  an  infinity 
of  points  at  which  D+(x),  D+(x)  are  both  numerically  greater  than  an  arbitrarily 
great  number  c,  and  have  the  same  sign  as  the  U  or  L  which  is  infinite.  A  similar 
statement  holds  as  regards  the  derivatives  on  the  left. 

The  above  is  true  irrespectively  of  the  number  of  the  maxima  and  minima 
of /(&') ;  but  if  f(x)  have  in  (a,  b)  only  a  finite  number  of  maxima  and  minima, 
and  if  the  same  be  true  of  all  the  functions  f(x)  —  Ix  —  m,  obtained  by  the 
addition  of  a  linear  function,  then  there  exist  in  (a,  b)  finite  sub-intervals  such 
that  at  all  points  in  one  of  them  the  above  statements  hold  both  as  regards 
the  derivatives  on  the  right  and  as  regards  those  on  the  left.  The  numbers 
U  and  L  correspond  in  each  case  to  the  particular  sub-interval. 

It  will  be  observed  that  the  theorem  does  not  assert  the  necessity  of  the 
existence  of  points  at  which  a  determinate  derivative  on  the  right  or  on  the 
left  exists,  but  it  states  that  there  are  in  every  sub-interval  points  at  which 
the  difference  between  the  upper  and  lower  derivatives  on  one  side  is  less 
than  a  prescribed  arbitrarily  small  number,  or  else  at  which  both  such  de- 
rivatives are  arithmetically  greater  than  an  arbitrarily  fixed  large  number. 
There  are  therefore  certainly  points  in  every  sub-interval  at  which  there  is, 
so  to  speak,  an  arbitrarily  near  degree  of  approximation  to  the  existence  of  a 
finite  or  infinite  derivative  on  the  right,  and  also  points  at  which  the  same  is 
true  as  regards  derivatives  on  the  left. 
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280.  It  will  now  be  shewn*  that,  for  a  continuous  function,  of  which  (a,  b) 
is  the  whole,  or  a  part,  of  its  domain,  the  upper  boundary  of  each  of  the  four 
derivatives  D+f(x),  D+f(x),  D~f(x),  D_f(x)  for  all  values  of  x  in  (a,  b)  is  U, 
the  upper  boundary  of  the  incrementary  function  in  (a,  b),  and  that  the  lower 
boundary  of  each  of  the  four  functions  is  L.  If  U  and  L  be  both  finite  the 
function  belongs  to  the  class  of  functions  with  bounded  derivatives. 

A  function  with  bounded  derivatives  accordingly  satisfies  the  condition, 
that  for  every  pair  of  points  xl,  x2,  \f(xl}—f(x^)  <k\x^  —  x2  ,  where  k  is  a 
fixed  positive  number.  It  has  been  pointed  out,  in  §  275,  that  such  a  function 
belongs  to  the  class  of  functions  of  bounded  variation. 

It  is  clear  that  the  upper  boundary  of  each  of  the  functions  D+f(x), 
D+f(x),  D~f(x),  D_/(#)  is  a  number  which  cannot  be  greater  than  U.  Now 
since  it  has  been  shewn  that  points  exist  in  (a,  b)  such  that,  if  e  be  an  arbitrarily 
prescribed  number,  both  D+f(x),  D+f(x)  differ  from  U  by  less  than  e,  when  U 
is  finite,  and  are  arbitrarily  great  if  U  is  +  x ,  it  follows  that  U  is  in  either 
case  the  upper  boundary  of  D+f(x),  D+f(x).  In  a  similar  manner  it  can  be 
shewn  that  U  is  the  upper  boundary  of  both  D~f(x),  D_f'(x).  The  proof  that 
L  is  the  common  lower  boundary  of  the  four  functions  is  exactly  similar. 

Each  of  the  four  expressions  D+f(x),  D+f(x),  D~f(x\  D_/(#)  may  be 
regarded  as  a  function  defined  for  the  whole  domain  of  f(x),  except  at  one  of 
the  end-points ;  the  ordinary  definition  of  a  function  being  extended  so  far  as 
to  admit  infinite  functional  values  with  a  fixed  sign. 

If,  at  any  point  x0,  interior  to  (a,  b),  one  of  the  above  functions,  say  D+f(x), 
be  continuous,  either  in  the  ordinary,  or  in  the  extended,  sense  of  the  term  (§  219), 
then  at  that  point  the  other  three  functions  are  also  continuous,  and  are  equal  in 
value  to  D+f(x0),  and  thus  there  exists,  at  x0,  a  differential  coefficient. 

To  prove  this,  take  any  interval  (XQ  —  e,  cr0  +  e) ;  then  all  four  functions  have 
in  this  interval  the  same  upper  boundaries,  and  also  the  same  lower  boundaries. 
If  D+f(x0}  be  finite,  the  upper  and  lower  boundaries  of  D+f(x)  in  (x0  —  e,  x0  +  e) 
each  differ  from  D+f(x0)  by  less  than  a  number  77  depending  on  e,  in  such  a 
way  that,  as  e  is  indefinitely  diminished  to  the  limit  zero,  77  also  diminishes 
to  the  limit  zero.  Since  all  four  functions  have  the  same  upper  boundary  and 
the  same  lower  boundary  in  (xn  —  e,  x0  +  e),  the  upper  and  the  lower  boundaries 
of  each  differ  from  D+f(xQ)  by  less  than  rj,  and  rj  can  be  made  as  small  as  we 
please  by  taking  e  small  enough.  It  follows  that  all  four  functions  are  con- 
tinuous at  XQ,  and  that  all  four,  at  x0,  are  equal  to  D+f(x0);  and  thus  there 
exists  a  differential  coefficient  at  #„. 

In  case  D+f(Xo)  is  -4-  x  ,  e  can  be  so  chosen  that,  in  (x0  —  e,  xn  +  e),  D+f(x) 
is  everywhere  greater  than  an  arbitrarily  large  chosen  number  c,  and  the  upper 
and  lower  boundaries  of  each  of  the  four  functions  are  then  greater  than  c ; 

*  Du  Bois  Raymond,  Math.  Ann.,  vol.  xvi,  p.  119,  also  Scheeffer,  Acta  Mathematicu,  vol.  v,  p.  190. 
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by  taking  a  succession  of  values  of  c  which  increase  indefinitely,  and  con- 
sidering the  corresponding  sequence  of  values  of  e  which  converge  to  zero,  we 
see  that  each  of  the  functions  D+f(x),  D~f(oc),  D_f(x)  is  infinite  at  x0,  and  is 
continuous,  in  the  extended  sense  of  the  term,  at  that  point  ;  there  is  then  a 
differential  coefficient  at  #„  which  is  infinite,  and  of  definite  sign. 

It  follows  that,  if  it  be  knoivn  that  any  one  of  the  four  derivatives  is  every- 
uhere  continuous  in  an  interval,  there  exists  everywhere  in  the  interval  a 
differential  coefficient  in  the  ordinary  sense  of  the  term. 

281.  The  upper  and  lower  boundaries  of  any  one  of  the  four  derivatives  of 
a  continuous  function  are  the  same  in  the  open  interval  (a,  b)  as  in  the  closed 
interval  *. 

It  is  clear  that  the  upper  boundary  of  D+f(x)  in  the  open  interval  cannot 
exceed  that  in  the  closed  interval.  If  possible  let  it  be  less  by  some  positive 
number  c.  This  means  that  D+f(a)  exceeds  the  upper  boundary  in  the 
open  interval  by  c.  There  exists  then  no  point  interior  to  (a,  b)  for  which 
D+f(x)  >  D+f(a)  —  c.  Now  a  point  xl  can  be  so  determined  that  I  (x1}  a) 
differs  from  D+f(a)  by  less  than  ^e,  and  a  point  x.2  can  be  so  determined  that 
/(a?!,  #2)  differs  from  I(xl}  a)  by  less  than  £e,  and  therefore  from  D+f(a)  by 
less  than  e.  It  follows  that,  in  any  interval  which  includes  x1  and  x2,  the  upper 
boundary  of  D+f(x)  is  greater  than  D+f(a)  —  e  ;  and  if  e  <  c  this  contradicts 
what  has  been  shewn  above.  Hence  the  upper  boundary  of  D+f(x),  in  a£x<b, 
is  the  same  as  that  in  a<x<b. 

It  follows  as  a  corollary  that  D+f(a)  cannot  exceed  the  upper  limit,  on  the 
right,  of  D+f(x),  at  x  =  a. 

282.  f(x)  being  continuous  in  (a,  b),  the  continuous  function 

0  (*)  =f(x)  -f(a)-^  (f(b)  -/(a)), 

which  vanishes  at  a  and  b,  must  have  a  maximum,  or  a  minimum,  at  an  interior 
point  of  (a,  6).  At  a  maximum  a,  D+<f>(ot),  D+<f)  (a)  are  both  £  0,  and  D~<f>  (a), 
D_<£(ot)  are  both  =0,  the  inequalities  being  both  reversed  in  case  a  is  a 
minimum.  If  it  be  known  that,  at  every  interior  point  of  (a,  b),  there  is  no 
distinction  of  right  and  left  as  regards  derivatives,  so  that  D+  (/>(#)  =  D~fy(x) 
and  D+<$>(x}  =  D_$(x),  we  see  that  all  four  derivatives  must  vanish  at  the 
point  a,  and  thus  there  is  a  differential  coefficient  at  a,  which  has  the  value  0. 
We  have  thus  obtained  the  following  extension  -j*  of  the  theorem  of  the  mean: 

If  there  is  no  distinction  of  right  and  left  with  regard  to  the  derivatives  of 
the  function  f  (x)  (bounded  or  unbounded)  continuous  in  the  interval  (a,b),  there 
exists  a  point,  interior  to  the  interval,  at  which  a  differential  coefficient  exists 


. 
b  —  a 

See  W.  H.  and  G.  C.  Young,  Quarterly  Jl.  of  Math.,  vol.  XL,  p.  12.  f  Ibid.,  p.  10. 
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Thus        —         =/'  (a  +  6b^a),  where  6  is  such  that  0  <  6  <  I. 

0  "~™  Cb 

As  this  may  be  applied  to  any  interval  (a,  /3)  contained  in  (a,  b),  we  obtain 
the  following  theorem  : 

If  there  is  no  distinction,  as  regards  right  and  left,  between  the  derivatives 
of  the  continuous  function  f(x),  defined  in  the  interval  (a,  b),  there  exists  an 
everywhere  dense  set  of  points  in  (a,  b),  of  the  cardinal  number  of  the  continuum, 
at  which  f  (x)  exists  ;  and  f  (x)  has  every  value  between  its  upper  and  lower 
boundaries. 

J283.  The  derivatives  D+f(x),  D+f(x)  of  a  continuous  function  are,  at 
any  point  x0,  such  that  a  £  XQ  <  b,  either  both  continuous  on  the  right,  or  both 
of  them  have  a  discontinuity  of  the  second  kind,  on  the  right:  but  they  cannot 
have  ordinary  discontinuities  on  the  right 

A  similar  statement  holds  as  regards  the  continuity  or  discontinuity  of 
D~f(x\  D_f(x),  on  the  left. 

Suppose  that  D+f(x)  has,  at  the  point  x6,  a  limit  X,  at  XQ,  on  the  right  ; 
then,  if  8  be  a  prescribed  positive  number,  an  interval  (#0,  x0  +  e)  can  be 
found,  such  that  D+f(x),  for  every  point  of  this  interval,  except  x0,  lies 
between  A.  +  8  and  A,  —  B.  The  upper  and  lower  boundaries  of  each  of  the  four 
derivatives  D+f(x),  D+f(x),  D~f(x),  D^f(x),  for  any  interval  (x0  +  en  x0  +  e), 
where  ex  <  e,  must  all  lie  between  the  values  X  +  B,  X  —  8  ;  hence  the  upper 
and  lower  boundaries  of  D~f(x),  for  the  interval  (x0,  x0  +  e),  lie  between  these 
same  values,  the  function  D~f(x)  being  regarded  as  undefined  at  the  point 
#0;  and  these  upper  and  lower  boundaries  of  D~f(x)  are  the  same  as  those  of 
D+f(x),  D+f(x),  for  (#0,  oc0  +  €),  the  point  x^  being  included.  It  follows  that 
_D+/(#0),  D+f(x0)  both  lie  between  X+£  and  X  —  8;  and  as  this  holds  for 
every  value  of  8,  we  must  have 

D+f(x0)  =  D+f(x0)  =  \  =  \'; 

where  X'  denotes  the  limit  of  D+f(x),  at  x0,  on  the  right  ;  and  thus  D+f(x), 
D+f(x)  are  both  continuous  at  x0,  on  the  right.  If  X  —  +  oc  ,  then  in  the 
interval  (x0,  x0  +  e),  at  every  point  except  x0>  D+f(x)  >  c,  where  c  is  an  arbitrarily 
chosen  number  on  which  6  depends  ;  the  argument  then  proceeds  as  before. 

284.  A  continuous  function  f(x)  cannot  have,  at  every  point  of  a  whole 
interval,  a  single-valued  derivative  on  the  right,  which  is  everywhere  infinite  and 
of  the  same  sign. 

For  if/(#)  had  this  property  in  an  interval  (a,  /3),  so  also  would 
/(*)  -/(a)  -[/G8)  -/(a)], 


362  Functions  of  a  real  variable  [on.  v 

and  this  function  necessarily  has  a  maximum  or  minimum  within  (a,  ft),  which 
is  contrary  to  the  condition  that  it  has  a  derivative  on  the  right  which  is 
always  of  the  same  sign;  for  this  involves  the  condition  that  the  function 
must  constantly  increase  as  x  increases  from  a  to  ft. 

Let  us  now  suppose  that  the  continuous  function  f(x)  has,  at  all  points 
of  (a,  6),  single  valued  derivatives  on  the  right  (finite  or  infinite),  such  that, 
in  a  part  (a,  ft),  of  (a,  b),  this  derivative  is  continuous  at  least  on  one  side;  the 
function  f(x)  is  then  such  that,  at  an  infinite  number  of  points,  it  possesses  an 
ordinary  differential  coefficient. 

The  derivative  Df(x),  on  the  right,  cannot  at  all  points  of  (a,  ft)  be  infinite. 
For  if  we  take  a  point  xn  such  that  it  is  continuous  on  one  side,  in  the  extended 
sense  of  the  term  explained  in  §  219,  then  if  it  were  everywhere  infinite,  its 
sign  at  all  points  in  an  interval  on  the  one  side  of  xa  would  be  the  same;  but 
it  has  been  shewn  to  be  impossible  that,  everywhere  in  any  interval,  Df(x) 
should  be  infinite  and  of  constant  sign.  It  follows  that  there  are  points  in 
the  neighbourhood  of  x0  at  which  Df(x)  is  finite.  If  xl  be  such  a  point  in 
(a,  ft),  then,  since  Df(x)  is  continuous  on  one  side  at  x1}  an  interval  can  be 
found  at  all  points  of  which  it  is  finite,  and  also  continuous  on  one  side.  If 
(a,,  /3j)  be  such  an  interval  in  (a,  ft),  then  since  Df(x)  is  everywhere  finite  in 
it,  and  continuous  on  one  side  at  least,  it  is  a  point-wise  discontinuous  func- 
tion, if  it  be  not  continuous  in  (a1}  /5j);  and  there  must  therefore  be  an  infinity 
of  points  in  (alt  ft^  at  which  Df(x)  is  continuous.  At  such  points,  in  accord- 
ance with  §  280,  f(x)  has  a  differential  coefficient. 

285.  As  regards  everywhere  oscillating  functions,  the  following  remarks 
may  be  made. 

If  a  continuous  function  have,  in  every  neighbourhood  on  the  right  of  a 
point  ,r0,  an  infinite  number  of  maxima  and  minima,  there  is,  in  such  neigh- 
bourhoods, an  infinity  of  points  at  which  the  derivatives  on  the  right  are  nega- 
tive or  zero,  and  an  infinity  of  points  at  which  these  derivatives  are  positive 
or  zero.  It  follows  from  this,  that  none  of  the  derivatives  at  a  point  x,  on  the 
right  of  x0,  can  have  a  definite  limit,  as#  approaches  the  limit  #0,  unless  such 
limit  be  zero. 

In  particular,  if  at  all  such  points  x,  definite  derivatives  on  the  right  and 
on  the  left  exist,  these  derivatives  cannot  be  continuous  at  x0,  unless  the 
derivatives  at  XQ  'are  both  zero. 

If,  at  the  point  a?0,  and  at  every  point  in  the  neighbourhood  of  x0  which 
contains  an  infinite  number  of  maxima  and  minima,  a  differential  coefficient 
exist,  which  is  continuous  at  #•<,,  this  differential  coefficient  must  be  zero  at  x0 
and  at  an  infinity  of  points  in  the  neighbourhood  of  x0,  and  must  therefore 
itself  have  an  infinite  number  of  maxima  and  minima  in  the  neighbourhood 
of  xn. 
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If  a  function  f(x\  which  has  an  infinite  number  of  maxima  and  minima 
in  the  neighbourhood  of  #0,  have  at  #0»  and  in  its  neighbourhood,  differential 
coefficients  of  any  number  of  orders,  then  they  are  all  functions  with  an  infinite 
number  of  maxima  and  minima  in  the  neighbourhood  of  #0,  and  all  of  them 
vanish  at  x0,  except  that  the  one  of  highest  order  may  be  discontinuous  at  ac0, 
not  then  necessarily  vanishing  at  that  point.  If  differential  coefficients  of  all 
orders  exist,  they  must  all  vanish  at  #„;  and  such  a  function  is  incapable  of 
expansion  in  powers  of  x  —  #0  in  the  neighbourhood  of  x0.  An  example*  of  a 
function  of  this  kind  is 


(*-*•>• 


sn 


FUNCTIONS  WITH   ONE   DERIVATIVE   ASSIGNED. 

286.  If  two  functions,  defined  for  a  given  interval,  have  each  bounded  deriva- 
tives, and  if  the  two  functions  have  each  a  particular  one  of  their  four  derivatives, 
say  the  upper  derivative  on  the  right,  equal  to  one  another  at  every  point  which 
does  not  belong  to  a  set  of  points  E,  of  measure  zero,  then  the  two  functions  differ 
from  one  another  by  a  constant,  the  same  for  the  whole  interval. 

This  theorem^  differs  from  that  of  §  267,  in  that  the  functions  are  restricted 
to  be  such  continuous  functions  as  have  bounded  derivatives;  it  is  however 
more  general,  in  that  E  is  not  restricted  to  be  enumerable. 

Let  the  points  of  E  be  enclosed  in  the  interiors  of  intervals  of  a  set,  of 
which  the  total  length  has  the  arbitrarily  small  value  e.  To  each  point  P,  of 
E,  there  corresponds  an  interval  PP',  where  PP'  is  that  part  of  the  interval 
of  the  set  that  encloses  P  which  is  on  the  right  of  P;  these  intervals  PP'  may 
be  denoted  by  8'.  If  the  two  functions  /i  (x),  f.2  (x)  are  such  that,  at  a  point 

xlt  D+/I  oo  =  DVate).  ^  has  been  seen  in  §  267>  that  £+/O/>=°'  £>+/(y)^o> 

where  f(x)  denotes  /x  (x)  —f2  (x).    Since  f(x)  is  continuous  at  #1}  it  follows 
that  there  is  a  set  of  points  xl  +  h,  on  the  right  of  #1}  such  that 


if  we  suppose  h  to  have  the  greatest  value  for  which  this  holds,  the  interval 
(#15  xl  +  h)  is  an  interval  on  the  right  of  x1}  and  such  intervals  may  be  denoted 
by  8. 

Let  £  be  any  point  such  that  a  <  %  ^  b,  and  consider  the  interval  (a,  £). 
From  the  point  a  lay  off"  an  interval  8,  or  8',  according  as  a  is  not,  or  is,  a 
point  of  E;  from  the  end  of  this  interval  lay  off  another  interval  8,  or  8',  as 
the  case  may  be.  Proceeding  in  this  manner,  we  may  construct  a  Lebesgue 

*  Dini,  Grundlagen,  p.  814. 

t  Lebesgue's  Lemons  sur  V  integration,  p.  79. 
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^» 

chain  reaching  from  a  to  £  (see  §  78).  The  set  of  points  not  interior  to  the 
intervals  of  the  chain  is  a  closed  enumerable  set.  We  can  now  find/(£)  —f(a) 
as  the  sum,  or  limiting  sum,  of  the  differences  of  the  functional  values  at  the 
end-points  of  the  intervals  of  the  chain,  and  each  of  which  is  either  a  8,  or 
a  8'.  It  is  clear  that 


where  the  summations  refer  to  those  of  the  intervals  8,  8'  which  have  been 
employed  in  the  construction;  and  A  denotes  the  finite  upper  boundary  of 
\I(as1}a^)\,  for  every  pair  of  points  aclt  x2  in  the  interval  (a,  £);  and  this  is 
identical  with  the  upper  boundary  of  the  absolute  value  of  the  derivatives  of 
f(x)  in  the  interval.  Since  e  is  arbitrarily  small,  it  follows  that/(£)  .=/(«),  and 
therefore/,  (£)—  /2(f)  =/i(«)—  /2(«);  thus  the  theorem  has  been  established. 


THE    CONSTRUCTION    OF    CONTINUOUS    FUNCTIONS. 

287.  One  of  the  most  fruitful  methods  of  obtaining  continuous  functions 
which  exhibit  various  peculiarities  as  regards  the  existence  or  non-existence 
of  differential  coefficients  at  all  the  points,  or  at  sets  of  points,  of  their  domain, 
consists  of  denning  the  functions  by  means  of  series  specially  constructed  with 
a  view  to  the  purpose  in  hand;  this  method  will  be  explained  and  illustrated 
in  Vol.  n.  Broden,  Kb'pcke,  and  others,  have  however  given  direct  constructions 
for  continuous  functions,  which  illustrate  various  possibilities  in  relation  to  the 
existence  and  properties  of  derivatives. 

The  method  employed*  by  Broden  is  that  of  denning  a  continuous  function 
in  the  domain  (a,  6),  as  the  function  obtained  by  extension  (see  §  241)  of  a 
function  denned  for  an  enumerable  everywhere  dense  set  of  points  in  (a,  6), 
the  primary  points.  A  continuous  function  is  entirely  determinate  when  the 
functional  values  at  such  a  primary  set  of  points  have  been  assigned.  The 
necessary  and  sufficient  condition  that  a  function  defined  for  the  primary  set 
should,  by  extension  to  the  domain  (a,  b),  give  a  function  which  is  continuous 
in  that  domain,  is  that  the  primary  function  should  be  uniformly  continuous 
with  respect  to  the  unclosed  primary  domain.  To  prove  this,  let  {£}  denote 
the  set  of  primary  points,  and  [x]  the  set  of  secondary  points;  then  the  condi- 
tion that  the  function  /(£)  may  be  uniformly  continuous  with  respect  to  the 
domain  {£},  is  that,  if  ^  be  any  point  of  {£},  and  if  77  be  a  prescribed  arbitrarily 
small  number,  the  condition  |/(f )  — /(|V)  <  rj  be  satisfied  at  all  points  %  which 
are  such  that  f  —  ^  <  e,  where  e  is  a  number  dependent  on  77,  the  same 
for  all  points  £1(  of  (£}.  Now  assuming  that  this  condition  is  satisfied  let  xl 
be  a  secondary  point,  and  let  £1,  £2>  •••  fw  •••>  £1',  &'>•••  £»/•••  be  any  two 

*  Crelle's  Journal,  vol.  cxvui ;  see  also  Acta  Univ.  Lund.,  vol.  xxxni. 
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sequences  of  primary  points,  each  of  which  has  #,  as  its  limit;  we  have  to  shew 
that  each  of  the  sequences 


converges  to  the  same  number,  which  will  then  be  the  single  functional  value 
/(a?j).  Enclose  ^  in  the  interval  (x^  —  ^e,  #i  +  ^e);  then,  from  and  after  some 
particular  value  of  n,  all  the  points  of  both  sequences  of  values  of  £  lie  within 
this  neighbourhood.  Let  this  value  of  n  be  m,  then 


for  all  positive  integral  values  of  r;  hence  the  first  sequence  of  functional 
values  is  convergent,  since  77  is  arbitrary;  and  similarly  the  second  is  also 
convergent.  Also  for  every  77  there  is  a  definite  m  such  that 


hence  the  two  convergent  sequences  have  the  same  limit,  and  this  limit  defines 
/(tfj).  We  have  now  to  shew  that  the  single  valued  function  so  defined  ia 
continuous.  We  have 

|/(X)  -/(£)  !  <  77,  provided  \a^-^\<  |e, 
and  l/Oa)  -/(la)  i  <  *?»  provided    #s  -  £2  <  |e  : 

also  /(&)  -/(&)   <  17,  provided  |  &  -  £   <  e. 

Hence  it  follows  that  /(%)  —f(xi)   <  3??, 

and  this  holds  provided  \xz—  xl\<  2e, 

for  £1;  £2  can  be  taken  to  be  between  xl  and  #2;  and  therefore  f(x)  is  con- 
tinuous at  xlt  since  877  is  at  our  choice.  The  extended  /(#)  is  also  easily  seen 
to  be  continuous  at  any  primary  point.  It  has  now  been  proved  that  the  con- 
dition of  uniform  continuity  is  sufficient;  that  it  is  necessary  follows  from  the 
theorem  of  §  217. 

The  derivatives  at  any  point  depend  only  on  the  functional  values  at  the 
primary  points  in  the  neighbourhood  of  the  point.    For  let  xl  be  any  point, 

and  consider  the  limit  of  —     —  -  —  —  ,  when  x  has  any  sequence  of  values 

30—  Xi 

which  converge  to  #lt  A  set  of  primary  values  of  x  can  always  be  found, 
.  such  that  the  ratio  converges  to  the  same  limit,  when  x  has  the  values  of 
this  sequence  of  primary  points,  as  for  the  prescribed  sequence  consisting  of 
secondary  points,  or  of  both  primary  and  secondary  points.  For  a  primary- 
point  |  can  be  found,  corresponding  to  x,  such  that 

!/(*)  -/(«,) 


where  8  is  an  arbitrarily  small  number.    This  follows  from  the  fact  that 


is  a  continuous  function  of  x  at  every  point  except 
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288.  In  order  to  construct  monotone  continuous  functions,  the  values  of 
the  function  are  first  assigned  at  the  end-points  a,  b  of  the  interval,  then  at 
two  points  XQ,  #!,  where  XQ<XI;  then  at  four  points  x^,  xol,  xw,  xu,  where 


afterwards  at  eight  points 

^000  )     &Q01  )    ^010  >     ^011  >    ^100)     ^101  J     "^110  !     ^111  >     &C. 

lying  in  the  successive  intervals  measured  from  left  to  right,  into  which  (a,  b) 
was  divided  by  the  four  points;  and  so  on.  The  function  may  then  be  re- 
garded as  the  limit  of  a  sequence  of  continuous  functions,  each  of  which  is 
representable  as  a  polygon  obtained  by  joining  the  end-points  of  ordinates 
which  represent  the  functional  values  that  have  been  assigned  at  any  stage 
of  the  process. 

In  this  manner  Broden  has  constructed  a  monotone  continuous  function 
/(#),  which  is  such  that  it  has  derivatives  on  the  right,  and  on  the  left,  which 
are  everywhere  definite,  finite,  and  different  from  zero  ;  and  such  that  a  definite 
differential  coefficient  everywhere  exists,  except  at  the  everywhere  dense 
enumerable  set  of  primary  points. 

He  has  also  constructed  a  monotone  function  f(x)  which  is  such  that,  at 
the  everywhere  dense  enumerable  set  of  primary  points,  the  derivative  on  the 
left  exists,  and  is  zero,  and  the  derivative  on  the  right  exists,  and  is  positive; 
for  an  unenumerable  everywhere  dense  set  of  points  there  is  a  differential 
coefficient,  everywhere  zero,  and  for  another  such  set  of  points,  there  is  no 
definite  derivative  on  the  left,  but  there  is  a  positive  one  on  the  right. 

A  third  case  is  the  following  : 

f(x)  is  continuous,  monotone,  and  increasing  ;  at  an  everywhere  dense 
enumerable  set  of  points  the  derivative  on  the  left  is  zero,  and  that  on  the 
right  is  +  oo  ;  for  an  everywhere  dense  unenumerable  set,  both  derivatives  exist, 
and  are  positive;  for  another  such  set  both  exist,  and  are  zero;  fora  third  such 
set,  both  derivatives  exist,  and  are  +  oo  ;  for  a  fourth  such  set,  neither  derivative 
exists;  for  a  fifth  such  set,  the  derivative  on  the  left  is  zero,  and  that  on  the 
right  is  indefinite,  but  has  zero  for  its  lower  limit;  for  a  sixth  such  set,  the 
derivative  on  the  right  is  +  oc  ,  but  that  on  the  left  is  indefinite,  with  +  oo  for 
its  upper  limit. 

289.  For  the  construction  of  everywhere  oscillating  continuous  functions 
it  is  more  convenient  to  assign  successively  the  functional  values  at  sets  of 
points  proceeding  by  powers  of  3,  instead  of  2,  as  in  the  case  of  monotone  func- 
tions. In  this  manner  Broden  has  constructed  such  a  function  f(x),  which 
has  the  following  properties: 

At  an  everywhere  dense  enumerable  set  of  points,  the  derivative  on  the 
left  exists,  and  is  positive:  that  on  the  right  exists,  and  is  negative  (or  the 
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reverse),  this  set  corresponding  to  maxima  and  minima  of  the  function;  for  a 
certain  unenumerable  everywhere  dense  set,  there  is  a  differential  coefficient 
everywhere  of  the  same  sign;  and  for  another  such  set,  there  is  a  differential 
coefficient  which  is  zero;  for  a  third  such  set,  one  or  both  of  the  derivatives 
are  indefinite. 

Kopcke*  has  given  the  first  example  of  a  function  which  is  everywhere 
oscillating,  and  yet  has  at  every  point  a  definite  differential  coefficient,  thus 
confirming  the  conjecture  of  Dini  that  such  functions  can  exist;  and  Brodenf 
has  also  constructed  such  a  function.  A  general  theory  of  such  functions  has 
been  givenj  by  Schoenflies.  The  method  adopted  by  Kopcke  is  to  construct 
the  function  as  the  limit  of  a  succession  of  polygons  of  which  the  sides  are 
circular  arcs.  Everywhere  oscillating  functions  have  also  been  studied  by 
Steinitz§.  A  detailed  account  of  all  the  special  cases  treated  of  by  these 
writers  would  require  much  space ;  reference  can  therefore  only  be  made  to  the 
original  memoirs.  A  simplification  of  Kb'pcke's  construction,  due  to  Pereno, 
will  be  given  in  Vol.  II. 

290.  A  function  f(oc)  which  is  of  such  a  character  that  it  can  be  repre- 
sented approximately  by  a  graph  that  exhibits  all  the  peculiarities  of  the 
function,  so  that  y=f(x)  is  the  equation  of  a  "curve,"  in  the  ordinary  sense 
of  the  term,  must  satisfy  the  following  three  conditions: 

(1)  The  function  must  be  continuous  everywhere,  with  the  possible  ex- 
ception of  a  finite  number  of  points,  at  which  it  may  have  ordinary  discon- 
tinuities. 

(2)  It  must  be  differentiate,  except  that  there  may  be  a  finite  number  of 
points  at  which  no  differential  coefficients  exist,  but  at  which  definite  deriva- 
tives on  the  right  and  on  the  left  exist. 

(3)  It  can  have  only  a  finite  number  of  maxima  and  minima;  and  the 
same  must  hold  of  every  function  obtainable  by  the  addition  of  a  linear  func- 
tion to  the  one  in  question.    This  condition  may  be  expressed  in  the  form, 
that  the  function  must  be  in  general  monotone  with  reference  to  every  possible 
axis  which  may  be  employed  for  the  measurement  of  abscissae. 

A  function  which  satisfies  these  conditions  may  be  characterised ||  as  an 
ordinary  function.  As  has  been  already  indicated,  there  exist  functions  which 
satisfy  the  conditions  (1)  and  (3),  but  do  not  satisfy  the  condition  (2).  Again, 
there  exist  functions  which  satisfy  the  conditions  (1)  and  (2),  but  not  the 
condition  (3). 

*  Math.  Ann. ,  vol.  xxix,  p.  123 ;  vol.  xxxiv,  p.  161 ;  vol.  xxxv,  p.  104.  See  also  Pereno,  Giorn.  di 
Mat.,  vol.  xxxv,  p.  132. 

t  Stockholm  Vet.  Ah.  Off.,  1900,  pp.423  and  743. 

£  Math.  Ann., .vol.  LIV;  also  bis  Bericht,  vol.  r,  p.  164. 

%'Math.  Annalen,  vol.  LII.  |;  Du  Bois  Reymond,  C relic's  Journal,  vol.  LXXIX,  p.  32. 
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GENERAL   PROPERTIES   OF   DERIVATIVES. 

291.  A  number  of  important  properties  of  the  derivatives  of  a  continuous 
function  have  been  investigated  in  §§  279  —  286.  Some  of  the  most  interesting 
properties  of  the  derivatives  of  a  function  hold  for  functions  that  are  not  re- 
stricted to  be  continuous,  and  are  either  quite  general,  or  are  only  restricted  to- 
belong  to  the  wide  class  of  measurable  functions.  It  will  in  general  be  assumed 
that  a  function  f(x),  defined  for  any  interval  of  x,  either  finite  or  indefinitely 
great,  has  a  definite  finite  value,  for  each  value  of  x,  whether  the  function  be 
bounded  or  not. 

If  f(x)  be  defined  in  a  finite,  or  an  unbounded,  interval,  and  k  be  any  fixed 
number,  the  set  of  points  x,  at  each  of  which  the  conditions  D+f(x)  ^  k,  D~f(cc}  <  k 
are  satisfied,  is  enumerable*. 

'  Let  G  be  the  set  of  points  at  which  D+f(x)  ^  k;  and  let  {en},  \rjn]  be  two 
monotone  sequences  of  positive  numbers  that  converge  to  zero.  If  £  be  a  point 
of  G,  and  h  be  a  sufficiently  small  positive  number,  /(£,  %  +  h)  ^k  —  rjn.  All 
the  values  of  h  for  which  this  condition  is  satisfied  have  an  upper  boundary  h, 
at  which  the  condition  may,  or  may  not,  be  satisfied.  Let  f  =  £-f  \h,  then  we 
have,  for  such  a  point  £,  of  G,  a  definite  interval  (£,  \  ),  at  every  point  of  which 
the  condition  /  (f  ,  x)  =  k  —  rjn  is  satisfied.  We  now  take,  for  the  point  f  ,  the 
interval  8$  on  its  right,  of  length  equal  to  the  smaller  of  the  two  numbers 
en,  |  —  £,  so  that,  in  this  interval,  both  the  conditions 


are  satisfied.  Let  An  be  the  set  of  all  such  intervals  Sf,  when  every  point  £,. 
in  G,  is  considered;  and  let  A  be  the  set  {Aw}  of  all  such  intervals,  when  all 
values  of  n  are  taken  into  account.  There  exists  at  most  an  enumerable  set 
H,  of  the  points  of  G,  that  are  not  interior  points  of  any  interval  of  A  (see 
§  72).  If  £'  be  a  point  of  G  —  H,  it  is  interior  to  an  interval  (£,  77),  of  Aw, 
whatever  value  of  n  is  taken.  It  follows  that  /  (£,  £')  ^  k  —  ?;n.  As  n  is  in- 
definitely increased,  £  will  converge  to  £',  and  rjn  converges  to  0;  it  is  thus 
seen  that  D~  f(%")  =  k.  This  result  holds  at  every  point  £',  of  G,  that  does  not 
belong  to  the  enumerable  set  H,  and  thus  the  theorem  has  been  estab- 
lished. 

It  is  clear  that  the  corresponding  result  holds,  that  the  set  of  points  at 
which  D_/(»  =  k,  D+f(x)  <  k  is  enumerable. 

If  f(x)  be  changed  into  —/(#),  and  k  into  —  k,  we  see  that  the  set  of  points 
at  which  D+f(x}  ^  k,  D_f(x)  >  k  is  enumerable,  and  that  the  set  of  points  at 
which  D~f(x)  =  k,  D+f(x)  >  k  is  also  enumerable. 

*  See  G.  C.  Young,  A  eta  Math.,  vol.  xxxvn,  p.  143,  and  Quarterly  Journal  of  Math.,  vol.  XLVII,. 
p.  127. 
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It  is  clear  that  the  set  of  points,  at  each  of  which  D+f(x)  =  +  oo  ,  D~f(x)  <  k, 
being  a  part  of  the  enumerable  set  at  which  D+f(x)  ^  k,  D~f(x)  <  k,  is 
enumerable.  If  we  give  to  k  successively  the  values  in  a  divergent  sequence 
{km}  of  positive  numbers,  the  set  at  which  D+f(x)  =  +  oo  ,  D~f(x)  <  oo  ,  is  such 
that  each  point  belongs  to  one  of  the  enumerable  sets  for  which  D+f(x)  =  +  oo  , 
D~f(x)  <  kn\  it  has  thus  been  shewn  that  : 

The  set  of  points  at  which  D+f(x)  =  +  cc  ,  and  D~f(x)  is  finite,  or  —  oo  ,  is 
enumerable. 

Similarly  the  three  sets  at  which  .D_/(#)is  +  oo  ,  and  D+f(x)  is  finite,  or 
—  oo  ;  at  which  D+f(x)  is  —  oo  ,  and  D_f(x)  is  finite  or  +  oo  ;  and  at  which 
D~f(x)  is  —  oo  ,  and  D+f(x)  is  finite  or  +  oo  ,  are  all  enumerable. 

292.  Except  at  points  of  an  enumerable  set  the  upper  derivative  on  one  side 
is  greater  than,  or  equal  to,  the  lower  derivative  on  the  other  side. 

Let  us  consider  a  point  x,  at  which  D+f(x)  =  a,  D_f(x)  =  b,  where  a  and  6 
are  numbers  such  that  a<b;  the  number  b  may  be  +  oo  .  Let  &  be  a  rational 
number  interior  to  the  interval  (a,  b),  then  x  belongs  to  the  set  of  points  at 
which  D+f(x)  <  k,  D_f(x)  >  k,  and  this  set  E^,  being  a  part  of  the  enumerable 
set  at  which  _D+  f(x)  <  k,  D_f(x)  ^  k,  is  enumerable.  Any  point  x,  at  which 
D+f(x)  <  D_f(x],  belongs  to  Ek  for  all  properly  chosen  values  of  k.  Any 
point  at  which  D+f(x)  is  finite,  or  —  oo  ,  and  also  D_f(x)  =  +  oo  ,  belongs  to 
the  enumerable  set  Ex.  All  the  sets  Ek  for  all  rational  values  of  k,  and  also 
for  k  =  +  oo  ,  form  an  enumerable  set.  Therefore  all  the  points  at  which 


form  an  enumerable  set.    Similarly,  it  is  seen  that  the  set  of  points  at  which 
D~f(x)  <  D+f(x)  is  enumerable,  and  thus  the  theorem  is  established. 

In  the  case  of  a  continuous  function  f(x),  there  is  a  derivative  on  one  side 
which  has  any  assigned  value  not  greater  than  the  upper  derivative,  and  not 
less  than  the  lower  derivative  on  that  side  (see  §  260).  At  every  point  x  that 
does  not  belong  to  a  certain  enumerable  set, 

D+/(*)  £  D_f(x],  and  D~f(x)  i  D+f(x); 

thus  at  least  one  point  of  the  closed  interval  (D_f(x),  D~f(x))  lies  in  the  closed 
interval  (D+f(x),  D+f(x)).    We  have  then  the  following  theorem: 

A  continuous  function  has  at  least  one  symmetrical  median,  or  extreme, 
derivative  at  every  point  that  does  not  belong  to  a  certain  set  of  measure  zero. 

The  following  theorem  follows  at  once  from  the  above  general  theorem  : 

Except  at  points  belonging  to  an  enumerable  set,  a  function  cannot  have 
unique  progressive  and  regressive  derivatives  which  are  unequal,  whether  finite, 
or  infinite  with  a  determinate  sign. 

H.  24 
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293.  The  following  theorem*,  which  will  be  of  use  in  the  further  dis- 
cussion, will  now  be  established: 

Iff(x)  is  continuous  with  respect  to  a  perfect  set  G,  contained  in  an  interval 
(a,  b),  in  which  f(x]  is  defined,  and  if,  at  each  point  x,  of  G,  there  is  a  closed 
interval  (x,  x  +  hx)  on  its  right,  such  that,  if  x'  be  any  point  in  it,  other  than  x 
itself,  f(x'}  ^f(x],  then  an  interval  (a.,  ft)  exists,  in  (a,  b),  containing  a  part  of 
G,  such  that,  for  every  pair  of  points  x,  x ,  in  (a.,  /3),  which  are  such  that  x 
belongs  to  G,  and  x  ^  x,  the  condition  f  (x)  ^f(x)  is  satisfied. 

Let  a  system  of  nets  with  closed  meshes  be  fitted  on  to  the  interval  (a,  b}. 
If,  in  any  net  Dn,  there  is  a  mesh  dn  which  contains,  in  its  interior,  points  of 
G  such  that,  for  each  such  point  x,  the  condition  f(x')  =/(#)  is  satisfied  for 
all  points  x',  in  the  mesh  dn,  which  are  on  the  right  of  x,  the  mesh  dn,  or  else 
that  part  of  it  obtained  by  cutting  off  a  portion  of  its  left  end,  will  be  an 
interval  such  as  is  required.  Let  it  be  assumed  that,  if  possible,  for  each  value 
of  n,  and  for  every  mesh  dn,  which  contains  points  of  G  in  its  interior,  there 
is  at  least  one  pair  of  points  x,x'(>  x),  where  #  is  a  point  of  G,  and  such  that 
f(x)  >f(x}\  and  where  x  is  interior  to  dn. 

In  Dn,  let  Hn  be  the  set  of  those  points  x,  of  G,  each  of  which  is  interior 
to  a  mesh  of  Dn,  and  such  that  there  is  a  point  x  (>x]  in  the  same  mesh,  for 
which/(#')  >/(#).  Let  Kn  be  the  set  of  all  other  points  of  G',  thus  G  =  Hn  +  Kn. 

It  will  be  shewn  that,  on  the  assumption  that  the  theorem  does  not  hold, 
the  set  Kn  is  non-dense  in  G.  Let  A  be  any  interval  that  contains  points  of 
G  in  its  interior.  For  a  sufficiently  large  value  of  n,  A  will  contain,  in  its 
interior,  a  mesh  of  Dn  that  has,  within  it,  points  of  G',  one  of  such  points  x  is 
such  that/(V)  >f(x),  for  some  point  x  (>x)  contained  in  the  mesh.  A  neigh- 
bourhood (x  —  e,x  +  e')  of  x,  contained  in  the  mesh,  can  be  so  determined  that, 
for  all  points  x,  of  G,  in  that  interval,  f(x)  >f(x}\  this  follows  from  the  con- 
tinuity of  f(x)  in  G.  Therefore  no  point  £,  of  G,  in  (x  —  e,  x  +  e')  is  a  point  of 
Kn.  Since,  interior  to  an  arbitrary  interval  A  which  contains  in  its  interior 
points  of  G,  another  interval  (x  —  e,  x  +  e')  can  be  so  determined  as  to  contain 
points  of  G,  none  of  which  belong  to  Kn,  it  follows  that  Kn  is  non-dense  in  G. 

We  see  that  Hn  contains  Hn+l\  and  the  sets  Kn  being  all  non-dense  in  G, 
the  set  M (Ki,  K.2, ...  Kn, ...)  is  of  the  first  category  relatively  to  G.  It  follows 
that  there  exists  a  set  HM,  a  residual  with  respect  to  G,  that  consists  of  points 
each  of  which  belongs  to  Hn,  for  every  value  of  n.  If  x  be  a  point  of  Ha,  there 
exists  an  interval  x  +  hx,  such  that,  for  every  point  x'  in  it,  f(x)  =/(#). 

If  n  be  sufficiently  large,  the  part  of  that  mesh  of  Dn  containing  x,  that 
is  on  the  right  of  x,  will  be  contained  in  (x,  x  +  hx),  and  consequently,  in  that 

*  See  G.  C.  Young,  Proc.  Land.  Math.  Soc.  (2),  vol.  xv,  p.  367;  the  proof  there  given  differs 
from  that  in  the  text,  in  that  it  seems  to  require  an  infinite  number  of  undetermined  acts  of  choice, 
as  applied  to  successive  pairs  of  points.  For  corrections,  see  Proc.  Land.  Math.  Soc.  (2),  vol.  xix, 
p.  152. 
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mesh  there  is  no  point  of  (>x)  such  that  f(x)  >f(x)\  and  this  is  contrary 
to  the  hypothesis.  There  is  consequently  a  contradiction  in  the  assumption 
made.  Hence  the  theorem  has  been  established. 

294.  The  following  theorem  can  be  deduced  from  that  of  §  293: 

If,  at  each  point  of  the  perfect  set  G,  with  respect  to  which  f(x)  is  continuous, 
D+f(x)  <  0,  an  interval  (a,  ft)  can  be  so  determined  that,  for  any  interval  contained 
in  it,  with  a  point  of  G  as  left-hand  end-point,  the  incrementary  ratio  is  ^  0. 

For  if,  at  a  point  x,  of  G,  we  have  D+ f(x)  <  0,  a  set  of  intervals  (x,  x  +  h) 
exists,  such  that  /(£)  ^  /(#),  for  all  points  £  (=}=  x)  of  (x,  x  +  h).  There  will  be 
an  upper  boundary  h,  of  h,  and  at  the  point  x  +  h,  we  may  have  f(x  +  h)  >f(x). 
The  neighbourhood  (x,  x  +  ^h)  may  be  assigned  to  x  as  the  interval  (x,  x  +  hx) 
in  the  theorem  proved  above.  In  the  interval  (a,  /8),  we  have  D+f(x)  £  0,  at 
every  point  x,  of  G, 

If  we  replace  f(x),  in  the  above  theorem,  by  f(x)  —  k,  we  have  the  following 
theorem: 

If,  at  each  point  of  the  perfect  set  G,  with  respect  to  which  f(x)  is  continuous, 
D+ f(x)<  k,  an  interval  (a.,  /3)  can  be  so  determined  that,  for  any  interval  con- 
tained in  it,  with  a  point  of  G  as  left-hand  end-point,  the  incrementary  ratio 

is  ^  k. 

If  we  change  f(x)  into  —f(x\  and  k  into  —  k;  since 
D+{-/(«)}--D+/(«) 

we  have  the  corresponding  theorem,  in  which  D+f(x)  >  k  takes  the  place  of 
the  condition  Z)+ f(x)  <  k,  and  the  incrementary  ratio  in  (a,  /3)  is  ~  k. 
It  is  clear  that  corresponding  theorems  hold  for  D~f(x),  D_f(x). 

295.  In  order  not  to  interrupt  the  continuity  of  the  account  of  the  main 
properties  of  the  derivatives  of  functions,  and  to  secure  the  greatest  attainable 
degree  of  generality  in  the  results,  the  conception  of  a  measurable  function 
will  be  here  introduced.    A  function  f(x),  defined  in  any  interval  (a,  6),  is  said 
to  be  measurable,  provided  that,  for  every  value  of  A,  the  set  of  points  x,  of 
(a,  6),  at  which  f(x)>A,  is  a  measurable  set  of  points.    The  properties  of 
measurable  functions  will  be  considered  in  detail  in  the  account  of  Lebesgue 
integrals  which  will  be  given  in  Chapter  VII.    An  important  property  of  a 
measurable  function  /(»)  will  here  be  assumed,  viz.  that  a  set  E,  of  points  of 
(a,  6),  of  measure  arbitrarily  near  to  b  —  a,  the  measure  of  the  interval,  exists 
such  that  f(x)  is  continuous  with  respect  to  the  set  E.    This  property  of 
measurable  functions  will   be  established  in  Vol.  II.    If  this  property  of  a 
measurable  function  be  not  assumed,  the  properties  of  derivatives  established 
below  must  be  taken  only  to  apply  to  continuous  functions.    It  has  been  shewn 
in  §  128,  that  any  measurable  set  E  contains  a  perfect   set    G,  such   that 

24—2 
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—  m'(G)  is  less  than  an  arbitrarily  chosen  positive  number,  it  being 
assumed  that  m(E)>0.  It  has  further  been  shewn,  in  §  138,  that  a  perfect 
set  G,  of  measure  greater  than  zero,  contains  a  perfect  component  G,  of  measure 
equal  to  that  of  G,  such  that  G  is  metrically  dense  in  itself.  It  follows  that  a 
measurable  function  /(#),  defined  in  the  interval  (a,  b),  is  continuous  with 
respect  to  a  perfect  set  G,  metrically  dense  in  itself,  and  such  that  m  (G)  is 
less  than  b  —  a,  by  less  than  an  arbitrarily  chosen  positive  number. 

If  H  be  a  measurable  set  of  points  contained  in  (a,  b),  and  of  measure 
m  (H)  >  0,  the  set  H  must  contain  a  perfect  component,  metrically  dense  in 
itself,  and  such  that  the  measurable  function  f(x)  is  continuous  with  respect 
to  that  perfect  component.  For  G  may  be  so  defined  that 

b-a-m(G)<m  (H), 

and  thus  H  and  G  have  a  part  K  in  common,  such  that  m  (K]  >  0.  The  set 
K  contains  a  perfect  component,  metrically  dense  in  itself,  and  this  is  the 
required  component  of  H,  with  respect  to  which  f(x)  is  continuous. 

296.  The  set  of  points  at  which,  for  a  finite  measurable  function  f(oc), 
the  conditions  D+  f(x)  =  4-00,  D_f(x)  >  0  hold,  forms  a  set  of  measure  zero. 

A  function  is  said  to  be  finite,  whether  it  be  bounded  or  not,  when  its 
value  at  no  point  of  its  domain  is  infinite. 

Suppose,  if  possible,  that  the  set  had  a  measure  >  0.  A  part  G  of  this  set 
can  be  taken,  which  is  perfect,  and  with  respect  to  which  f(x)  is  continuous, 
and  such  that  G  is  metrically  dense  in  itself.  An  interval  (a,  6)  can  be  so  deter- 

/(£)  _  f(x\ 

mined  (see  §  294)  that,  if  £  be  any  point  of  it  belonging  to  G,  -^4.  —  —  —  =  0, 

£  —  x 

if  a^x<%.    Let  E  be  the  part  of  G  in  (a,  b);  then  m  (E)  >  0. 

To  each  point  of  (a,  b)  an  interval,  with  the  point  as  left-hand  end-point, 
can  be  assigned  as  follows: 

To  each  point  of  E,  we  assign  an  interval  such  that  the  incrementary 
ratio  in  it  is  =  A,  a  fixed  number.  To  each  point  in  an  interval  contiguous 
to  E,  we  assign  that  part  of  the  contiguous  interval  which  is  on  its  right. 
To  b,  an  arbitrary  interval  can  be  assigned.  One  Lebesgue  chain  can  be 
defined,  constituted  of  intervals  of  the  set,  which  reaches  from  a  to  6;  we  can 
then  omit  the  interval  corresponding  to  b. 

The  sum  of  all  those  intervals  of  the  chain  of  which  the  left-hand  end- 
point  belongs  to  E  has  measure  >  m(E);  for  these  intervals  contain  all  the 
points  of  E,  and  the  other  intervals  contain  no  points  of  E  in  their  interiors. 

A  finite  set  A,  of  intervals  of  the  chain,  can  be  so  chosen  that  the  remainder 
has  measure  <\m  (E). 

The  finite  set  contains  therefore  intervals  A',  of  which  the  left-hand  end- 
points  belong  to  E,  and  the  sum  of  such  intervals  is  >  \m  (E}.  In  the  intervals 
of  (a,  6)  complementary  to  A',  the  incrementary  ratio  is  ~  0.  In  A'  and  its 


295-297]  General  properties  of  derivatives  373 

complement  the  increment  off(x)  from  left  to  right  is  ^  0;  and  the  total  of 
all  such  increments  is  f(b)—f(a).  But  these  intervals  have  been  shewn  to 
contain  a  set  in  which  the  sum  of  the  increments  is  >  \Am(E},  which  exceeds 
f(b)-f(a),  provided  A  is  chosen  sufficiently  large;  and  this  is  impossible. 
Therefore  the  measure  of  the  set  is  0. 

If  we  substitute /(#)  +  kx  for/(#),  we  see  that  the  set  of  points  at  which 
D+  /(*)  =  +  oo ,  D_/(#)  >  —  k,  is  zero. 

Giving  to  k  successively,  the  values  in  a  set  klt  kz,  ...  which  increase 
indefinitely,  such  that  kn<kn+l,  the  sets  corresponding  to  klt  k2,  ...,  each 
has  measure  zero,  hence  the  set  at  each  point  of  which  D+f(x)  =  +  oo  , 
D_f(x)  >  —kn,  for  some-  value  of  n,  has  measure  zero. 

Therefore  at  every  point  at  which  D+f(x)  =  +  oo ,  with  the  exception  of  a 
set  of  measure  zero,  we  must  have  D_f(x)  =  —  oo  . 

Hence  we  have  the  following  theorems: 

(1)  Iff  (as)  be  a  finite  measurable  function,  and  if,  at  the  points  of  a  set, 
D+f(x)  —  +  cc,  then  D_f(x)  =  —  x>  at  all  the  points   of  that  set,  with  the 
exception  of  a  part,  of  which   the  measure  is  zero.     In  the  set  in  which 
D+f(x)  =  —  oo  ,  with  a  similar  exception,  D~~  f(x)  =  +  oo  . 

(2)  If  f(x)  be  a  finite  measurable  function,  the  set  of  points  at  which  it 
has  an  infinite  unique  derivative,  on  one  side,  has  the  measure  zero. 

Let  Z)+  f.(x)  =  D+f(x)  =  +  oo  ,  at  all  points  of  a  set  G.    Since 

JD-/(«) "+/(«), 

except  at  points  of  a  set  of  measure  zero,  we  must  have  D~f(x)  =  -f-  oo  , 
if  the  exceptional  set  be  left  out  of  account.  By  the  last  theorem,  when 
D~  f(x)  =  +  oo  ,  we  must  have  D+f(x)  =  —  oc  ,  except  at  a  set  of  measure  zero. 
It  follows  that  the  set  G  has  measure  zero. 

297.  (3)  If  f(x}  is  a  measurable  and  finite  function,  the  points  at  which 
D+f(x),  D+f(x)  are  finite,  and  different  from  one  another,  form  a  set  of 
measure  zero. 

If  S  is  the  set  of  such  points,  and  we  remove  from  S  some  set  of  measure 
zero,  the  derivatives  D~f(x),  D_f(x)  must  also  be  finite.  This  follows  from 
theorem  (1).  We  may  assume  then  that  $  is  a  set  in  which  all  four  deri- 
vatives are  finite  at  each  point,  and  D+ f(x)^D+f(x).  •  Let  us  suppose  that, 
if  possible  m  (S)  >  0.  If  N  be  a  positive  number,  let  Sjy  be  that  part  of  S  in 
which  all  the  four  derivatives  are  numerically  <  N.  If  N  have  the  values  in 
an  increasing  sequence  N1}  Nz, ...,  which  diverges,  we  see  that  SNr+l  contains 
SNf\  the  set  S  is  the  outer  limiting  set  of  the  sequence  {Syr},  and  since 
m(S)  >  0,  we  must  have  m  (Sr)  >  0,  from  and  after  csome  value  of  r.  We  may 
then  take  N  so  large  that  m  (SN)  >  0. 
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A  part  2,  of  SN,  can  be  determined  so  as  to  be  perfect,  and  such  that  the 
measure  of  that  part  of  it  that  lies  in  any  interval  which  contains,  in  its 
interior,  points  of  2,  has  measure  >  0;  and  is  also  such  that/(#)  is  continuous 
relatively  to  2  (see  §  295). 

Let  g  (x)  =/(#)  +  NX,  then  all  the  derivatives  D+g  (x),  D+g  («),  D~g  (x) 
are,  in  2,  <  2N,  and  >  0. 

An  interval  (a,  6)  can  be  chosen  such  that  it  contains  a  part  G,  of  2,  such  that 
the  incrementary  ratio  of  g  (x),  for  any  pair  of  points  contained  in  the  interval, 
is  ^  2JV,  if  the  right-hand  point  belongs  to  G,  or  is  ^  0,  if  the  left-hand  point 
belongs  to  G.  The  interval  can  be  so  chosen  that  a  and  b  are  points  of  G. 

A  part  Gic,  of  G,  can  be  so  chosen  that  D+g  (x)  —  D+g  (x)  >k,  and  k  can 
be  so  chosen  that  m  (Gk)  >  0. 

For,  if  k  have  the  values  of  a  decreasing  sequence  of  numbers  that  con- 
verges to  0,  Gjcr  is  contained  in  Gkf+l ;  thus  the  measure  of  Gkr,  from  and  after 
some  value  of  r,  must  be  >  0.  The  set  Gk  is  the  sum  of  a  number  of  sets  G^, 
where  i  is  an  integer,  such  that,  in  GjcL,  ^(i  —  V)k^D+g(x}<\ik,  where 
i  =  1,  2,  3,  ...  t:;  t,  being  the  greatest  integer  for  which  ^(t  —  1)&  <  IN.  One  at 
least  of  these  sets  Gki  must  have  measure  >0;  let  i  be  the  least  integer  for 
which  this  is  the  case.  There  exists  a  part  E,  of  Gki,  that  is  perfect  and  has 
measure  >  0.  We  may,  by  alteration  of  a  and  b,  if  necessary,  ensure  that 
a  and  b  belong  to  E.  In  the  set  E, 

%(i-I)k  £D+g  (x)<  %ik,  and  D+g(x)  >  $  (t  +  1)  k. 
Choose  a  finite  set  of  the  intervals  that  are  contiguous  to  E,  such  that  the 
measure  of  the  sum  of  the  remaining  intervals  is  <  e,  where  0  <  e  <  m  (E). 

Let  this  finite  set  be  denoted  by  A',  and  let  A  be  the  finite  set  comple- 
mentary to  A',  in  (a,  b).  We  have  then 

m  (A)  >  m  (E),  and  m  (A)  <  m  (E)  +  e. 

Let  us  consider  one  of  the  intervals  (a,  /3),  of  A;  a,  /3  are  points  of  E. 
To  each  point  x,  of  the  part  of  E  interior  to  (a,  /3),  and  to  the  point  a,  we 
assign  an  interval  (x,x+h),  on  its  right,  such  that  g  (x  +  h)  —  g  (x)  <  \  ikh. 
To  each  point  in  (a,  /3),  not  a  point  of  E,  we  assign  that  part  of  the  interval 
contiguous  to  E  in  which  it  lies,  that  is  on  its  right.  There  is  a  unique  chain 
composed  of  intervals  so  defined,  that  stretches  from  a  to  /3.  Remove  from 
this  chain  a  set  of  intervals  of  total  measure  <  e/n,  such  that  the  remaining 
intervals  A"a>0  form  a  finite  set.  The  sum  of  the  finite  set  of  intervals  com- 
plementary to  A"a,0  is  a  finite  set  A'"ai0,  of  total  measure  <e/n,  where  n  is 
the  number  of  intervals  (a,  /3),  of  A.  We  proceed  in  this  way  with  each  interval 
(a,  /3),  of  A. 

The  interval  (a,  b)  is  now  divided  into  a  finite  number  of  parts  consisting 
of  (1)  the  set  A"  =  2  A"a,0,  (2)  the  set  A'"  =  2A'"a,p,  and  (3)  the  set  A'. 

We  have  m  (A"')  <  e. 
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The  intervals  of  A"  consist  partly  of  intervals  (x,  x  4-  h),  where  &  is  a  point 
of  E,  and  partly  of  intervals  not  containing  in  their  interior  points  of  E.  The 
measure  of  the  latter  set  is  <  e,  and  that  of  the  former  is  >  m  (E)  —  e,  and 
<  ra  (  E)  +  e. 

Since,  for  one  of  these  intervals  (x,  x  +  h\  we  have 

g  (x  +  h)  —  g  (x)  <  \tkh, 

and  in  any  interval  with  the  left-hand  end-point  a  point  of  E,  the  incrementary 
ratio  is  ^  2N;  we  have 

g(b)-g(a)<$ik{m(E)  +  e}  +  2Ne  +  p,    ...............  (1) 

where  p  denotes  the  sum  of  the  increments  of  g  (x)  in  the  intervals  A'. 

Again,  if  we  assigned  to  each  point  x,  of  E,  an  interval  (x,  x  +  h')  such 
that  g  (x  +  h')  —  g  (x)  >^(i  +  l)k,  and  proceeded  as  before,  then,  remembering 
that,  in  any  interval  in  which  the  left-hand  end  -point  belongs  to  G,  the  in- 
crementary ratio  is  =  0,  we  should  find  that 

g(b)-g(a)>±(i  +  I)k[m(E)-e}+p.     ...  ............  (2) 

The  inequalities  (1),  (2)  are  in  contradiction  if 


which  will  hold  provided  e  have  been  chosen  sufficiently  small.  It  follows  that 
the  set  S  cannot  have  measure  >  0.  The  method  of  the  above  proof  can  be 
applied  to  shew  that  a  similar  result  holds  for  D~f(x),  D+f(x).  It  thus  appears 
that,  in  the  theorem  (3),  any  pair  of  derivatives  may  be  taken. 

298.  It  has  now  been  shewn  that,  in  the  set  $1}  of  points  for  which  all 
four  derivatives  are  finite,  the  upper  and  lower  derivatives  on  the  right  are 
equal,  and  the  upper  and  lower  derivatives  on  the  left  are  equal,  except  in  a 
part  of  Si,  of  measure  zero;  and  thus,  in  a  set  of  measure  m(S1),  there  are 
unique  derivatives  both  on  the  right  and  on  the  left.  The  points  of  $x  at 
which  these  are  unequal,  form  a  set  of  measure  zero,  as  is  seen  from  the 
theorem  that  the  points  at  which  the  upper  derivative  on  one  side  is  less 
than  the  lower  derivative  on  the  other  side  form  a  set  of  measure  zero.  We 
have  accordingly  the  following  theorem  : 

The  set  of  points  at  which  the  four  derivatives  are  all  finite,  and  not  all 
equal,  has  measure  zero.  Thus,  in  the  set  of  points  at  which  all  the  derivatives 
are  finite,  there  is  a  finite  differential  coefficient  everywhere,  except  at  points  of 
a  set  of  measure  zero. 

Next,  let  us  consider  the  set  S2,  in  which  one  derivative  is  infinite,  +  oo  , 
or  —  oo  ,  and  the  others  are  all  finite.  By  theorem  (1),  $2  has  measure  zero. 

If  at  points  of  a  set  S3,  two  of  the  derivatives  are  infinite  and  the  other  two 
finite,  then,  except  in  the  points  of  a  part  of  the  set,  of  which  the  measure  is  zero, 
these  derivatives  must  be  the  upper  derivative  on  one  side  and  the  lower  deriva- 
tive on  the  other  side,  and  the  two  finite  derivatives  are  equal. 
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If  three  of  the  derivatives  are  infinite  at  points  of  a  set  S^  then,  by  the 
theorems  (1)  and  (2),  all  four  derivatives  are  infinite  at  all  points  of  $4,  except 
in  a  part  of  the  set,  of  which  the  measure  is  zero. 

If  all  four  derivatives  are  infinite  at  the  points  of  a  set  S5,  then,  except 
in  a  part  of  S5,  of  which  the  measure  is  zero, 

D+f(x)  =  D~f(x)  = '+  oo  ,  and  D+f(x}  =  IL/0)  =  -  oo  . 
The  general  result  may  be  stated  as  follows: 

Iff(x)  be  a  measurable  function,  finite  at  each  point,  and  if  a  set  of  points 
of  measure  zero  be  left  out  of  account,  then  at  every  point  x,  either 

(1)  there  is  a  finite  differential  coefficient,  or 

(2)  the  two  upper  derivatives  are  +  oo ,  and  the  two  lower  derivatives  are 

—  oo  ,  or 

(3)  the  upper  derivative  on  one  side  is  +00,  the  lower  derivative  on  the 
other  side  is  —  oo ,  and  the  other  two  derivatives  are  finite,  and  equal. 

This  general  theorem  was  established  by  Denjoy*  for  the  case  of  a  con- 
tinuous function,  and  by  G.  C.  Youngf  for  the  case  of  any  measurable  function. 
The  investigation  here  given  is  based  upon  the  work  of  the  latter  writer. 

The  following  is  a  consequence  of  the  general  theorem: 

There  is  no  geometrical  distinction  of  right  and  left  with  regard  to  the  four 

extreme  derivatives  of  a  measurable  finite  function,  except  possibly  at  points  of 

a  set  of  measure  zero. 

A  number  of  special  theorems  are  particular  cases  of  the  general  theorem. 

Thus,  we  have  the  following: 

If  a  measurable  function  have  finite  derivatives,  except  at  points  of  a  set  of 
measure  zero,  it  has  a  finite  differential  coefficient  except  at  points  of  a  set 
of  measure  zero. 

This  theorem  was  first  given  by  MontelJ,  for  the  case  of  a  continuous 
function. 

A  monotone  function  f(x)  has  a  finite  differential  coefficient  at  every  point 
which  does  not  belong  to  a  set  of  measure  zero. 

For,  in  the  case  of  a  monotone  function,  all  four  derivatives  are  =  0,  or  all 
four  are  ^  0,  and  thus  the  cases  (2)  and  (3)  of  the  general  theorem  cannot  arise. 

This  theorem  was  first  established  §,  in  connection  with  the  theory  of 
Integration,  by  Lebesgue,  for  the  case  of  a  continuous  function.  It  was 
extended  by  W.  H.  Young  to  the  case  of  a  monotone  function  which  is  not 
necessarily  continuous|j. 

*  Journal  de  Math.  (7),  vol.  i,  p.  105.  t  Proc.  Land.  Math.  Soc.  (2),  vol.  xv,  p.  360. 

J  Comptes  Rendus,  Paris,  Dec.  23,  1913.  §  Lemons  sur  I' integration,  p.  128. 

||  Quarterly  Journal,  vol.  XLII,  p.  79. 
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Since  every  function  of  bounded  variation  is  the  difference  of  two  mono- 
tone functions,  we  have  the  theorem  that : 

A  function  of  bounded  variation  has  a  finite  differential  coefficient  at  every 
point  which  does  not  belong  to  a  set  of  measure  zero. 

The  following  special  case  of  the  general  theorem  was  first  given  by  Lusin : 

The  points  at  which  a  continuous  function  has  an  infinite  differential  coeffi- 
cient (with  fixed  sign}  form  a  set  of  measure  zero. 

Also,  it  was  first  shewn  by  Denjoy  that: 

The  points  at  which  a  continuous  function  has  an  infinite  progressive  deri- 
vative, or  an  infinite  regressive  derivative,  form  a  set  of  measure  zero. 

299.  We  have  hitherto  assumed  that  f(x),  whether  bounded  or  not,  is 
finite,  for  each  value  of  x.  But  if  it  be  assumed  that  there  is  an  exceptional 
set  of  points,  of  measure  zero,  at  which  f(x)  is  infinite,  the  general  theorem 
will  still  hold  good,  as  is  seen  by  examining  the  proofs  of  the  various  theorems. 
It  is  assumed  that,  in  the  definition  of  a  derivative  Df(x),  the  incrementary 
ratio,  for  a  pair  of  points  at  which  the  function  is  infinite,  with  the  same  sign, 
is  ignored.  At  a  point  where  f(x)  =  +  oo  ,  we  have 

D+/0)  =  D+f(x)  =  -  oo ,  and  D~f(x)  =  D_f(x)  =  +  oo  ; 
and  at  a  point  where  f(x)  =  —  oo ,  we  have 

D+/0)  =  D+f(x}  =  +  oo  ,  and  D~f(x)  =  D-f(x)  =  -  oo  . 

It  thus  appears  that  the  set  of  points  at  which  the  upper  derivative  on  one 
side  is  less  than  the  lower  derivative  on  the  other  side  contains  the  set  of 
points  at  which  the  function  is  infinite. 

An  extension  has  been  made  by  G.  C.  Young*  to  the  case  in  which  the 
function  is  infinite  at  points  of  a  set  of  which  the  measure  is  greater  than  zero. 

It  has  been  shewn  by  Denjoy  that  a  continuous  function  can  be  con- 
structed, in  which  all  three  cases  of  the  general  theorem  present  themselves 
for  one  function,  or  in  which  one  or  more  of  the  cases  may  be  absent. 

EXAMPLES. 

The  following  examples  have  been  given  by  Denjoy : 

1.    Let  G  be  a  perfect  non-dense  set  of  measure  >  0 ;  let/(#)=0,  at  all  points  of  G ;  and 

in  each  complementary  interval  (an,  bn}  of  G,  let /(#)  =  ;- — '-   -  {(x  —  an)  (bn  —  x}}^.   At  each 

On  ~  an 

point  of  G, 

Z> +/(*)=+«,     D+f(x}  =  0,     Z>-/(#)  =  0,     D_f(x}~-v. 

In  each  complementary  interval/^)  has  the  maximum  value  £,  and  thus  the  function 
is  not  continuous  at  the  points  of  G. 

*  Loc.  cit.,  pp.  378-384., 
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2.  Let  G  be  a  perfect  non-dense  set,  of  measure  zero,  in  the  interval  (a,  6).     Let  un  be 
the  length  of  the  contiguous  interval  (an,  &„);  and  suppose  that  uit  u2,  ...un,  ...  are  in 
descending  order  of  magnitude.    Let  A1?  A2,  •••  hn,  ...  be  positive  numbers  such  that  ~S,hnun 

2  fx  —  a  \  ^ 

is  convergent,   and   let   yn  (#)  =  —  hnunsm~l  (  --  -)  ,   for  an<x<bn.      The    function 

7T  \     Un      } 

f(x}  =  (a'2:X)hnun  +  u)ym  (x),  where  a>  is  zero  if  x  is  a  point  of  G,  and  is  1,  if  x  is  interior  to 
a  contiguous  interval  ;  and  (a'S.x)  hnun  denotes  the  sum  of  those  parts  of  2  hnun  that  are  in 
the  interval  (a,  x).  The  function  f(x)  is  continuous  ;  it  has  a  finite  differential  coefficient 
at  every  point  that  does  not  belong  to  G,  and  it  has  an  infinite  differential  coefficient  at  all 
points  of  G. 

3.  With  the  same  notation  .as  in  Ex.   2,  let  /(#)  =  0,  in  all  points  of  G,  and  let 
f(x)  =  UnZy  (  -----  -  ),  in  the  interval  (an,bn);  where  y  (#)  =  (!  —  #)2#asin2—  .     The  function 

\     un     /  % 

f  (x)  is  continuous;  it  has  a  finite  and  continuous  differential  coefficient  at  all  points  not 
belonging  to  G.  In  all  end-points  of  contiguous  intervals  Z)+/(.r)=  +  oo  ;  and  the  other 
three  derivatives  are  all  zero  at  all  points  of  G. 

4.  Let  E  be  a  perfect  set,  metrically  dense  in  itself;  and  let 


Let/(#)  =  0,  at  all  points  of  E;  in  un  \eif(x}  —  pn%y  ('—  r-^-M,  where  pn  is  chosen  as  follows: 

\     *n    / 

From  the  interval  (a,  6)  remove  the  intervals  ul}  u2,  ...  un_l;  then  un  lies  in  one  of  the 
remaining  segments  of  (a,  6)  ;  let  pn  be  the  length  of  this  segment.     In  all  points  of  E, 


5.    Defining  E  as  in  Ex.  4,  let  /(#)=(),  in  all  points  of  E,  and  let 

2f*  (#)  =  {pn*  (1  -*)}  *  sin  ^]~r^)  ' 
In  all  points  of  E, 

D+f(x}  =  D-f(x)=+v,     D+f(x}=D_f(x}=-v. 


300.    The  following  general  theorem  is  due*  to  W.  H.  Young. 

The  points  at  which  one  at  least  of  the  four  derivatives  of  any  given  function 
is  infinite,  form  an  ordinary  inner  limiting  set,  if  it  exists.  The  set  of  such 
points  is  accordingly  of  power  c,  when  it  contains  a  component  dense  in  itself; 
and  otherwise  it  is  enumerable,  or  finite. 

It  follows  that  the  set  of  such  points  is  a  set  of  the  second  category,  in 
case  it  be  everywhere  dense. 

The  special  case  of  the  theorem  which  arises  when  the  function  is  con- 
tinuous, and  the  set  of  points  is  everywhere  dense,  was  given  by  Brodenf- 

Let  #0  be  a  point  at  which  one  of  the  four  derivatives  is  infinite,  it  being 
immaterial  whether  the  other  derivatives  are  infinite  or  finite.  A  sequence 

*  Arkiv  for  Matematik,  Astronomi  och  Fysik,  vol.  i,  Stockholm,  1903. 
t  Acta  Univ.  Lund.  vol.  xxxm,  p.  31. 
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a?!,  #2, . . ,  #n, . . .  converging  to  sc0,  and  on  one  side  of  it,  can  be  found,  which  has 
the  property  that,  corresponding  to  an  arbitrarily  large  positive  number  o-,  an 
integer  m,  can  be  found,  such  that 

/to»)  -/to) 


>  <r,  for  n  >  ml ; 


further,  m'  can  be  chosen  so  great  that 

1 


Let  the  intervals  (#0,  xn)  be  prolonged  on  the  side  beyond  x0,  each  being  in- 
creased by of  its  length ;  and  the  whole  set  of  intervals  so  constructed  for 

every  point  x  of  the  set  at  which  a  derivative  is  infinite  may  be  called  Iff. 

Let  o-j,  o-2,  cr3, ...  be  a  set  of  values  of  <r  which  increase  without  limit;  then 
the  corresponding  sets  of  intervals  7^,  7^, ...  define  an  inner  limiting  set  of 
points,  to  which  all  the  points  x  of  the  given  set  belong;  and  it  will  be  shewn 
that  no  other  points  belong  to  this  inner  limiting  set.  If  possible  let  £  be  a 
point  of  the  inner  limiting  set  which  does  not  belong  to  the  given  set  of  points 
at  which  a  derivative  is  infinite.  There  is  at  least  one  interval  of  each  of  the 
sets  /o-j,  /0-j,  •••,  such  that  £  is  an  interior  point  of  it;  let  such  intervals  be 
$!,  £>>•••>  and  let  £1?  £2,  £3,  •••  be  points  of  the  given  set  interior  to  .these 
intervals.  Let  f1}  £2,  fs>  •••  be  the  end-points  of  the  intervals  on  the  sides  of 
those  intervals  which  were  not  lengthened.  We  have 


thus  the  points  £,  £2,  £3, ...,  and  also  the  points  fl5  £2, ...,  form  a  sequence  of 
which  £  is  the  limit. 


Since 


fc-fc 


>  crt,  therefore  [/(£)  — /(£J  j  >  (0"t  —  1)  8t. 


Also  a  positive  number  A  can  be  determined,  such  that  for  all  values  of  L, 


for  otherwise  ^  would  be  a  point  with  an  infinite  derivative;  and  from  this  we 
see  that 


For  a  sufficiently  great  value  of  i, 


L  —  1  >  ^4  ; 


hence,  for  such  a  value  of  L, 


and  thus 


f-l. 
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Now  o\  —  .A  —  1  is  arbitrarily  large  for  a  sufficiently  great  t;  hence,  since  £  is 
the  limiting  point  of  the  sequence  {ft},  there  is  an  infinite  derivative  at  £. 
This  is  contrary  to  the  hypothesis  made;  therefore  the  points  of  the  given 
set  constitute  the  inner  limiting  set  which  has  been  denned. 


FUNCTIONS  OF  TWO  VARIABLES. 

301.  It  has  already. been  shewn  that  many  of  the  fundamental  concep- 
tions and  theorems  relating  to  the  continuity,  uniform  continuity,  etc.,  of  func- 
tions of  one  variable  hold  good,  without  essential  change,  for  functions  of 
two  or  more  variables. 

Most  of  the  points  in  which  the  theory  of  functions  of  a  number  of  vari- 
ables involves  considerations  which  are  not  an  immediate  generalization  of 
those  which  occur  in  the  case  of  functions  of  a  single  variable  are  sufficiently 
illustrated  by  the  case  of  functions  of  two  variables.  Accordingly  the  proper- 
ties of  functions  of  two  variables  will  be  considered  in  some  detail. 

That  a  function  f(x,  y}  should  be  continuous  at  a  point  (a,  b),  which  i$  a 
limiting  point  of  its  domain,  it  is  necessary,  but  not  sufficient,  that  the  func- 
tion f(x,  b),  of  x,  should  be  continuous  at  the  point  x  =  a,  and  that  the  function 
f(a,  y\  of  y,  should  be  continuous  at  the  point  y  =  b.  Thus  a  function  may  be 
continuous  at  a  point  with  respect  to  x,  and  also  with  respect  to  y,  whilst  it 
is  discontinuous  with  respect  to  the  two-dimensional  domain  (x,  y\ 

It  is  not  even  sufficient  to  ensure  the  continuity  off(x,  y)  at  a  point,  that 
it  be  continuous  in  every  direction  from  the  point.  Thus 

/(a  +  r  cos  6,  b  +  r  sin  6} 

may  be  a  continuous  function  of  r,  at  r  —  0,  for  each  value  of  6  in  the  interval 
(0,  2?r),  and  yet*  the  function  may  be  discontinuous  at  (a,  b). 

The  necessary  and  sufficient  condition  that  f(x,  y)  may  be  continuous  at 
(a,  b)  may  be  expressed  in  the  form  that  f(x,  y)  must  be  continuous  in  every 
direction  at  the  point,  and  uniformly  so  for  all  directions. 

Thus,  if  /(a  +  r  cos  0,b  +  r  sin  6)  be  continuous  at  r  ==  0,  for  each  value  of 
6,  and  uniformly  so  for  all  values  of  6,  then,  if  e  be  a  prescribed  positive  num- 
ber, a  number  p  can  be  determined,  independent  of  6,  such  that 

/(a  +  r  cos  6,  b  +  r  sin  6}  —/(a,  b)  \  <  e, 

provided  r  <  p.  From  this  condition  it  follows  that  \f(x,y}  —f(a,  b)  <  e,  pro- 
vided x  —  a\,  |  y  —  b  are  each  <  p/\/2,  and  thus  the  condition  of  continuity  of 
the  function  is  satisfied. 

The  remarks  which  have  been  made  as  regards  the  continuity  of  a  function 
at  a  point  are  applicable  without  essential  change  if  those  functional  values 

*  See  Thomae,  Abriss  einer  Theorie  der  komplexen  Functionen,  2nd  ed.,  p.  15. 
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in  the  neighbourhood  of  the  point  are  alone  taken  into  account,  which  are  in 
one  of  the  four  quadrants,  the  values  at  points  on  the  axes  bounding  the 
quadrant  being  either  included  or  excluded  from  consideration,  as  may  be 
agreed  upon.  Thus  the  condition  of  continuity  at  a  point  may  be  satisfied  for 
one  such  quadrant,  and  not  for  another  one. 

As  regards  the  points  of  discontinuity  of  a  function  of  two  or  more  vari- 
ables reference  may  be  made  to  a  memoir*  by  W.  H.  Young. 

EXAMPLES. 

1.  Let  f(x,  y)  =  -2      2>  and  /(O,  0)=0.     This  function  is  discontinuous  at  the  point 

x  -\-y 

(0,  0),  although  it  is  continuous  at  that  point  with  respect  to  #,  and  also  with  respect  to  y 
since  f(x,  0)  =  0,  /(y,  0)  =  0.  In  all  other  directions  the  function  is  discontinuous;  for 
writing  #=rcos$,  y  —  rsind,  the  function  is  |  sin  20  and  therefore  has  a  constant  value 
different  from  zero  on  a  straight  lino  for  which  6  is  constant,  unless  6  has  one  of  the  values 
0,  £TT,  TT,  or  f  TT. 

3CM 

2.  Lett  /(a?,  y)=  -^—\>  /(0»0)=0.      This  function   is  discontinuous  at  the  point 
(0,  0),  although  it  is  continuous  in  each  particular  direction,  at  that  point.     We  find  that 

^-j—  -.  <e.  if  r  <  —  cosec-  6  {1  —  \/l  —  4e2  cos2  6}  ;  and  in  order  that  this  condition 
24  2e 


may  be  satisfied,  the  greatest  value  of  r  diminishes  indefinitely  as  6  approaches  the  value 
irr;  whereas  when  6  =  \ir,  the  function  is,  for  every  value  of  r,  equal  to  /(O,  0).  It  is 
thus  seen  that  the  convergence  in  different  directions  is  non-uniform. 

DOUBLE   AND   EEPEATED   LIMITS. 

302.  Let  (a,  6)  be  a  limiting  point  of  the  domain  for  which  a  function 
f(x,y}  is  denned,  and  let  a  neighbourhood,  of  which  the  corners  are  the  four 
points  (a  ±  e,  b  ±  e'),  be  taken.  Let  V,  L  be  the  upper  and  lower  boundaries 
of  the  function  for  all  points  of  the  domain  in  this  neighbourhood,  the  func- 
tional value  at  (a,  b)  being  however  disregarded,  in  case  (a,  b)  belongs  to  the 
domain.  If  e,  e'  be  diminished,  the  number  U  cannot  increase;  and  when 
values  of  e,  e'  belonging  to  sequences  e^  e2,  ...  en,  ...  ,  and  e/,  e2',  ...  e,/,  ...,  each 
of  which  converges  to  the  limit  zero,  are  taken,  and  Un  be  the  value  of  U 
corresponding  to  .  the  values  en,  e,/,  of  e,  e',  the  numbers  Uly  U2>...  Un,... 
form  a  sequence  of  numbers  which  do  not  increase.  This  sequence  has  then 
a  limit  U,  which  may  however  have  the  improper  value  oo  ,  in  case  all  the 
numbers  Un  have  this  improper  value.  It  is  easily  seen  that  U  is  independent 
of  the  particular  sequences  chosen  for  e,  e'.  This  number  U  is  said  to  be  the 
upper  limit  of  the  function  at  (a,  b),  and  may  be  denoted  by 

lim    /  (x,  y). 

x~a,  y~b 

*  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  vm,  p.  117. 
t  Genocchi-Peano,  Gale.  Diff.,  §  123. 
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The  lower  limit    lim    f(x,  y)  may  be  defined  in  a  similar  manner,  as  the 

x~a,  y~b 

limit  of  a  sequence  of  values  of  L ;  and  it  may  have  the  improper  value  —  oo  . 
At  a  point  of  continuity  of  the  function,  the  condition 
lim    f(x,  y)  =    lira   f(x,  y) 

x~a,y~b  x~a,y~b 

is  satisfied ;  and  further,  each  of  these  limits  is  equal  to  f(a,  b),  in  case  (a,  6) 
belongs  to  the  domain  of  the  function. 

Corresponding  pairs  of  limits  may  also  be  defined  for  the  case  in  which 
the  functional  values  in  one  quadrant  only  are  taken  into  account,  the  func- 
tional values  Oii  the  axes  being  either  included  or  excluded,  in  case  they  exist, 
as  may  be  agreed  upon. 

The  saltus,  or  measure  of  discontinuity,  at  the  point  (a,  6)  is  measured  by 
the  excess  of  the  greatest  over  the  least  of  the  three  numbers 


/(a,  b),    lim    f(x,  y\     lim    f(x,  y}. 

x~a,y~b  x~a,y~b 

The  saltus  at  a  point  of  discontinuity  may  have  a  finite  value,  or  it  may  be 
indefinitely  great. 

In  case    lim   f(x,  y)  —    lim   f(x,  y},  their  common  value  may  be  denoted 

x~a,y~b  x~a,y~b 

by     lim   f(x,  y},  and  the  function  is  then  said  to  have  a  definite  double  limit 

x  ~  a,  y  ~  b 

at  the  point  (a,  b) ;  this  double  limit     lim   f(x,  y)  may  be  finite,  or  infinite 

x~a,  y  ~  b 

with  a  definite  sign. 

When  the  upper  and  lower  limits  have  different  values,  \imf(x,  y)  is 
frequently  regarded  as  existent  but  indeterminate,  the  upper  and  lower  limits 
being  regarded  as  its  limits  of  indeterminancy. 

303.  In  considering  the  functional  values  in  the  neighbourhood  of  a  point, 
and  the  functional  limits  at  the  point,  it  is  frequently  convenient  to  consider 
one  quadrant  only ;  this  we  may  take,  without  loss  of  generality,  to  be  the 
quadrant  in  which  x  —  a  > 0,  y  —  b>0.  The  results  which  will  be  established 
are  essentially  applicable  to  any  one  of  the  four  quadrants,  and  can  be 
immediately  extended  to  the  case  in  which  account  is  taken  of  the  whole 
neighbourhood  of  (a,  b),  by  taking  the  totality  of  the  results  for  the  four 
separate  quadrants,  and  for  the  lines  x  =  a,  y  =  b. 

Assuming  that  x—  a>  0,  y  —  b>0,  the  function  f(x,  y}  considered  as  a 
function  of  y  only,  with  x  constant,  has  two  functional  limits  f(x,  b  +  0), 

f(x,  b  +  0),  at    the  point    (x,    b};    these   may   be  denoted  by  lim  f(x,  y), 

y  ~  b 
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lira  f(x,  y}  respectively.    In  case  these  two  limits  are  identical,  their  common 

j~^b 

value  may  be  denoted  by  lira  f(x,  y\  the  functional  limit  /(ar,  b  +  0)  having 

y~b 

in  that  case  a  definite  value, 

If  either  of  the  limits  \\m-f  (x,  y),  lim/(#,  y)  is  to  be  taken  indifferently, 

y  ~b  y~b 

we  may  denote  them  by  lim/(a?,  y).  This  may  be  regarded  as  a  function  of  a, 

y^~b  __ 

such  that  its  value  at  the  point  (a,  b)  is  multiple-valued,  and  has  lira/  \x,  y), 

y~b 

lim  f(x,  y)  for  its  limits  of  indeterminancy. 
T^b 

It  may  happen  that  lim  f(x,  y),  considered  as  a  function  of  x,  has  a 

.      y^b 

definite  functional  limit  at  the  point  x  =  a  ;  this  limit  may  be  either  finite, 
or  infinite  with  fixed  sign.  In  case  such  a  limit  exists,  it  is  denoted  by 
lim  lim  f(x,  y),  and  is  said  to  be  the  repeated  limit  of  f(x,  y}  at  the  point 

x  ~  a  y  ~  b 

(a,  b),  the  order  of  the  limits  being,  that  the  limit  for  y  ~  b  is  taken  first,  and 
then  afterwards  the  limit  for  x  ^  a. 

In  case  this  repeated  limit  does  not  exist,  either  as  a  definite  number,  or 
as  infinite  with  fixed  sign,  we  may  regard  lim  lim  f(x,  y)  as  indeterminate,  its 

x  ~  a   y  <~  b 

limits  of  indeterminancy  being  lim  lim  f(x,  y),  and  lim  lim  f(x,  y}. 

X  ~.a  y  ~b  x  ~a   y  ~b 

The  repeated  limit  lim  lim/(a;,  y},  in  which  the  limit  with  respect  to  x 

y  ~b  x~  a 

is  first  taken,  and  afterwards  that  with  respect  to  y,  may  be  defined  in  a 
precisely  similar  manner. 

It  is  clear  that  the  functional  values  on  the  straight  lines  x  =  a,  y  =  b  are 
irrelevant  as  regards  the  existence,  or  the  values,  of  the  repeated  limits. 

In  case  the  double  limit     lim     f(x,  y),  for  x  >  a,  y  >  b,  exists  at  the  point 

x  ~  a,  y  ~  b 

(a,  b),  having  either  a  finite  value,  or  being  infinite  with  fixed  sign,  the 
existence  of  the  two  repeated  limits 

lim  lim  f(x,  y},    lim  lim  f(x,  y} 

x~  a  y  ~  i>  y  ~b  x~  a 

follows  as  a  consequence,  their  common  value  being     lim      f(x,  y}.     In  this 

x  ~  a,  y  ~  b 

case  lim/  {x,  <}>(x)}  also  exists,  and  is  equal  to  the  double,  limit,  where  $  (x} 

x~a 

is  any  function  of  x,  which   is   >  b,  and  is  such   that   lim   (f>(x)  =  b.     Also 

.!•  ~  (I 

lim  /{</>  (t),  ^r(t)}  exists,  and  is  equal  to  the  double  limit;  where  </>(£), 


t  ~  T 

are  functions  of  a  variable  t,  such  that  $  (t)  >a,-^r  (t)  >  b,  and  that  lim  0  (t)=a, 
lim  ^  (t)  =  b. 

t~r 
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The  converse  of  these  statements  does  not  hold  good.     In  particular,  the 
existence  of    lim    f(x,  y)  is  not  necessary  either  for  the  existence,  with 

x  ~  a,  y  ~  b 

definite  values,  or  for  the  equality,  of  the  two  repeated  limits 
lim  lim/(#,  y)\  lim  li 


EXAMPLES. 

1.    Let  f  (x,  y}  be  defined  for  the  positive  quadrant  by  /  (x,  y}  =  —  -.     We  find 

~~ 


lim  lim  f(x,  y)  =  l,  lim  lim  f(x,  y}  =  -  1  ;  thus      lim     f(x,  y}  cannot  exist. 

x~0  y~0  j/~0  x~0  a;~0,   j/~0 

<y>2  7/2 

2.  Let  ffay^rf-fr:?-—-™'     In  this  case  lim  lim  /(*»  ^  and  lim  lim  /(*.  ^ 

•*  y   +  (x~y)  x~0    y~Q  y~0    x~Q 

are  both  zero,  and  yet      lim     f(x,  y}  does  not  exist;  for  if  y  =  x,  f(x,  y}  =  \;  and  there- 

x~0,  y~0 

fore  lim  f(x,  x}  =  \. 

x~0 

3.  Let*  /(^P,  y)  be  defined  for  #>0,  3/>0,  by  the  expression  (x+y)  sin  -  sin  -.     In 

#        y 

this  case  lim   /(j?,  y)  =  ^sin  -,  lirn  /  (a?,  y)  =  —  x  sin  -  ,  and  lim  f(x,  y)  —  lim  f(x,y)  has 
i/~o  '  A>    ^o  v~o  ^5 

for  x=0  the  limit  zero.      We  have  then  lim  lim  f  (x,  ^/)  =  0,  since  #sin-,   —  .#  sin  - 

x~0    y~Q  '  J?  X 

have  each  the  limit  zero  for  #=0.      It  is  clear  that  lim  lim  f  (x,  y}  is  also  zero.      If 

,v~o  x~o 

0  <  x  <  \  6,  and  0  <  </  <  \  e,  we  see  that  |  /  (x,  y}  \  <  f  ,  and  therefore      lim     /  (x,  y)  exists, 

^~0,    y~Q 

and  is  equal  to  zero. 


304.  An  important  matter  for  investigation  is  the  determination  of  the 
necessary  and  sufficient  conditions  for  the  existence  and  equality  of  the  two 
repeated  limits  at  a  point.  A  knowledge  of  such  conditions,  as  also  of 
sufficient  conditions,  is  required  in  various  fundamental  theorems  of  analysis 
which  turn  upon  the  legitimacy  of  inverting  the  order  of  a  repeated  limiting 
process. 

It  will  be  observed  that  the  existence  of  lim  lim  f(x,  y}  does  not  neces- 

.c  ~  a  y  ~  b 

sarily   involve    the    existence   of  lim  f(x,  y}   as  a  definite  number,  since 

y~b 

lim  lim  f(x,  y),  lim  \irnf  (x,  y)  may  both   exist  and  have  the  same  value, 

x~a y ~b  x~ay~b 

without  it  being  necessarily  the  case  that  lim  f(x,  y},  lirn  f(x,  y)  are  identical. 

y ~ b  y ~b 

It  is  however  necessary  that  limy  (a;,  y)—  lim  f(x,  y)  should  converge  to  the 

y ~b  y  ~b 

limit  zero,  as  x  converges  to  the  value  a. 

*  Pringsheim,  Encyklopadie  der  Math.  Wissensch.,  IIA.  1,  p.  51. 
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The  necessary  and  sufficient  conditions  required  are  contained  in  the 
following  general  theorem: 

In  order  that  the  repeated  limits  lim  lim  f(x,  y},  lim  limf(x,  y)  may  both 

x~ay~b  y~bx~a 

exist,  and  have  the  same  finite  value,  it  is  necessary  and  sufficient,  (1)  that 
limf(x,  y)  —  limf(x,  y)  should  have  the  limit  zero,  for  x~a,  and  that 

y  ~  6  y~  b 

Lim  f(x,  y}  —  limf(x,  y}  should  have  the  limit  zero,  for  y  ~  b;  and  (2)  that, 

x~  a  x~  a 

corresponding  to  any  fixed  positive  number  e  arbitrarily  chosen,  a  positive  number 
/3  can  be  determined,  satisfying  the  condition  that,  for  each  value  of  y  interior  to 
the  interval  (b,  b  + 13),  a  positive  number  ay,  in  general  dependent  on  y,  exists, 

such   that,  for   this   value   of   y,  f(x,  y)    lies   between   lim  f(x,   y)  +  e   and 

v  ~b 

lim,  f  (x,  y)  —  e,for  all  values  of  x  interior  to  the  interval  (a,  a  +  ay}. 

y  ~b 

Let  us  first  assume  that  the  conditions  stated  in'  the  theorem  are  satisfied. 
A  value  of  y  may,  in  virtue  of  (1),  be  so  chosen  that  the  difference  of  the  two 
limits  lim/(a¥,  y\  lim  f(ac,  y)  is  less  than  an  arbitrarily  chosen  number  77; 

.r~a  x~  a 

and  this  value  of  y  may  also  be  so  chosen  that  it  is  interior  to  (b,  b  +  /3).  For 
this  fixed  value  of  y,  an  interval  (a,  a  +  or,/),  for  x,  may  be  so  chosen  that 
f{x,  y)  lies  between  lim /(.?;,  y)  +  e  and  \irnf  (x,  y)  —  e,  provided  y  has 

x ~  a  x~n 

the  fixed  value,  and  a<x<a  +  ay':  this  follows  from  the  definition 
of  the  upper  and  lower  limits.  Again,  from  the  condition  (1),  a  number 
a."  can  be  determined,  such  that,  if  x  be  interior  to  the  interval  (a,  a  +  a"), 
the  difference  between  the  two  limits  lim /"(&•,  y),  lim  f(x,  y}  is  less  than  77, 

y  ~b  y  ~b 

Now  lefc  Oy  be  the  smallest  of  the  three  numbers  ay,  ay',  a"',  then,  if  a;1}  x» 

be  any  two  values  of  x  within  the  interval  (a,  a  +  ajj),  and  y  have  the  fixed 
value;  by  applying  the  conditions  of  the  theorem,  we  see  that  the  conditions 

\f(*i>y)-/(**>y)\  <*!  +  %*> 


/(a-!,  7/)-lim/(^,  y) 


O2,  y)-  lim/0?2) 


y~b 

are  all  satisfied.    It  follows  that 


<  rj  +  e, 


m/X^j,  y)  —  lim  f(xz,  y}  \  <  3?;  +  4e 

6  y~b 

for  every  pair  of  points  x1}  x.,  within  the  interval  (a,  a  +  6^).  Hence,  since  s,  77 

are  both  arbitrarily  small,  lim/(#,  y)  converges,  for  x~a,  to  a  definite  value 

y~b 

H.  25 
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which  is  the  limit  both  of  lim  f(x,  y)  and  of  lim  f(x,  y),  when  x~a;  and 
thus  lim  lim  f(x,  y)  exists. 

x~a  y~b 

Again,  since  lim  lim/(#,  y)  has  a  definite  value,  an  interval  (a,  a+  B)  can 

x~a   y~b 

be  determined,  such  that,  for  any  point  x  interior  to  it, 


lim  li 

x~a  y~ 


,  y)  —  lim  f(x,  y) 


<  €. 


Now  lim  lim/ (x,  y)  —  lim  f(x,  y)  is  the  sum  of  the  three  differences 

x~a  y~h  ^ 

lim  lim  f(x,  y)  -  lim  f(x,  y}, 

x~a  y~b  y^b 

Sm /(*,  y)  -/  O> y)>  f(x>  y}  - lim  f(x>  y\ 

y~b  x~a 

and  for  a  fixed  y,  chosen  as  before,  x  may  be  chosen  so  that  it  not  only  lies 
within  the  interval  (a,  a  +  B),  but  is  also  such  that 


f(x,  y}  -  Um/(«,  y}   ,     f(x,  y)  -  lim/ (a?,  y)  | 

y~b  x~a  I 


are  each  less  than  17  +  2e.    It  follows  that 


lim  li 

x~a  y~b 


e 


v,  y}  —  lim/(#,  y} 
and  thus  that  lim  f(x,  y}  converges,   as   y  converges   to   b,  to  the  limit 


lim  lim/ (x,  y}.    It  has  thus  been  shewn  that  the  two  repeated  limits  both 

x~a  y~b 

exist,  and  have  the  same  value. 

Conversely,  let  us  assume  that  the  repeated  limits  both  exist,  and  are 
finite  and  equal.    We  have  then    lim  lim/(#,  y)  -  lim  f(oc,  y)  j  <  £  provided 

x~a  y~b  -^  | 

y  lies  between  b  and  b  +  /3,  where   (3  is  some  fixed  number,   £  being  an 
arbitrarily  chosen  positive  number ;  from  this  it  follows  that 

lim/O,  y)-\\mf(x,y} 

x~a  x~a 

is  <  2f,  for  6  <  y  <  b  +  /3.    Also  . 

lim  f(oc,  y)  —  lim  li 


provided  a;  lies  within  some  fixed  interval  (a,  a  +  B') ;  and  from  this  it  follows 
that  lim  f(x,  y}  —  lim  f(x,  y)  is  <  2£,  for  a  <  x  <  a  +  8'.    Since  £  is  arbitrarily 

small,  we  now  see  that  the  condition  (1)  of  the  theorem  is  satisfied.    Further 
we  see  that 

/O,  y)  -  lim  f(x,  y) 
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where  f  is  any  arbitrarily  chosen  positive  number,  provided  x  lies  within 
some  interval  (a,  a  +  ay'),  where  ay  depends  upon  y,  and  may  diminish  in- 
definitely as  y  approaches  the  value  b.  It  follows  from  the  three  inequalities, 

that 

.-, . .  ^ 


f(x,  y}  -  lim /(a?,  y) 

y~b 

provided  b<y<b  +  /3,  and  provided  also  x  lies  within  some  interval  (a,  a  +  ay), 
where  Oy  depends  in  general  upon  y.  Since  £  and  £'  are  both  arbitrarily  small, 
it  follows  that  the  condition  (2)  of  the  theorem  is  satisfied. 

If  the  condition  (2),  in  the  above  general  theorem,  be  replaced  by  the 
more  stringent  condition  that,  corresponding  to  any  fixed  positive  number  e, 
arbitrarily  chosen,  a  positive  number  /3  can  be  determined,  which  is  such 
that,  for  each  value  of  y  interior  to  the  interval  (6,  b  +  /3),  a  positive  number 
Oy,  dependent  on  y,  exists,  such  that,  for  this  value  of  y,  and  for  all  smaller 
values,  f  (x,  y}  lies  between  Hm"  /  (x,  y)  +  e  and  Km/ (a?,  y)  —  e,  then  this 

y~b  y~b 

condition  and  the  condition  (1)  are  the  necessary  and  sufficient  conditions 
that  not  only  lim  lim/(#,  y},  lira  lim/(a?,  y)  exist  and  are  equal,  but  also 

x~a   y~b  y~b  x~a 

that  the  double  limit      lim     f(x,  y}  exists,  having  a  definite  value,  the 

x~a,   y~b 

same  as  the  repeated  limits.  In  case  the  function  be  defined  for  values  of 
so,  y  on  the  lines  x  =  a,  y  =  b,  the  additional  conditions  must  be  added  that 
the  functional  values  on  these  lines  also  converge  to  the  same  limit 

lim      f(x,  y}. 

x~a,  y~b 

For,  under  the  conditions  stated,  we  have,  provided  y  lies  within  the 
interval  (6,  6  +  &),  where  &  <  ft, 

f(x,  y)- lira /(a?,  y)    <  e  +  77, 

y~b 

where  x  has  any  value  in  the  interval  (a,  a  +  £),  f  being  the  lesser  of  the  two 
numbers  0/3,  and  S'  ;  the  number  8'  being  so  chosen  that 


lim/O,  y)-  lim f(x,  y} 


y~b 


y~h 


<  77,  for  a  <  x  <  a  +  B'. 


Also 


lim/ (x,  y)  —  Um  lim/(#,  y}   <  e,  provided  x  lies  within  an  interval 


chosen  sufficiently  small.    Hence  the  condition 

f(x>  y}  ~  lim  Km /(a,  y)   <  2e  +  77 

x~a  y~b 

is  satisfied,  provided  b  <  y  <  b  +  /31}  and  provided  x  lies  within  an  interval  of 
which  the  length  may  depend  upon  e  and  77.  It  follows,  since  e,  77  are 
arbitrarily  small,  that  f(x,  y}  has  a  definite  double  limit  at  the  point  (a,  b). 
That  the  conditions  stated  are  necessary,  follows  at  once  from  the  definition 
of  lim  f(x,  y}. 


x~a,  y~b 


05 2 
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305.   The  theorem  obtained  in  §  304  may  be  simplified  in  the  case  in  which 
]imy(#,  y},  lim/(#,  y)  both  have  definite  values  at  all  points  on  the  straight 

y~b  x~a 

lines  x  =  a,  y  =  b  which  are  in  sufficiently  small  neighbourhoods  of  the  point 
(a,  b).    We  may  then  state  the  theorem  as  follows: 

If  lim,f(x,  y},  lim  f  (x,  y)  have  definite  finite  values  in  the  neighbourhood 

y~b  x~a 

of  the  point  (a,  b),  then  the  necessary  and  sufficient  condition  that  the  two  repeated 
limits  lim  Urn  f  (x,  y),  lim  Urn  f  (x,  y}  may  both  exist,  and  have  the  same 


~b    x~a 


finite  value,  is  that,  corresponding  to  any  fixed  positive  number  e,  arbitrarily 
chosen,  a  positive  number  /3  can  be  determined,  which  is  such  that,  for  each 
value  of  y  interior  to  the  interval  (b,  6  +  /3),  a  positive  number  ay,  in  general 
dependent  on  y,  exists,  such  that,  for  this  value  of  y,  f(x,  y}  —  limf(x,  y}  <  e, 

y~b 

for  all  values  of  x  within  the  interval  (a,  a  +  ay). 


In  case  the  condition 


f(x,  y)  —  lim/(#,  y}    <  e,  for  all  values  of  x  within 


(a,  a  +  ay),  be  satisfied,  not  only  for  the  particular  value  of  y,  but  for  all  smaller 
values,  and  this  hold  for  every  e,  then  the  double  limit  lim  f(x.  y}  exists, 

x~a,  y~b 

and  is  equal  to  each  of  the  repeated  limits.  In  this  case  the  point  (a,  b) 
is  said  to  be  a  point  of  uniform  convergence  of  the  function  f  (SB,  y}  to  the 
limit  \imf(x,  y\  with  respect  to  the  parameter  x\  and  thus,  for  such  a  point, 

7/~5 

there  exists,  for  each  value  of  e,  an  interval  (a,  a  +  a),  where  a  depends  in 
general  upon  e,  such  that,  for  each  value  of  x  within  this  interval,  the 
f(w,  y}  —  limy  (x,  y)  \  <  e  is  satisfied,  provided  y  be  less  than  some 


condition 

y~b 

fixed  value  which  is  the  same  for  the  whole  ^-interval  (a,  a  +  a). 

It  may  happen  that,  as  e  is  indefinitely  diminished,  a  has  a  positive 
minimum  a.  In  that  case  the  fixed  interval  (a,  a  +  a)  is  such  that,  for  each 

e,  the  condition   \f(x,  y)  —  \imf(sc,  y}  \  <  e  is  satisfied   for  all  values   of  x 

y~b 

within  the  fixed  interval  (a,  a  +  a),  provided  y  is  less  than  some  fixed  value 
dependent  on  e,  the  same  for  the  whole  ^-interval.  In  this  case  f(x,  y}  is 
said  to  converge  to  lim  f(x,  y)  uniformly  within  the  interval  (a,  a  +  a),  with 

y~b 

respect  to  the  parameter  x.  Not  only  the  point  (a,  b),  but  also  each  interior 
point  of  the  interval  (a,  a  +  a),  is  then  a  point  of  uniform  convergence  of 
f(x,  y}  to  lim  f(x,  y},  with  respect  to  the  parameter  x. 

y~b 

306.  The  necessary  and  sufficient  conditions  for  the  existence  and  equality 
of  the  two  repeated  limits  of  f(x,  y),  at  (a,  b\  may  be  put  into  the  following 
form,  different  from  that  of  the  theorem  of  §  304. 

The  necessary  and  sufficient  conditions  that  lim  limf(x,  y}  =  lim  limf(x,  y}, 

,T~rt  y~b  y~b  ,r~« 

their  value  being  finite,  are  (1)  that  lim  f  (x,  y)  converge  to  a  definite  value 
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.  lim  Urn  f(x,  y),  when  y  converges  to  b,  and  that  Urn  f  (x,  y}  —  Urn  f(x,  y)  con- 

y~b  x~a  y~b  J^j; 

verge  to  zero,  for  x~a;  and  (2)  that,  corresponding  to  any  arbitrarily  chosen 
positive  number  e,  and  to  an  arbitrarily  chosen  value  b  +  (30,  of  y,  a  value 
yl<b  +  fto,  of  y,  can  be  found,  and  also  a  positive  number  a,  such  that  the 
condition  that  f(xf  y^  lies  between 

limf(x,  y)  +  €,  and   limf(x,  y)  —  e 
v~b  -y^b 

is  satisfied  for  every  value  of  x  within  the  interval  (a,  a  +  a). 

In  case  limf(x,  y)  everywhere  exists  in  the  neighbourhood  of  x  =  a,  the 

y~b 

condition  (2)  is  that    f(x,  y^  —  limf(x,  y)  \  <  e,  for  every  value  of  x  within 

y~b  | 

the  interval  (a,  a  +  a). 

That  the  conditions  contained  in  the  theorem  are  necessary,  is  seen  from 
the  theorem  of  §  304;  it  will  be  shewn  that  they  are  sufficient.  Let  us  assume 
that  the  conditions  are  satisfied.  We  have 

,    ]\ 

\_y~b  J 


y}  ~  li™  lim/(a>,  y)  =  [Km  f(x,  y)  -f(x,  y] 

y~b  V~b  z~«  \_y~b 

+  f/O>  y}  ~  Jim  /(«,  y)]  +  flim  /(a?,  y)  -  lim  lim/O,  y)~\  . 

[_  x~a  J         \_x~a  y~b  *~«  J 


A  positive  number  01  can  now  be  chosen,  such  that,  if  b<y<b  +  @lt  the 
condition     \imf(x,  y)  —  lim  \imf(x,  y)   <  e  is  satisfied;  moreover  we  may 

x~a  V~b    x~a 

choose  $!  so  that  it  is  <  /?u. 

Next,  a  value  ylf  of  y,  exists,  such  that  f(x,  y^  lies  between  \imf(x,  y)  +  e, 

and  \irnf  (x,  y}  —  e,  provided  x  be  within  the  interval  (a,  a  +  a):  the  value  of 

y~b 
/30  may  be  chosen  so  small  that  \imf(x,  y)  —  lim  f(x,  y}  <  e,  for  every  value 

x~a  x~a 

of  y  which  is  <  b  +  j30,  and  therefore  for  the  value  y1}  of  y.   Again,  an  interval 
for  x,  possibly  less  than  (a,  a  +  a),  can  be  so  chosen  that 


lim  f(x,  y)  -Jim  f(x,  y)  <  e, 

y~b  y~b 

provided  x  lies  within  the  interval.    It  follows  that  an  interval  (a,  a  +  a.'},  for 
.x,  can  be  found,  such  that  j  \imf(n>)y}—f(x,y1)   <  3e.    Further,  the  interval 

I  u~b 

within  which  x  lies  may,  if  necessary,  be  so  restricted  that 

'(x,  Wj)  — lim  f(x,  y 

V7t7i/  •/     \      '  «/ 

x~a 
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Hence,  provided  x  lies  within  a  definite  interval,  we  see  that 


lira  /  (x,  y)  —  Km  Km  f(x,  y)   <  6e ; 


and  since  this  condition  holds  for  an  arbitrary  e,  it  follows  that  Km  f(x,  y} 

y~b 

converges,  for  x  ~  a,  to  Km  Km  f(x,  y)]  and  thus  the  sufficiency  of  the  conditions 

y~b x~a 

is  established. 

THE  LIMITS  OF  MONOTONE  FUNCTIONS  OF  TWO  VARIABLES. 

307.  A  monotone  function  f(x,  y),  of  two  variables  x  and  y,  was  defined  in 
§  253  as  a  function  such  that/(X,  y'}  ^  f(x,  y),  or  such  that/(V,  y'}  ^  f(x,  y\  for 
every  pair  of  points  (x,  y},  (x',  y'},  such  that  x'  =  #,?/'  =  y.  The  following  theorem, 
as  regards  the  limits,  at  a  point,  of  such  a  function,  will  be  established. 

Iff(x,  y)  is  a  monotone  function ,  in  each  of  the  four  open  quadrants, 
x  +  h  >x,  y  +  k>y;  x  +  h  > x,  y  —  k<  y;  x  —  h<x,y  +  k>y;  x  —  h<x,  y  —  k<y, 
the  double  limit  and  the  two  repeated  limits  of  the  function,  as  h  <%»  0,  k  ~  0,  all 
exist,  and  have  the  same  value. 

It  will  be  observed  that  all  these  limits  are  independent  of  the  values  of 
the  function  on  the  straight  lines  through  (x,  y)  parallel  to  the  axes. 

We  consider  the  case  in  which  f  (x ,  y'}  =  f(x,  y},  when  x  =  x,  y'  =  y ;  the 
other  case  can  be  treated  in  the  same  way. 

Let  fa  (x,  y}  denote  Km  f(x,  y  +  k),  and  let  fa  (x,  y)  denote  Km  f(x,  y  —  k), 

k~0  k~Q 

at  any  point  (x,  y}. 

We  have  then  Km  Km  f(x  +  h,  y  +  k)  =  Km  fa  (x  +  h,  y)  =  fa  (x  +  0,  y}. 

h~()  A-~0  h~0 

A  value  /«!,  of  h,  can  be  so  determined  that  fa  (x  4-  hly  y}  —  fa  (x  +  0,  y}  is 
<  |e,  where  e  is  an  arbitrarily  chosen  positive  number;  a  value  kl}  of  k,  can 
then  be  so  determined  that  f(x  +  hlt  y  +  kl)  —  fa(x  +  h1}  y)<%e.  We  have 
then  0  ^f(x  +  hly  y  +  &a)  —  fa  (x  +  0,  y}  <  e;  and,  from  the  property  that  the 
function  is  monotone,  we  see  that  0  =f(x  +  h,  y  +  k}  —  fa  (x  +  0,  y)  <  e,  for  all 
values  of  h,  k  such  that  0  <  h  ^  h^,  0  <  k  ^  k-^.  It  follows,  since  e  is  arbitrarily 
small,  that  the  double  limit  of  f  (x  +  h,  y  +  k),  as/i~0,  &~0,  is  equal  to 
0!  (x  +  0,  y} ;  and  therefore  the  double  limit  and  the  two  repeated  limits  have 
the  same  value  fa  (x  +  Q,  y}.  Thus  the  theorem  is  proved  for  the  case  of  the 
quadrant  in  which  the  values  of  the  coordinates  are  greater  than  x,  and  greater 
than  y,  respectively. 

Next  consider  the  quadrant  which  is  below  the  parallel  to  the  #-axis,  and 
on  the  right  of  the  parallel  to  the  y-axis.  In  this  case  h^  can  be  so  determined 
that  0  ^  fa  (x  +  AI  ,  y}  —  fa  (x  +  0,  y)  <  e ;  and  k^  can  be  so  determined  that 
0  ^  fa(x  +  hl}  y)—f(x  +  hl,  y  —  k1)<e',  it  then  follows  that/(/c  + /h,  y  —  kj}  is 
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between  the  two  numbers  <£2(#  +  0,  y)  ±  e,  that  is,  between  the  two  numbers 
lim  lim  f(x  +  h,y  —  k)±€.    In  the  same  manner,  it  can  be  shewn  that  A/,  &/ 


can  be  so  determined  that  f(x  +  h}',  y  —  &/)  lies  between  the  two  numbers 

lim  lim/(#+  ft,  y  —  k)  +  e.    The  numbers  A,,  fcj,  A/,  A;/  can  be  so  chosen  that 

.:  ~n  //~d 

A-/  >  kl}  A/  <  hlt  and  thus  that/(#  +  AJ,  y  -  kj  =f(x  +  A/,  y  —  A;/).    From  this 

it  follows  that  lim  lim  f(x  +  h,y  —  k)<  lim  lim  f(x  +  A,  y  —  k)  +  2e,  or  that 
A~o  &~o  fc~o  fc~o 

lim  lim/(a;  +  h,y  —  k)£  lim  lim/(#  +  A,  y  —  A;),  since  e  is  arbitrary.    By  choosing 

A~0  i~0  &~0  &~0 

the  numbers  A,,  A?!,  A/,  A;/  so  that  ^  >  A;/,  At  <  A/,  it  could  be  shewn  that,  in  the 
inequality,  ^  would  be  changed  into  S.  It  follows  that  the  two  repeated  limits 
are  equal. 

Again,  A2,  k2  can  be  so  chosen  that  <£2  (#  +  A2,  ?/)  —  <£2  (#  +  0,  y)  <  ^e,  and  that 
lim  lim  f(x  +  A,  y  -  k)  -  lim  /  (x  +  A,  y  -  k2)  <  %  e.  It  follows,  since  <f>2  (x  +  0,  y} 

Jc~0  h~0  h~Q 

is  equal  to  the  repeated  limit,  that  <j>.2  (x  +  h2>  y}  -  lim/(«  +  h,y  —  k2)  <  e. 

h~0 

At  any  point  (x  +  h,  y  —  k\  such  that  0  <  A  <  A2,  0  <  k  <  k2,  f(x  +  h,  y  —  k) 
lies  between  62(x  +  h2,  y)  and  lim/(#  +  A,  y  —  k2),  and  also  the  repeated  limit 

7t~0 

lies  between  the  same  two  numbers.  Therefore  f(x  4-  A,  y  —  k)  differs  from  the 
repeated  limit  by  less  than  e.  It  follows  that  the  double  limit  in  the  quadrant 
exists,  and  is  equal  to  the  repeated  limits.  The  cases  of  the  other  two  quadrants 
can  be  treated  in  a  similar  manner. 

308.  We  now  consider  the  functions  which  have  been  defined  in  §  255, 
and  named  quasi-monotone  functions.  If  f(x,  y)  be  such  a  function,  the 
function  F  (x,  y)  defined  by  F  (x,  y}  =/(#,  y)  —  /(#,  &)  -/(«,  y)  +/(«•>  b)  is  a 
monotone  function.  It  follows  at  once  from  the  property  of  F  (x,  y}  established 
in  §  307,  that  in  each  of  the  four  open  quadrants,  the  double  and  repeated 
limits  off(x,  y)  exist  and  have  the  same  value.  For  f(x,  b),f(a,  y)  are  mono- 
tone functions  of  the  variables  x,  y  respectively  ;  each  being  non-diminishing, 
or  non-increasing,  according  to  the  type  of  the  function  ;  and  in  any  case  each 
of  them  has  a  single  limit  which  is  identical  with  its  double  limit. 

For  a  quasi-monotone  function,  the  following  theorem*  relating  to  the 
points  of  discontinuity  of  the  function,  will  be  established: 

If  f(x,  y)  be  a  quasi-monotone  function,  there  exist  two  enumerable  sets  of 
straight  lines  parallel  to  the  x-axis  and  the  y-axis,  respectively,  such  that  every 
point  of  discontinuity  of  the  function,  with  respect  to  (x,  y},  lies  on  a  straight 
line  belonging  to  one  or  other  of  the  enumerable  sets. 

Let  <f>i(x,y),  <j>z(ic,  y)  be  defined  as  in  §  307;  then,  iif(x,y)  be  monotone 

*  See  W.  H.  Young,  Proc.  Royal  Soc.,  vol.  xcm,  p.  25  where  these  functions  are  termed 
monotone  functions. 
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non-diminishing  with  respect  to  x,  for  each  constant  value  of  y,  and  also  mono- 

tone non-diminishing  with  respect  to  y,  for  each  constant  value  of  x,  we  have 

fa  (x  +  0,  y)  ^f-(x  +  0,  y)  2:  &  O  +  0,  y)  ^/(#,  y  -  0)  2:  <£2  (a?  -  0,  y), 


For  a  constant  value  a,  of  #,  the  discontinuities  off(x,  y)  with  respect  to  y  are 
at  points  of  an  enumerable  set  on  the  straight  line  x  =  a.  Let  a  be  the  value 
of  x  on  the  right-hand  boundary  of  the  cell  in  which  the  function  is  denned. 
If  x  =  a,  y  =  j3  be  a  point  at  which  f(a,  y)  is  continuous  with  respect  to  y,  then 
f(x,  y)  is  continuous  with  respect  to  y,  at  y  =  /3,  for  each  value  of  x  in  the  cell.  For, 
let  (/3  —  k,  j3  +  k)  be  a  neighbourhood  of  /3,  such  that  /(a,  j3  +  k)  —/(a,  /3  —  &)  <  e, 
then,  from  the  definition  of  /(#,  y),  we  see  that  f(as,  ft  +  k)  —f(x,  p  —  k)<  e. 
It  then  easily  follows  that/(#,  y)  is  continuous  with  respect  to  y,  for  y  =  /?,  for  any 
fixed  value  of  #.  For  such  a  value  of  y,  as  ft,  we  have  fa  (x,  y)  =  fa  (#,  y),  for  every 
value  of  x.  This  equality  therefore  holds  for  all  values  of  y  which  do  not  belong 
to  a  certain  enumerable  set.  Again  fa  (x,  y)  is  a  monotone  non-diminishing 
function  of  x,  for  each  value  of  y;  and,  as  before,  we  see  that  it  is  a  continuous 
function  of  x,  for  all  values  of  x  not  belonging  to  a  certain  enumerable  set,  for 
each  value  of  y. 

If  (x,  y)  is  any  point  not  on  either  set  of  straight  lines  parallel  to  the  axis, 
defined  by  the  exceptional  values  of  x  and  of  y,  indicated  above,  we  have 
fa  (x  +  0,  y)  =  fa  (x  —  0,  y)  ;  and  then,  employing  the  inequalities  given  above, 
we  see  that  the  four  limits,  at  (x,  y),  of  the  function,  in  the  open  quadrants, 
and  the  four  limits  at  (x,  y),  dependent  on  the  values  of  the  function  on  the 
parallels  to  the  axes,  through  the  point,  are  all  equal  ;  and  therefore  (x,  y)  is 
a  point  of  continuity  of  the  function. 

In  the  case  in  which  f(x,  y)  is  a  non-diminishing  function  of  y,  for  each 
constant  value  of  x,  and  in  which  y  (a-,  y)  is  a  non-increasing  function  of  x,  for 
each  value  of  y  ;  the  inequalities  employed  above  can  be  replaced  by 

fa  (x  +  0,  y)  £f(x,  y-0)^fa(x-Q,  y)  </(*  -  0,  y)  =§  fa  (x  -  0,  y), 

fa  (x  +  0,  y)  £f(x  +  0,  y)  ^  fa  (x  +  0,  y)  £f(x,  y  +  0)^fa(x-0,  y\ 

and  a  similar  argument  to  that  above  will  shew  that  fa  (x  +  0,  y)  =  fa  (x  —  0,  y) 

at  every  point  (x,  y)  not  on  either  of  two  enumerable  sets  of  lines  parallel  to  the 

axes.  The  cases  of  functions  of  the  other  types  can  be  treated  in  a  similar  manner. 

PARTIAL  DIFFERENTIAL  COEFFICIENTS. 
309.    If,  at  a  point  (x0,  y0),  in  the  domain  for  which  the  function  f(x,  y}  is 

defined,  the  limit  lim-^0  +  h'  yA~f(x^  ^  exists,  having  either  a  definite 

h~o  « 

finite  value,  or  being  indefinitely  great,  but  of  fixed  sign,  this  limit  is  said  to 
be  the  partial  differential  coefficient  of  f  (x,  y},  at  (x0,  y0),  with  respect  to  x; 

it  is  usually  denoted  by  df^>^. 
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When  the  limit  lim^0>  -«'        existgj  it  is  said  to  be  the 

k~Q  K 

partial  differential  coefficient  of  f(x,  y),  at  (#„>  y0),  with  respect  to  y\  it  is 
denoted  by  §%*>. 

fyo 

In  general,  h,  k,  in  these  definitions,  are  regarded  as  having  either  sign. 
It  is  possible  that  either  of  the  above  limits  may  not  exist,  but  that  there 
may  be  two  definite  limits,  one  for  positive  values  of  the  increment  h  or  k, 
and  the  other  for  negative  values.  In  that  case  the  two  limits  are  said  to  be 
the  progressive  and  regressive  partial  differential  coefficients  with  respect  to 
the  particular  variable.  It  is  of  course  possible  that,  at  a  particular  point, 
one  of  these  may  exist,  and  not  the  other. 

rlf          7\f 

That  the  two  partial  differential  coefficients  J-.    J—  may  exist,  it  is 

dx0     dy0 

necessary,  but  not  sufficient,  that  f(x,  y}  should,  at   the  point  (x0,  y0),  be 
continuous  with  respect  to  x,  and  also  with  respect  to  y. 

To  express  the  increment  f(x0  +  h,y0  +  k)  —f(xQ,  y0),  of  the  function  /(#,  y), 
when  the  two  numbers  XQ,  y0  receive  increments  h,  k  respectively,  we  have 

,  y0  +  k)  -/Oo,  2/o)  =  [/Oo  +  h,y0  +  k)  -f(x0,  y,  +  k)] 


r 

If  we  now  assume  that  ~  exists  at  the  point  (#„,  2/0),  and  has  a  finite 

J 


value,  we  have 


;0,y0  0,y0 

—  =    —  ~ —   — h  cr  (A;), 


k 

c/  " 

where  cr  (k)  converges  to  the  limit  zero,  when  k  is  indefinitely  diminished. 
Again,  —  — y — ^-  -  converges  to  the  limit  ~  ,  when  k 

/ 1  QOCft 

is  first  diminished  to  the  limit  zero,  and  afterwards  h  converges  to  zero,  it 

*""  J? 

being  assumed  that  ~-  has  a  definite  value,  and  also  that  f(x,  y}  is  con- 
tinuous with  respect  to  y,  for  the  value  y  =  y0,  where  a;  has  any  value  in  a 
neighbourhood  of  x0.  In  order,  however,  that  the  double  limit 

f(x0  +  h,  y0  +  k)  —f(x0, 7/0  4-  k) 

i-\  /• 
may  exist,  in  which  case  its  value  is  J-  ,  being  independent  of  the  mode  in 

which  h,  k  approach  their  limits,  it  is  necessary  and  sufficient  that 

/(#„  +  h,  y0  +  k)  —f(x0,  ?/o  +  k) 

h 
should  be  a  continuous  function  of  (h,  k)  at  the  point  h  =  0,  k  =  0. 


394  Functions  of  a  real  variable  [OH.  v 

If  this  condition  be  satisfied,  positive  numbers  h1}  k^  can  be  determined, 
such  that  fl      .  ,        .  ,.„       ,  ,,      g". 


h 

where  rj  is  a  prescribed  positive  number,  and  Q<\h\£*hlt  0 £ 

We  have  now 

f  (x0  +  h,y0  +  k)—f  (x0,  y0  +  k)  _  df 

fi>  GOCn 

where  p  (h,  k)  converges  to  zero,  independently  of  the  mode  in  which  h,  k 
converge  to  zero. 

Under  the  conditions  stated,  we  have 

a/         zf 

•f  ( ff     _L  /)      ft!     _L    l'\  f  ( T        II 

J  \    o   i   ">  yo  ~  iv )      j  \^o>  yo. 


,  - 

ox0        o 

where  p,  a-  converge  to  zero,  when  h  and  k  are  indefinitely  diminished, 
independently  of  the  mode  in  which  they  approach  their  limits.  This  is 
equivalent  to  the  statement  that,  corresponding  to  an  arbitrarily  assigned 
positive  number  rj,  positive  numbers  A,,  ^  can  be  determined  so  that  \p  and 
are  each  <  rj,  for  all  values  of  h  and  k  such  that  \  h  \  <  /i1}  k  J  <  k^. 

In  the  notation  of  differentials,  denoting  f(xQ,  y0)  by  z0,  we  have 

,         dz    ,       dz    , 


the  expression  on  the  right-hand  side  being  termed  the  total  differential  of  z 

Cl  "~\ 

at  the  point  (x,  y),  and        dx,  ^—  dy  the  partial  differentials.    In  accordance 
ox0       oy0 

with  the  arithmetical  theory,  this  equation  can  only  be  regarded  as  a  con- 
veniently abridged  form  of  the  result  obtained  in  the  present  discussion. 

The  expression  h  ~  +  k~-  may  be  spoken  of  as  the  first  differential  of 

f(x,  y]  at  (x0,  ?/0),  and  may  be  denoted  by  8(1'/. 

The  theorem  obtained  may  be  stated  as  follows  : 

That  the  increment  of  a  function  f(x0,  ya)  when  x0,  y0  are  changed  into  x0  +  h, 

yQ  +  k  may  be  h  •       °'  *     +k  —   °'       +  hp  +  ka~,  where  p,  a  converge  to  zero,  when 
o.r0  oy$ 

h,  k  are  indefinitely  diminished,  independently  of  the  mode  in  which  they  are 

f}  I   (  IT       77    i       \lt    \  IT       7/1 

diminished,  it  is  sufficient  (1)  that  •L^-ll^  }  --A:"1       have  definite  finite  values, 

ox0  oy0 

7  /o\  n    .<  f(xo  +  h,  y0  +  k)—  f(x0,  Vo  +  k)  ,  .  ,        .         ,  ,    7. 

and  (2)  that  J—  --,  —  —  —  be  a  continuous  function  of  (h,  k)  at 

fi 

the  point  h  =  0,  k  =  0. 

It  will  be  observed  that  no  assumption  has  been  made  that  ~  ,  4~  have 

dx    dy 

definite  values  except  at  the  point  (x0,  yn)  itself. 
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It  follows  from  the  definition  that  it  is  necessary  for  the  existence  of  a  total 
differential  coefficient  at  (#0,  2/o)  that  the  function  be  continuous  with  respect 
to  (x,  y),  at  (a?0,  y0). 

fif 
If  it  be  assumed  that  ~  has  a  definite  value  at  (x0,  y\  for  all  values  of  y 

in  some  neighbourhood  of  ?/0,  the  condition  (2)  may  be  expressed  in  the  form 

?if 
that  (a),  J-,  for  x  =  x0,  must  be  a  continuous  function  of  y,  at  y  =  y0'>  and 

ooc 

that  (b),  the  point  h  =  0,  k  =  0  must  be  a  point  of  uniform  convergence  of  the 

f      i-       f(xo  +  h,y0  +  k)-f(x0,y0  +  k)  .,       ,  .  ,.        . 

function  •'—  —  ^  —  ^  —  ,  considered  as  a  function  of  /t,  to  its 

limit  for  h^Q,  with  k  as  a  parameter,  in  accordance  with  the  definition  of 
such  a  point  of  uniform  convergence  given  in  §  305. 

?if 
If  it  be  assumed  that  ~-  exists,  not  merely  at  the  point  (x0,  y0},  but  at  all 

Q3C 

points  in  a  sufficiently  small  two-dimensional  neighbourhood  of  the  point, 
the  conditions  contained  in  the  theorem  may  be  simplified.  For  we  have,  in 
that  case, 


(  > 

7)  T  Inp          nt    \ 

where  6  is  such  that  0  <  6  <  1  ;  and  this  expression  converges  to  •        "' 

OX0 

7}  ~f  (  $   if\ 
provided    !~-L^  be  continuous  with  respect  to  (x,  y)  at  the  point  (x0,  y0). 

It  has  thus  been  proved  that*,  in  order  that  the  increment  of  the  function 

may  be  of  the  form  given  in  the  theorem  above,  it  is  sufficient  (1)  that   ~-  ,  — 

dx'  dy 

have  definite  values  at  the  point  (x0,  y0),  and  (2)  that  one  at  least  of  these 
partial  differential  coefficients  have  definite  values  everywhere  in  a  two- 
dimensional  neighbourhood  of  (x0,  y0),  and  be  continuous  at  (#„,  y0),  with  respect 
to  the  domain  (x,  y). 

O  /*     £\  /* 

310.    Let  it  now  be  assumed  that  J-,  ~-  exist,  and  are  continuous  functions 

ox  oy 

with  respect  to  (x,  y),  at  all  points  of  an  open  domain  D.  Let  D1  be  any  perfect 
domain  interior  to  D;  at  any  point  of  Dl  we  have 

f(x  +  h,y  +  k)  -f(x,  y  +  k)  =  h  £-f(o:  +  6h,y  +  k), 

(JiJL 

where  0  <  d  <  1,  and  h,  k  are  such  that  \h\<t,\k  <  £;  the  number  £  being- 

I  -*  '    \  ^   J  3  o 

so  chosen  that  £\/2  is  less  than  the  distance  between  the  boundaries  of 
D!  and  D,  so  that  the  cell  (x  —  h,  y  —  k',  x  +  h,  y  +  k)  is  certainly  interior  to  D. 

*  Thomae,  Eirileitung  in  die  Theorie  der  bestimmten  Integrate,  p.  37. 
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r\ 

We  have  also  f(x,  y  +  k)  —f(oc,  y)  =  k^-f(x,  y  +  Q'k),  where  0  <  6'  <  1. 

Since  ^-,  ~-  are  continuous  in  the  perfect  domain  D1}  they  are  uniformly 

continuous  in  that  domain.    Accordingly,  a  positive  number  e  (<  £)  can  be 
so  determined  that,  if  77  be  a  prescribed  positive  number, 

d  d 

j?  /        |     /}/  i     7  \  •£  /  \  I   ^ 

\    O  Ci 

and 

for  all  points  (x,  y},  of  Dl ,  provided  |  h  \ ,  \k\  are  both  less  than  e. 

The  numbers  ij,  e  converge  to  zero  together.    We  have  now  the  following 
theorem : 

If the  function  f(x,  y)  have  partial  differential  coefficients  that  are  continuous, 
with  respect  to  the  plane  domain,  at  all  points  of  an  open  domain  D,  then 


O  (jll 

cohere  (x,  y},  (x  +  h,  y  +  k)  are  any  points  of  D;  and  R,  R'  tend  to  the  limit 
zero,  uniformly  for  all  points  (x,  y)  of  any  perfect  domain  contained  in  D,  as 
h,  k  converge,  in  any  manner,  to  zero. 


EXAMPLES. 


1.    Let*  f(x,  y}  =  >J\xy\)  where  the  positive  value  of  the  square  root  is  to  be  taken. 

"^  £     OJT 

In  this  case  ^-,  ~  both  exist  at  the  point  (0,  0),  and  are  both  =  0.    We  have 
ox    cy 


"J 

~I 
h 


\  k)  _      I 

~  V 


and  this  has  different  constant  values  for  different  constant  values  of  k/h,  and  is  therefore 
discontinuous  at  the  point  A=0,  £  =  0.  In  this  case  the  equation  /  (h,  k)  =  hp  +  k<r,  when 
p  and  cr  converge  to  zero  with  h  and  &,  cannot  hold. 

2.    Lett  /(#,?/)  =  x  sin  (4  tan"1  yjx),  for  .r>0;  and    /"(O,  y)  =  0,  for  all  values  of  y. 

We  find  C^4°L° )==0,   ^1°'^=0,  and  thus  ^®L^  is  continuous  with  respect  to  y,  at 
o,e  ex  dx 

(0,  0).     Also,  we  find  ^-^  =  4,    ^'-0)  =  0,  and  therefore  ^  (5'  0)  is  discontinuous 
ty  oy  ?AJ 

with  regard  to  a,  at  (0,  0).     The  value  of  —          /         ' —    is  sin  f^tan"1^ ) ,  and  this  is 

discontinuous  at  h  =  0.  /?-=0;  hence  the  relation  df=^-dx+^-  dy  does  not  hold  at  the 

ox  cy    * 

point  (0,  0). 

*  Stolz,  Grundziige,  vol.  i,  p.  133. 

t  Harnack's  Introduction  to  the  Differential  and  Integral  Calculus,  Cathcart's  Translation,  p.  93. 
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HIGHER  PARTIAL  DIFFERENTIAL  COEFFICIENTS. 

311.    If  the  function  f(x,  y)  have  the  partial  differential  coefficient  r^,  it 

ox 

r\  f 

may  happen  that,  at  the  point  (x0,  y0),  the  function  ^  possesses  a  partial  dif- 

d2f(x    y  ) 
ferential  coefficient  with  respect  to  x.    This  is  denoted  by       \  °'2       ,  and  is 

O&o 

spoken  of  as  the  second  partial  differential  coefficient  of  f  (x,  y]  with  respect 

d2f(x    v  } 
to  x,  at  the  point  #„.    The  second  partial  differential  coefficient  -  J     °'2  w  , 

dyo2 

with  respect  to  y,  is  defined  in  a  similar  manner. 

,  2/0)  .     ,   ,,      ,        ,,     r     -,     c  1  (9/(#()  +  A,  #0)      9/(#0,  ?/„)] 
y/-  is  defined  as  the  limit  of  T  I*7  -  -  ^  L_^/(  .  lt  bemg. 


gr/^    »,  \ 
assumed  that  J   -"'         has   a   definite  value.     It  is  not  necessary  for  the 


existence  of  this  limit,  as  a  definite  number,  that  -       °  —  l^l  j  for  ^=j=0,  should 

' 


have  a  definite  value.    When  —  —  °—  —  -L-L-^-  has  limits  of  indeterminancy 


i  i  i  •  *•      \  '  f    V^O     '     '&S     v  O/  ^/    \  "^-'0     •     'V)    '/(]  I  ' 

it  may  happen  that  hm  j  -j-   — » — ^ — :      xj-  =0,  and  that 

lira  T-  -*  ^       +    '  ^    -  f^         y°  ' 


has  a  definite  value. 

""2  -f 

Thus,  in  accordance  with  the  definition  of  ^2,  this   second  partial  dif- 
ferential coefficient  may  exist,  as  a  definite  number,  at  the  point  (,r0,  y/0),  but 

~ 

ox 


not  at  points  in  the  neighbourhood  ;  it  is  however  assumed  that  ~  exists 


It  may  happen  that,  at  the  point  (x0,  y0),  the  function  ~  has  a  differential 

OOU 


coefficient  with  respect  to  ?/ :  this  may  be  denoted  by  ~—  ( --•  -  ),  or  —       °'  "°  . 

dy0\cuj          dy0dx0 

~\  /* 

Similarly,  when  ~  has,  at  the  point  (x0,  T/O).  a  partial  differential  coefficient 

with  respect  to  x,  this  is  denoted  by  ^  —  (  ^- ) ,  or  -----  '  "'  ^    .    These  partial 

ox0\di/J  oxudy0 
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differential  coefficients  are  said  to  be  the  mixed  partial  differential  coefficients 
of  the  second  order  at  (o?0,  y0),  of/  (x,  y),  with  respect  to  x  and  y,  the  order  of 
differentiation  being  different  in  the  two. 

Under  certain  conditions  which  will  here  be  investigated,  the  two  mixed 
partial  differential  coefficients  of  the  second  order,  with  respect  to  x  and  y, 
satisfy  the  relation 


dxdy  dydx 

which  may  be  regarded  as  the  fundamental  theorem  for  partial  differential 
coefficients  of  the  second  order. 

rl        /  rlf  \  rl2  f 

The  differential  coefficient  ~  —  (  ~  }  ,  or-  —  i—.  is  defined  as  the  partial  dif- 

9#o  \oyj        ox0dy0 

ferential  coefficient  at  (x0,  y0),  with  respect  to  x,  of  ^-  ,  it  being  assumed  that 
J—--—?^-  exists,  as  a  definite  number. 

fyo 

Thus           3£-  =  lim  1T  [lirn  f^^L^±^'f(^  +  h^ 
v%ovy0     h~o  ">  L*~o  K 

-Km  /fop,  yo  +  £)-/(>o,  2 


92/1 

For  the  existence  of  -  -—:•—  as  a  definite  number,  it  being  assumed  that 
ox0dy0 

0.  yo)  hag  a  definite  vaiue;  it  is  not  necessary  that  9/(^  +  h>  VA  should 


*  9 

have  a  definite  value  for  h  =|=  0.     =  —  ~  may  exist  as 

J 


,  y0)      9/(a?0>  y0) 


When  the  differential  coefficient  -  —  ^~  exists,  it  is  equal  to  the  repeated 

ox0dy0 

limit 

.     f(«;0  +  h,y0  +  k)  -/0<>  +  h,  2/0)  -/(a?0,  y<>  +  ^)  +/K,  y0) 
lim  lim  —  —  j^j  —  . 

h~o  k~o  nto 

Conversely,  if  this  repeated  limit  has  a  definite  value,  and  if  also 

i-    /(^o.yo  +  ^)  -/(^Q.  ?/o) 

11111    —  1 

&~o  « 

B2/" 
exists,  then  s—  "4     exists,  and  is  equal  to  the  repeated  limit. 

ox0oy0 

^2  •£ 


It  is  however  possible  that  the  repeated  limit  may  exist,  and  yet  that 

r)  f 

may  not  exist,  if  the  condition  that  ~-  exists  be  not  satisfied. 
J 
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For   example*  let  f  (x,  y)  =  Xi(#)  +  X*(y}'>  where  ;&(#),  X*(y)  are  non- 
difTerentiable  functions  of  x  and  y  respectively.    In  this  case 

(*o,  yo) 


do  not  exist,  and  thus  ~—  x—  does  not  exist,  but 

9#0ctyo 

/(#0  +  h  ,  y0  +  k)  -  /(a?0  +  A,  y0)  -f(x0>  y0  +  A?)  +/(#„,  y0) 
i  1  1  1  1  iim  —  1  7 

A~0  fc~0  "«' 

exists,  and  is  equal  to  zero. 

d'2f(cc    v  ) 

It  would  be  possible  to  define    %.     "'  -    as  the  value  of  this  last  repeated 

ox0  dy0 

"bf(x    11  ) 
limit,  when  it  exists,  in  which  case      \°'        would  not  necessarily  exist.    It 

fyo 

is  however  more  convenient  to  restrict  the  definition,  as  has  been  done  above, 

H  /   I  'T'        7/1 

to  apply  only  to  the  case  in  which   J  V°'        exists,  as  a  definite  number; 
accordingly  this  definition  will  be  adopted. 

92/  (x    y  }  df(iK    V  } 

It  will  also  be  assumed  that    J  \  "'    °   exists,  only  when  ~°'  "y     exists. 

oatf  ox0 

92f 
A  similarly  restricted  definition  of  ^  —  |  —  will  be  employed. 

uy0oss0 

312.    Denoting 

f(x0  +  h,y,  +  k)  -/(#„  +  h,  y0)  -/(«o,  2/o  +  ^)  +/(«0,  2/o) 

32/"         92/ 
by  ^(A,  ^),  the  condition  that^r  —  ^-  =  -~--  holds  is  identical  with  the  con- 


dition  that  the  two  repeated  limits  of  —  Vi  —  >  for  A~0,  &~0,  should  both  exist, 

lik 

rlf       "r^-f 

and  have  the  same  finite,  or  infinite,  value;  it  being  assumed  that  ~,  ~-  both 

oy    ox 

exist  at  the  point  (x0,  y0).  The  necessary  and  sufficient  conditions  for  the 
equality  of  the  two  partial  differential  coefficients  may  be  obtained  by  applying 
the  conditions  given  in  either  of  the  theorems  in  §  304,  and  §  306,  to  the 
function  F  (h,  k)/hk.  It  is  however  convenient,  for  application  in  particular 
cases,  to  possess  sufficient  conditions  relating  to  the  partial  differential  coeffi- 
cients in  the  neighbourhood  of  the  point  (x0,  y0). 

The  following  theorem  will  be  established: 


y) 
//"(I),    %.   J       exists,  and  is  finite,  at  all  points  in  a  plane  neighbourhood 

(e)  of  the  point  (x0,  y0),  except  that  its  existence  at  the  point  (x0,  y0)  itself  is  not 
*  See  Hobson,  Proc.  Loud.  Math.  Soc.  (2),  vol.  v,  p.  235. 
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d2f  (x  i/} 

assumed,  and  if  '(2),   %,  V       Aas  «  unique  double  limit  A,  at  the  point  (x0,  y0), 

0  U  Gw 

which  is  either  finite,  or   infinite   with   a  fixed   sign,  and   if  (3),  ^^"'  ^  } 

are 


In  particular,  the  conditions  of  the  theorem  are  satisfied  if  ^--  exists 

oydx 

everywhere,  without  exception,  in  a  neighbourhood  (e)  of(x0,y0),  and  is  continuous 

rt  f 

at  (x0,  2/0),  and  if  ~  exists  at  (XQ,  y0). 

In  the  first  place,  we  observe  that,  as  a  consequence  of  (1),  ^  '  ^  exists 

9# 

everywhere  in  the  neighbourhood  (e)  of  (#0)  y0);  its  existence  at  the  point 
(x0,  y0)  being  assumed,  in  accordance  with  (3).  Moreover  •  .  ^  '^^  must  be  con- 

C'tA-' 

tinuous  with  respect  to  y,  except  possibly  at  the  point  (x0,  y0)  itself.  It  also 
follows  that  f(x,  y)  is  continuous  with  respect  to  x,  at  every  point  of  the 
neighbourhood  of  (x0,  y0),  including  the  point  (#„,  y0)  itself,  on  account  of  (3). 

Let  us  suppose  that  h,  k  are  both  positive  ;  the  other  three  quadrants  may 
then  be  considered  separately,  in  the  same  manner. 

Since  —         -^  ,  where  0  <  h  <  e,  is  continuous  with  respect  to  y,  in  the 

U& 

closed  interval  y0  ^  y  ^  yn  +  k,  where  k  <  e,  and  since  it  possesses  a  differential 
coefficient  with  respect  to  y,  at  every  interior  point  of  that  interval  of  y,  we 
have,  by  the  mean  value  theorem  (§  262), 


dx  dx  dydx 

where  0  is  some  number  such  that  0  <  0  <  1.  The  expression  on  the  right- 
hand  side  is,  by  (2),  equal  to  Jc[A+a  (h,  k)],  if  A  be  finite,  where  |  a  (h,  k)  |  <  6j, 
an  arbitrarily  chosen  positive  number,  provided  h,  k  are  each  less  than  some 
fixed  positive  number  77,  dependent  on  e^  In  case  A  =  +  so  ;  the  corresponding 
result  is  that 

df(x0  +  h,  -y0  +  k)      df  (x,  +  h,  y,} 


8~ 
X  CX 


/b_i.V    , 


for   0  <  h  <  77,   0  <  k  <  rj,   where   N  is  an  arbitrary  positive  number,   and  rj 
depends  on  N. 

Since  the  function  f  (x,  y0  +k)—f(x,  yn)  is  a  continuous  function  of  x,  in 
the  closed  interval  (x0,  x0  +  h),  and  since  it  possesses  a  differential  coefficient 
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with  respect  to  x,  at  all  points  x  in  that  interval,  we  have,  by  applying  the 
mean  value  theorem, 


where  #'  is  a  number  such  that  0  <  &  <  1. 

From  the  two  results  obtained,  we  have  F(h,  k)  =  A  +  of  (h,  k),  when  A  is 
finite;  where    a'  (h,  k)  \  <  e1(  provided  h,  k  are  both  less  than  77. 

In  case  A  =  +  x  ,    F(h,k)\>N,ifh,k  are  both  less  than  77. 

The  case  A  =  —  oc  can  be  treated  similarly.    In  either  case  F(h,  k)/hk  has 

the  double  limit  A.    Since  ~,  ~  both  exist  at  (#0,  y0),  as  finite  numbers, 

dx0   dy0 

^\2  J?  ^2-/* 

it  follows  that  ^  —  ^—  ,      •{     both  exist,  and  have  the  value  A. 


The  sufficient  conditions  in  the  foregoing  theorem  are  somewhat  simpler 
than  those  stated  by  Schwarz*,  who  assumed  the  additional  condition  that 

7j  "f"  (  nf*     7/1 

./  .   >  yos  exjgts  an(}  fs  flnite,  for  all  values  of  x  in  the  neighbourhood  of  x  =  x0, 

tyo 
for  the  constant  value  ?/„>  of?/. 

A  somewhat  different  set  of  sufficient  conditions  has  been  givenf  by 
W.  H.  Young,  in  the  following  theorem: 

92/ 
If  (1),   o~^~  exists,  and  is  finite,  at  all  points  of  the  neighbourhood  (e)  of 

the  point  (x0,  y0),  except  those  points  for  which  x=  x0  or  y  =.i/0>  ^s  existence  on 

rfif 

these  straight  lines  not  being  assumed,  and  if  (2),  these  values  of  ~   /    have  a 

rl  f 

unique  double  limit  A  (finite,  or  infinite  with  fixed  sign),  and  if  (3),  ~-   exists 

uoc 

on  the  two  straight  lines  X  —  XG,  y  =  yQ  in  the  neighbourhood  (e)  of  (x0,  ?/0)>  in- 
cluding that  point  itself,  and  is  continuous  with  respect  to  y,  when  y  =  y<>,  x  =|=  x0, 

7)£ 
and  if  (4),  ~  exists  when  y  =  yo,for  all  values  of  x  in  the  neighbourhood  (e)  of 

\J 

Tftj   (  fp       7/|       f}~  T  if        11    \ 

xa,  including  a?0  itself,  then    -4;     x  _y  v  o>  J*>_   j0^  exist,  and   have    the 

dx0  dy0          dy0  dx0 
value  A. 

Only  a  slight  modification  of  the  proof  of  the  last  theorem  is  required  in 
order  to  prove  this  theorem. 

*  GcsammelteAbh.,  vol.  n,  p.  275;  see  also  Peano,  Mathcsis,  vol.  x,  p.  153,  and  also  Stolz,  Gntnd- 
zilge  d.  Dijf.  Eech.,  vol.  i,  p.  147. 

t  Proc.  li.S.E.,  vol.  xxix,  p.  136. 

H.  26 
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?}f 
From  (1)  it  follows  that  jj~-  exists,  and  is  a  continuous  function  of  y,  when 

CSC 

7*if 
a?=f  fl?0»  2/4=2/0  5  this,  combined  with  (3),  shews  that  ~  exists  everywhere  in  the 

OCC 

neighbourhood  (e)  of  (x0,  y0),  and  that  it  is  everywhere  continuous  with  respect 
to  y,  except  possibly  at  the  point  (#„,  y0).  It  follows  that/  (as,  y)  is  everywhere 
continuous  with  respect  to  x.  The  proof  then  proceeds  as  before. 


iC    II  t        Vi(oO    li} 

313.  If      \         ,       \         have  total  differentials*  at  the  point  (ac0,  y0),  then 

COC  O'U 

.  s/o)  = 


In  accordance  with  the  condition  stated,  the  two  functions  ^—  ,  -£-  are 

ox     oy 

continuous  with  respect  to  (x,  y)  at  the  point  (os0,  y0),  and  they  accordingly 
exist  in  a  neighbourhood  of  (x0,  y0).  Hence  f(x,  y}  is  continuous  both  with 
respect  to  x,  and  with  respect  to  y.  We  have 


o  +  h,  y,  +  k)     3/(a?0,  y0)  _      92/(>0,  y0)          92/Q0,  2/o)      ,         , 

—  sr  —  ~^  —  «t  --  «  —  „  -----  r  A-    —  «  -  «  —  —  r  'fr/3  T  ftw, 

ox  dx0  9^o  82/0  9«o 

where    p  ,    a    converge  to  zero  as  h,  k  do  so  in  any  manner. 

We  have 

F(h,  k)=  {f(xQ+h,  y0  +  k)-f(x0  +  h,  y,)}  -{f(x9i  y0  +  k)-f(x0>  y,}} 

_ 

~ 


fa  fa 

where  6  is  such  that  0  <  9  <  1.    From  this,  we  see  that 


F  (h,  k)  =h0h        ;     +  k          y.)  +  eh,  +  ka 

(  dx0z  dy0  dx0 

-eh 

where    p  \  ,  j  <r'  \  ,  \  p"    converge  to  zero,  as  h  ~  0,  k  ~  0. 

We  now  have 

F(h,k) 


z      o  , 

Ilk  oyQ  OXQ  K  '  K 

hence,  if  h  and  k  converge  to  zero  in  any  manner  such  that  h/k  is  greater  than 
a  fixed  positive  number  e,  F  (h,  k)/hk  converges  to   -4^-§-^  •      Similarly,  it 


can  be  shewn  that  F  (h.  k)/hk  converges  to    •{.    "\  '  '  ^    ,  if  A  and  k  converge  to 

Ox0  oy0 

zero  so  that  k/h  >  e. 

*  See  W.  H.  Young,  loc.  cit. 
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By  letting  h  and  &  converge  to  zero,  so  that  both  h/k  and  k/h  are  >  e,  we 
^ 

Sec 


It  has  been  shewn,  in  §  309,  that  the  conditions  of  the  above  theorem  are 
satisfied  if  the  four  partial  differential  coefficients  of  the  second  order  all  exist 

£\2  f  C}2  -f 

at  the  point  (x0,  y0),  and  if  5~S  »   o~i  exist  in  a  neighbourhood  of  that  point 

ox      oy 

and  are  continuous  with  respect  to  (x,  y),  at  the  point  (x0,  y0).    The  following 
theorem  has  thus  been  proved  : 

If  the  four  partial  differential  coefficients  off(x,  y)  exist  at  the  point  (x0,  yQ), 

£\2  *f       Og  •£ 

and  if  ^  ,  ^  exist  in  a  neighbourhood  of  (x0>  y0),  and  are  continuous  relative 
to  (x,  y),  at  the  point  x0,  y0,  then 


314.  The  partial  differential  coefficients  of  higher  order  n  of  a  function 
f(x,  y)  are  of  the  form  g^T-y^'  .y  8a*  gy*  >  where  p,  q,r,...l  are  positive 

integers,  including  zero,  such  that  p  +  q+r+...  +  l  =  n.  Here,  f  is  first 
differentiated  I  times  with  respect  to  y,  then  k  times  with  respect  to  x,  and 
so  on.  The  total  number  of  possible  partial  differential  coefficients  of  order 
n  is  2n  ;  the  number  of  those  in  which  r  differentiations  with  respect  to  x, 

n\ 
and  n  —  r  with  respect  to  y  are  involved  is  —  T-—        -,  . 

r  !  (n  —  r)  ! 

Sufficient  conditions  for  the  existence  of  all  the  partial  differential  coefficients 
of  order  n  may  be  obtained  by  extending  the  theorem  of  §  311,  which  refers  to 
the  case  n  =  2.  The  following  criteria  *,  which  can  be  proved  by  induction, 
will  be  sufficient  for  the  purpose  : 

//  the  n-l  differential  coefficients  ^£_,  8^4/2""  *W*  ^ 
definite  finite  values  for  all  points  in  a  tiuo-dimensional  neighbourhood  of  the 
point  (x0,  ?/0),  and  are  continuous  at  the  point  (x0,  y0\  with  respect  to  (x,  y),  then 
all  the  other  mixed  partial  differential  coefficients  of  order  n  exist  at  the  point 
(x0,  y0)  ;  and  each  one  of  them  has  the  same  value  at  the  point  as  that  one  of 
those  given  above  in  which  the  same  number  of  differentiations  with  respect  to  x, 
and  with  respect  to  y,  occurs,  as  in  the  one  considered. 

EXAMPLE. 

Let+  the  function  f  (x,  y)  be  defined  by  f  (x,  y}=xy  --^—-^,  for  all  values  of  x  and  y 

x  +y- 

except  when  #=0,  y  =  0  ;  for  which  /(O,  0)  =  0.     At  the  point  (0,  0),  the  partial  differential 

o'2f      cf*f 

coefficients  5—5-,  -   A    both  exist,  and  have  different  finite  values. 
oxoy  oyox 

*  See  Stolz,  Grundzilfie,  vol.  i,  p.  153.  t  Peano,  Calc.  Di/.,  p.  174. 
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The  function  /(a?,  y)  is  continuous  at  the  point  (0,  0)  ;  for,  writing  x  =  r  cos  6,  y  =  r  sin  0, 
the  function  becomes  %r2  sin  40  ;  and  this  is  numerically  less  than  e,  provided  r 


df  (x  y)         (x*  —  y 
We  find    '    dx      =  y  W+y 

df  .          /(#,  0)  -/((),  0) 

£-  is  lim  s^-2  —  -  —  ^-^^  —  -  ,  which  is  =  0. 

ox      X=Q  x 

The  value  of  ¥^1  is  _y  and  that  of  9 
We  then  fed 


-  - 

oyox        y=Q  y  {     ox  tix 

and  ->  -lim  1 


- 

y  dy 

The  value  of  ,  as  also  that  of  ^  y),  is  l  +  r.  at 

'         ii  222 


point  except  (0,  0).  This  value  is  cos  2$  (1  +  2  sin2  20),  which  is  constant  for  a  constant 
value  of  0,  but  has  different  values  for  different  values  of  6  ;  and  thus  the  partial  differential 
coefficients  are  discontinuous  at  the  point  (0,  0).  The  conditions  of  the  theorem  giving 

p2/  g2^ 

sufficient  conditions  for  the  equality  of  —  4-  and  ~—  ^-  are  therefore  not  satisfied  for  the 

vxoy  oyox 

point  (0,  0). 


315.  Let  </>  (A)  be  a  function  of  h,  such  that  0  (0)  =  0,  and  such  that 
in  an  open  neighbourhood  of  the  point  h  =  0,  the  first  n  —  l  differential  co- 
efficients of  0  (h)  all  exist,  and  all  have  the  value  zero  at  h  =  0. 

We  have  then,  from  the  theorem  of  §  263, 


hn        nh^-1      n(n-l)h 
where  0  <  j  An_j   <  j  An_2  <  ...  <   /h   <  j 

Now  let  <f>(h)=F(h,  x,  y);  and  let  us  assume  that,  for  all  points  (x,  y) 
in  a  given  open  plane  domain  D,  and  for  all  values  of  h  in  a  given  open 
neighbourhood  of  h  =  0,  the  first  n  —  l  partial  differential  coefficients  of 
F  (h,  x,  y)  with  respect  to  h  exist,  and  are  all  zero,  when  k  =  0  ;  and  also 
that  F  (0,  x,  y)  =  0.  We  have  then 

F(h,x,y}          I 


where  /*„_!  is  such  that  0  <  j  hn^  <  \  h   ;  the  value  of  /IM_J  depending  on  the 
values  of  h,  x,  and  y. 


If  -'  J  '  have    partial    differential   coefficients   with    respect   to 

oiin~} 

h,  x,  y,  that  are  continuous  in  the  open  domain  for  which  (x,  y)  is  in  D,  and 
h  in  the  given  open  interval,  then,  employing  the  theorem  of  §  310,  as  ex- 
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tended  to  the  case  of  a  function  of  the  three  variables  h,  x,  y,  we  see  that 
dnr~1F(hnr.lt  x,  y}  _  ,       r(dnF(h,  x,  y)\  1 

where  R  converges  to  zero,  as  An_!  converges  to  zero,  uniformly  for  all  points 
(x,  y}  in  a  perfect  domain  A>  contained  in  D,  and  for  all  points  hn^  that 
correspond  to  a  point  h  in  a  closed  neighbourhood  of  h  =  0,  interior  to  the 
given  open  neighbourhood. 

Therefore,  if  h  be  sufficiently  small,  and  |  h  \  £  h,  we  have 


hn  nl\        dhn 

for  all  points  (x,  y)  in  D^ 

Now  let 

F(h,  x,  y)  =f(x  +  h,  y)  -f(x,  y} 

this  satisfies  the  conditions  that  F  and  its  first  n— 1  differential  coefficients 
with  respect  to  h  all  vanish  when  ^  =  0;  we  have  therefore,  provided  f(x,  y) 
and  its  partial  differential  coefficients  of  the  first  n  orders  are  continuous 
in  D, 


where  R'  converges  to  zero,  as  h  ~  0,  uniformly  for  all  points  (x,  y)  in*  the 
perfect  domain  D1}  interior  to  D. 

Similarly,  we  find  that 


where  R"  converges  to  zero,  as  k  ~  0,  uniformly  in  Z)lt 

In  the  first  of  these  equations,  we  can  write  y  +  k  for  y,  where  k  is  so 
small  that  (x,  y  +  k)  is  interior  to  D,  for  all  points  (x,  y},  of  D±.    We  have  then 

f(x  +  h,  y  +  k)  -f(x,  y  +  k} 


_  ,  a/te  y  +  *K      ,  hr  vf(x,  y  +  k}  ,_ 

-  +"  ~~          +  " 


Avhere  J2"'  converges  to  0,  as  A-  ~  0,  uniformly  in  D,,  and  uniformly  with 
respect  to  &,  for  all  values  which  do  not  numerically  exceed  a  fixed  number. 

Since  all  the  partial  differential  coefficients  of  the  first  orders  are  con- 
tinuous in  D,  we  have,  provided  h  ,  k  are  so  small  that,  when  (x,  y}  is  in 
DJ,  the  points  (x  +  h,  y  +  k)  are  all  interior  to  D, 


,  y}         V"f(x,  y)  k—     (^f(x,  y} 

i     K          ?:     ^.     ~     —    "T   .  .  .  T  /  \  i  1    's    «     i**^    *      '     •"•rr  > 

~ 


_ 

^    ~  *\    «  i  ?:     ^.     ~     —          .  .  .         /  \  i  1    's    «     i**^ 

9«r  dxr  dijdxr  (n  —  r)l(dyn~rdx 

•where  Rr  converges  uniformly  to  zero,  as  k  ~>  0. 
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Substituting  this  expression,  for  r  =  1,  2,  3,  . . .  n,  in  the  expression   for 
f(x+h,  y  +  k)  —f(x,  y  +  k),  and  adding  the  expression  for 

f(x,y  +  k)-f(x)y); 
we  obtain  the  formula 


dy     2!V  dx        dy 

If/,1      k—\f 
rl\    dx        dyj 

dx        d 

where  R  =  anihn  +  anztil-lk  +  ...  +  annkn;  and  o^,  am,  ...  ann  all  converge  to 
zero,  as  h,  k  converge  in  any  manner  to  zero,  uniformly  for  all  points  (x,  y) 
of  the  perfect  domain  DI,  interior  to  D. 


z-  +  k  fr     fis  used  to  denote 

dx  J 


,  ... 

~l  2!  dy*oxr  2  oyr 

That  v  ~-  —  =  =  -  ^~r     follows  from  the  condition  of  the  continuity  of 
dxrdyn~r     dyn~rdxr 

the  partial  differential  coefficients  of  the  first  n  orders  of  the  domain  D. 
That  this  is  the  case  is  verified  by  the  fact  that  the  expansion  obtained 
above  may  also  be  found  by  exchanging  the  parts  which  x  and  y  play  in  the 
process  by  which  the  expression  was  obtained. 

It  has  thus  been  shewn  that  : 

If  f(x,  y\  and  all  its  first  n  partial  differential  coefficients  with  respect  to 
x  and  y  are  continuous  in  an  open  domain  D,  of  (x,  y),  then  for  any  pair  of 
points  (x  +  h,  y  +  k),  (x,  y),  in  D, 


ft  1  T\  f/  \  T  1  J*  -          1  1  ~ 

f(x  +  h,  y  +  k)=f(x,y)+  (h=-  +k^-  }f+^(h^  +  k  ^-  }  f 
J  dx        dyJ  J      2l\    ox        OyJ  J 


where  R  =  amhn  +  an2hn~lk  +  ...  +  ctnnkn ;  and  ani,  an2,  ...  ann  converge  to  zero, 
as  h,  k  converge  in  any  manner  to  zero,  uniformly  for  all  points  (x,  y}  in  a 
perfect  domain  interior  to  D. 

This  theorem  is  the  generalization  of  that  of  §  310,  which  corresponds  to 
the  case  n  =  1. 

The  expression  (h-^-  +  k  ^-  J  f(x,  y)  is  called  the  rth  differential  off(x,  y}» 

\   ox        oy '  / 
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and  may  be  denoted  by  &{r>f(x,  y);  the  theorem  may  then  be  written  in 
the  form 

-f(x,  y) 


If  (x,  y}  be  a  fixed  point  (x0,  T/O),  and 
/(a?0  +  h,y0  +  k)-  f(x0  ,  y0) 


,    0 

Avhere  R  has  the  form  and  properties  given  above,  the  function  f(x,  y)  is 
said  to  possess  an  nth  total  differential  at  the  point  (x0,  y0).  In  accordance 
with  the  above  theorem  this  wth  total  differential  certainly  exists  in  case  there 
exists  a  plane  neighbourhood  of  (x0,  y0)  in  which  all  the  partial  differential 
coefficients  of  the  first  n  orders  exist,  and  are  continuous.  But,  as  has  been 
shewn  in  |  '309,  for  the  case  n  —  1,  less  stringent  conditions  than  these  are 
sufficient  to  ensure  the  existence  of  the  nth  total  differential  at  the  point 
(x0,  y0).  It  has  been  shewn*  by  W.  H.  Young  that,  if  f(x,  y)  have  an 
(n  —  l)th  total  differential  at  (x0,  y0),  while  the  partial  differential  coefficients 
of  order  n  —  1  all  exist,  and  are  independent  of  the  order  of  differentiation, 
in  a  closed  neighbourhood  of  the  point  (x0,  y0),  and  further,  if  they  all  have 
first  differentials  at  that  point,  then  the  function  has  an  nth  total  differential 
at  (x0,  2/o)>  so  that  the  above  expansion  theorem  holds  for  the  number  n. 

FUNCTIONS  DEFINED  IMPLICITLY. 

316.  A  function  y,  of  the  variable  x,  is  said  to  be  defined  implicitly  when 
there  is  given  a  functional  relation  between  the  variables  of  the  form  F(x,  y)  =0; 
provided  that  this  relation  suffices  to  determine  uniquely,  in  some  domain  of  x, 
the  corresponding  values  of  y. 

The  first  general  theorem  as  to  the  existence  of  the  function  y  =  (f>  (x), 
determined  implicitly  by  a  relation  F  (x,  y)  =  0,  was  given  by  Cauchy,  for  the 
case  in  which  F  (x,  y}  can,  in  a  neighbourhood  of  a  fixed  point  (a,  /3)  at  which 
F  (a,  /3)  =  0,  be  represented  by  a  convergent  series  proceeding  by  powers  of 
oc-a,y-l3. 

The  theorem  was  freed  by  Dinif  from  the  restriction  that  F  (x,  y)  must  be 
analytic,  and  his  theorem  may  be  stated  as  follows  : 

V  J? 

If  F(x,  y)  is  continuous  in  a  neighbourhood  of  the  point  (a,  /9),  and  if  ^ 

exists,  and  is  continuous  in  this  neighbourhood,  and  does  not  vanish  at  the  point 
(a,  ft);  then,  corresponding  to  a  sufficiently  small  positive  number  8,  another 
such  number  e  can  be  so  determined  that,  to  each  value  of  x  in  the  open 

*  Proc.  Lond.  Math.  Soc.  (2),  vol.  vn,  p.  171.  t  Analysi  infinitesimals,  vol.  i,  p.  162. 
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interval  (a  —  e,  a  +  e),  there  corresponds  a  unique  value  of  y  in  the  open  interval 
(0  —  8,  /3  +  8),  so  that  these  corresponding  values  of  x  and  y  satisfy  the 
condition  F  (x,  y)  =  0.  The  function  y  =  </>  (x),  so  determined,  is  continuous  in 
the  open  interval  (a  —  e,  a  +  e).  Moreover,  if  the  farther  condition  is  satisfied 

dF 
that  ~-  exists,  and  is  continuous,  in  the  neighbourhood  of  (a,  /3),  the  function 

cf>  (x)  has  a  continuous  differential  coefficient  in  the  open  interval  (a  —  e,  a  +  e). 

Since  -^—  is  continuous  in  a  neighbourhood  of  (a,  /3),  and  does  not  vanish 
dy 

at  that  point,  a  positive  number  h  can  be  so  determined  that,  if 

a  —  h<x<a  +  h,    {3  —  h<y</3  +  h, 
-^-  does  not  vanish  at  (x,  y),  and  has  the  same  sign  as  at  (a,  /3).    Let  it  be 

assumed  that  the  sign  is  positive.  The  number  h  can  be  so  chosen  that  the 
closed  neighbourhood  (h)  of  the  point  (a,  /3)  is  interior  to  the  neighbourhood 

in  which  F  and  ^--  are  continuous. 
dy 

rl  V 

Since  ^-  is  positive  for/3  —  h<y<@  +  h,x  =  a,  and  F(x,  y)  vanishes  when 

x  =  a,  y  =  /3,  F(a,  y)  must  be  negative  for  /3  —  h  ^  y  <  0,  and  it  is  positive  for 
0  <  y  ^  /3  +  h.  Since  F  (a,  /3  —  h)  is  negative,  a  positive  number  k\  (^  h)  can 
be  so  determined  that  F  (x,  /3  —  h)  <  0,  for  a  —  k\  <  x  <  a  +  &lt  Similarly,  a 
positive  number  k.2  (^  h)  can  be  so  determined  that  F(x,j3  +  h)>0,  for 
a  —  k2  <  x  <  a  +  7i2.  If  k  be  the  smaller  of  the  two  numbers  k1}  k2,  we  see  that, 
for  a  -  k  <  x  <  a  +  k,  F(x,  /3  +  h)  >  0,  and  F(x,/3-h)<  0. 

o  ET 

Since  ^    >  0,  for  a  —  k  <  x  <  a  +  k,  there  must  be  one  value  of  y,  and  only 

one,  such  that    y  —  /3  j  <  h,  at  which  F(x,y}  vanishes,  for  each  value  of  x 
such    that    |  x  —  o    <  k.      Thus    the    function    y  =  (j)  (x}    is    determined    for 
x  —  a   <k.    When  h  and  k  have  been  determined,  we  may  take  8  ^  h;  then  a 
value  of  e  ^  k  can  be  determined,  corresponding  to  8. 

If  =        -  are  both  continuous,  when  I  x  —  a.   £  h',  \  y  —  8  \  ^  k',  where  h'<  h, 

ox     f)y 

k'  <  k,  the  difference  &F  of  the  values  of  the  function  F  at  any  two  points 
(x  +  A#,  y  +  Ay),  (x,  y)  in  that  neighbourhood  of  (a,  /3),  is  given  by 

fdF  /dF       \    . 

&F=  U-  +  p    Aa-  +  U-  +  o-    Ay, 
\9#     r/  \9y       / 

where  p,  o-  are  continuous  functions  that  converge  to  zero,  as  A.T,  Ay  do  so, 
uniformly  for  all  the  points  (x,  y). 

When  y  =  0  (x),  y  +  Ay  =  <£  (#  +  A#),  we  have  A^=  0  ;  thus 
Ay_ 
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and  the  expression  on  the  right-hand  side  converges  uniformly  to 

_dF/dF 
dx/  dy  ' 

Thus  the  differential  coefficient  <f>  (x)  exists,  and  is  continuous  in  the  open 
interval  a  —  k<  x  <a+  k. 

In  case  the  function  F '  (x,  y)  has  the  form  f(y)  —  x,  we  have  the  case  in 
which  the  function  <f>  (a)  is  determined  as  the  inverse  of  the  function  x  =/(?/). 
The  theorem  then  takes  the  following  form : 

Iff(y)  and  its  differential  coefficient/'  (y}  are  continuous  in  a  neighbourhood 
of  y  =  ft,  and  if  f  (y)  does  not  vanish  when  y  =  @,  then,  corresponding  to  a 
sufficiently  small  positive  number  8,  another  such  number  e  can  be  so  determined 
that,  to  each  value  of  x  in  the  open  interval  (a  —  e,  a.  +  e),  there  corresponds  a 
unique  value  of  y  in  the  open  interval  (ft  —  8,  ft  +  8),  such  that  the  corresponding 
values  of  x  and  y  satisfy  the  relation  x=f(y).  The  function  y  =  (j>(x),  so 
determined,  is  continuous,  and  has  a  continuous  differential  coefficient  in  the 
open  interval  (a  —  e,  a  +  e). 

317.  With  a  view  to  the  extension  of  the  general  theorem  of  §  316,  the 
following  lemmas*  will  be  required: 

Iff(xl,  x2,  ...  Xp)  is  continuous  in  an  open  p-dimensional  set  0,  and  is  either 
(1)  positive,  or  (2)  different  from  zero,  at  every  point  of  a  closed  set  G  contained 
in  0,  then  a  neighbourhood  of  G,  open  or  closed,  can  be  so  determined  that 
the  function  has  the  same  property  (1),  or  (2),  in  this  neighbourhood,  as  in  G  itself. 

Let  K  be  a  closed  set  that  contains  G,  and  is  contained  in  0.  Those  points 
of  K  (if  any)  for  which,  in  case  (1),  /^  0,  or  in  case  (2),  /=0,  form  a  closed 
set  K;  this  follows  from  the  continuity  off.  The  two  closed  sets  G,  K  have  no 
point  in  common,  arid  therefore  a  neighbourhood  of  G  can  be  determined  (see 
§  118)  so  as  to  contain  no  points  of  K  ;  this  neighbourhood  can  also  be  so 
determined  as  to  be  interior  to  K,  arid  therefore  to  0.  In  this  neighbourhood 
the  condition  (1),  or  (2),  is  satisfied. 

If  f(xi,  x2, ...  Xp)  is  continuous  in  an  open  set  0,  and  if  it  have  the  value 
zero  at  all  points  of  a  closed  set  G,  interior  to  0,  a  neighbourhood  of  G  can  be 
determined,  interior  to  0,  such  that  at  every  point  of  that  neighbourhood 

!/(«!,  Xz>...  Xp) 

is  less  than  an  assigned  positive  number  e. 

Let  a  closed  neighbourhood  H,  of  G,  be  determined,  that  is  contained 
in  0.  The  points  of  H  (if  any)  at  which  j/|  ^  e  form  a  closed  set  L,  which  has 
no  points  in  common  with  G.  Determine  a  neighbourhood  of  G  that  contains 
no  points  of  L,  and  is  interior  to  H.  At  every  point  of  this  neighbourhood 
the  condition  |/|  <  e  is  satisfied,  and  it  is  interior  to  H,  and  therefore  to  0. 

*  These  lemmas  were  established  by  Bolza,  see  Vorlesungen  ilber  Variationsprinzip,pp.  155-158. 
The  proofs  in  the  text  were  given  by  Hobson,  Proc.  Land.  Math.  Snc.  (2),  vol.  xiv,  p.  151. 
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318.  Let  us  now  suppose  that  x,  in  F (x,  y},  denotes  a  point  (xly  x2,  ...  xm} 
of  an  ra-dimensional  set,  and  further  that  F(x,  y}  involves  p  parameters 
c1}  c2,  ...  cp.  We  may  then  state  the  following  generalization  of  the  theorem 
of  §316. 

Let  c1,c2,...cp  have  the  systems  of  values  corresponding  to  points  in  a 
p-dimensional  open  domain  Xp,  and  let  the  m  +  1  variables  xlt  x2, ...  xm,  y  have 
the  systems  of  values  representing  the  points  in  an  (m  +  l)-dimensional  open 
domain  jDm+] ;  the  p  +  m  -f  1  variables  having  accordingly  the  systems  of  values 
representing  the  points  in  an  open  domain  Dp+m+l.  Let  F{x-i)  x2,...xm,  y\ 
Cj,  c2, ...  Cp)  be  defined  for  all  points  in  Dp+m+l,  and  be  such  as  to  satisfy  the 
following  conditions : 

dF 

(1)  F  is  continuous,  and  ^—  exists,  and  is  continuous  in  Dp+m+1. 

t7 

(2)  At  a  fixed  point  (a1}  cr2, ...  am>  /8)  in  Dm+i,  F  has  the  value  zero,  and 

dF 

=—  =j£ 0,/or  all  points  (cj,  c2,  ...  cp)  in  Xp. 

If  S  be  an  arbitrarily  chosen  positive  number  sufficiently  small,  another  such 
number  e  can  be  determined,  such  that,  to  each  value  of  y  for  which  y  —  {3  <  B, 
there  corresponds  one  and  only  one  set  of  values  of  (xlt  x2,  ...xm),  for  which 
xr  —  ar\<€,  for  r=  1,  2,  3,  ...  m,  and  for  each  point  of  any  given  closed 
domain  Xp  interior  to  Xp;  and  these  values  of  xlt  x2,  ...  xm,  y  are  such 
that  F(x-i,  x2, ...  xlir,  y;  clt  c2,  ...  cp)  =  Q.  Thus  a  unique  function 

y  =  (f)(a;l,x.2,...  xm ;  cl ,  c2,  . . .  cp) 
is  determined. 

.  dF    dF          dF         ,  dF    dF         dF    .. 

Moreover,  if  -= —  .  ^—  .  ...   ^ —  ,  and  -=—  ,  -^— ,  . . .  ^—  all  exist,  and  are  con- 
oxl     dx2          oxm  oc-i     oc.2          ocp 

iinuousin  Dp+m+l,  the  function  ^>  has  continuous  partial  differential  coefficients 
with  respect  to  the  m  +  p  variables  xl}  x2, ...  xrn,  clr  c2, ...  cp. 

The  proof  is  similar  to  that  in  §  316,  but  it  is  necessary  to  employ  the 
lemmas  in  §  317. 

The  function  F(xlt  x.2,  ...  xm,  y;  cl}  cz, ...  cp}  has  the  value  zero,  when 
LI\  =  cfj ,  xz  =  of2 , .  • .  xm  —  cxm ,  y  =  P) 

r)F 

whatever  set  of  values  c1;  C2,  ...  cp  may  have  in  Xp.    Since  ^—  is,  for  the  same 

sets  of  values,  always  different  from  zero,  and  is  continuous  in  Xp,  it  has 
everywhere  the  same  sign  in  Xp\  let  it  be  assumed  that  that  sign  is  positive. 
A  neighbourhood  (h)  of  the  set  of  points  («n  «2,  ...  am,  /3;  cl,  c.2,  ...  cp),  where 
Ci,  c2, . . .  cp  is  in  Xp,&  closed  set  interior  to  Xp,  can  be  so  determined  that,  for  all 

7)  F 
points  in  that  neighbourhood,  ^-  is  positive.    Thus  for 

ar  -h<  xr  <  a,  +  h,  (r  =1,  2,3,  ...  ni),  &-h<y<0  +  h,  (c, ,  c,,  . . .  cp)  in  Xp, 
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JiET  O  El 

^—  is  positive.  If  #,,  #2>  •••  #»i  have  the  values  0.1,  a2, ...  am,  since  ^—  > 0,  as  y 
dy  dy 

is  increased  from  y8  —  A  to  ft  -f  A,  jP  is  increased,  and  therefore,  since  it  is  zero 
when  y  =  j3,  it  must  be  negative  when  fi  —  h  £  y  <  @,  and  it  must  be  positive 
when  $  <  y  <  0  +  h.  This  holds  for  every  point  of  Xp. 

A  neighbourhood  of  the  (m  +p)  dimensional  set  of  points 


at  which,  for  y  =  yS  —  h,  F  is  negative,  can  be  so  determined  that  Fis  negative 
in  that  neighbourhood.  Thus,  for  a  properly  chosen  positive  number  ki  (^  h), 

F(x    x    ...x     8  —  h'c    c     ..c)<0 
for  ar  —  kl<ocr<ar  +  k1,  (r  =  1,  2,  3,  ...  m). 

Similarly,  a  positive  number  7c2(^  h)  can  be  so  determined  that 

F  (x    x     ..  x     3  +  h'  c    c     . .  c  )  >  0 
for  ar  —  k2  <xr<ar  +  k2,  (r  =  1,  2,  3,  ...  m). 

If  k  be  the  smaller  of  the  two  numbers  klt  k2,  we  see  that,  for 

ar  —  k  <  xr  <  ar  +  k.  (r  =  1,  2,  3, ...  m), 
F  is  negative  when  y  =  /3  —  h,  and  positive  when  y  =  $  +  h  ;  whatever  point 

(d,  C2, ...  Cp)  may  be  of  Xp.    Since  =-  >  0,  for  all  points  such  that 

Or  —  k  <  xr  <  ar  +  k,  (r  =  1,  2, . . .  m),  /3  —  h<y</3  +  h,  (cl)  c2, ...  cp)  in  Xp, 
we  see  that,  for  each  point  of  Xp  there  is  one  point  y,  and  only  one,  such  that 
j  y  —  ft  |  <  h,  at  which  F  vanishes,  for  each  set  of  values  of  a?1}  x.2,  ...  xm.   Thus 
a  function  y  =  <f>(x1,x.2,...xm:>  C1:  c2,  ...  cp)  is  determined,  for 

\xr—  a   <k,  (r.  =  1,  2,  ...  m), 
and  (d,  c2,  ...  Cp)  in  Xp, 

such  that  F  vanishes  when  y  has  the  value  of  $. 

The  difference  of  the  values  of  F  at  two  points 

i  T    \   ^^ *y       IT  -\-  /\ '7"  (x*      \    ^A *ir        77  '{"  /\ -7 /  •    />    4    ^/\ /i      />     i    y\ /•»  /^    ^.  ^^ ^»  \ 

is  given  by 

r=p/ap 


where  all  the  7/i+p  +  l  letters  o>,  p,  r,.  denote  continuous  functions  that 
converge  to  zero,  uniformly  for  all  values  of  the  variables  in  the  closed  set 
interior  to  Dm+p+1. 


When  y'—<j>(tti,Ma,'-"&ra>  C1,c2,...cp) 

and  ?/  +  At/=  0  («j  +  A^, 
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we  have  AF  =  0.    Thus  if  Ac1;  Ac2,  ...  Ac^,  all  vanish,  and 

Aa?!,  A#2, ...  Aav-i,  Aav+1,  ...  A#OT 
also  vanish,  we  have 


and  since  o-r,  p  converge  uniformly  to  zero,  we  have 

80  _  _aF  /aF 

8#r         dxr  I  dy  ' 
In  a  similar  manner,  we  see  that 

80  =  _  ap  idF 

dcr         dcr/  dy  ' 
Thus  the  second  part  of  the  theorem  has  been  established. 

319.  The  theorem  of  §  318  may  be  extended  to  the  case  in  which  y  is 
replaced  by  n  variables  2/1,3/2,  •  •  •  yn,  and  there  are  now  n  functions  Fl}Fz^  . . .  Fn 
which  involve  the  m  +n+p  variables.  The  theorem  then  takes  the  following 
form: 

Let  Ci,c.2,...cp  have  the  system  of  values  corresponding  to  points  in  a 
p-dimensional  open  domain  Xp,  and  let  (xl,  x2,  ...  xm,  yly  y2,  ...  yn)  have  the 
system  of  values  corresponding  to  the  points  in  an  (m  +  n)-dimensional  open 
domain  Dm+n;  the  m  +  n  +  p  variables  having  accordingly  the  values  repre- 
sented by  points  in  an  open  domain  Dm+n+p.  Let  n  functions 

Frfa,  scz,  ...  xm,  yly  y,, ...  yn]  clt  c2,  ...  c^),  (r=  1,  2,  3, ...  n) 
be  defined  for  all  points  in  Dm+n+p,  and  be  such  as  to  satisfy  the  following 
conditions : 

(1)  The  functions  Fr  are  continuous,  and  all  the  functions 

dFr    ,  0 

^,(.,=  1,2,...,,), 

exist,  and  are  continuous,  in  Dm+n+p. 

(2)  At  a  fixed  point   (a1}  az,  ...  am,  /31(  /32,  •••  /3n)  in  Dm+n,  all  the  r 
functions  vanish,  and  the  Jacobian 

d(Fl,F,,...Fn) 
8(s/i,  2/2,  ...yn) 
has  a  value  which  is  not  zero,  for  all  points  (c1(  ...  c^)  in  Xp. 

If  8  be  an  arbitrarily  chosen  positive  number,  sufficiently  small,  another 
such  number  e  can  be  so  determined  that,  to  each  set  of  values  of  ylt  y2, ...  yn 
for  which  \  yr  —  /3r  \  <  8,  (r  =  1,  2,  3, ...  n),  there  corresponds  one,  and  only  one, 
set  of  values  of  ccl,  #.2,  •  •  •  ®n  for  which  \xr—a,r\<e,(r=1,2,...  m),  for  each  set 
of  values  of  (c1}  c2, ...  c^,)  in  a  closed  domain  Xp,  interior  to  Xp;  and  these 
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values  of  x  and  y  are  such  that  the  n  functions  Fr  are  all  zero.     Thus  unique 
functions  yr=<f>r  (xlt  xt, ...  xm;  clt  C2,  ...  cp),  (r  =  1,  2,  3,  ...  n)  are  determined. 

itr  •  rd*r     Off         .    ,      x. 

Moreover,  if  = —  ,  -^—  exist,  /or 

fjXg       CCf 

r  =  l,  2,...n;  s=  1,  2,  3,  ...  m  ;  *=  1,  2,  3,  ....p, 

awe?  are  continuous  in  Dm+n+p>  the  functions  </>r  are  all  continuous  and  have 
continuous  partial  differential  coefficients  with  respect  to  Hie  m  +  p  variables. 

The  theorem  of  §  318  is  the  particular  case  of  this  theorem  which  arises 
when  n  =  1.  Assume  that  the  theorem  holds  when  n  =  v  —  1  ;  it  will  then  be 
shewn  to  hold  when  n=  v,  and  thus  it  will  hold  generally. 

3(fl,Jt,....J'.)^ 

As  before,  if  -=-; —  — r-  >  0, 

9(2/1,2/2,  •  ••£/,) 

where  xr=  ar.  (r  =  1,  2,...  m),  yr  =  @r,  (r  =  1,  2,  3,  ...  i/), 

for  all  points  (aly  «2,  ...  o^)  in  Xp,  there  exists  a  positive  number  A,  such  that 
the  same  inequality  holds,  provided  j  xr  —  ar  <  h,  \  yr  —  /3,.  |  <  h,  for  all  points  of 
Xp  a  closed  domain  contained  in  Xp. 

At  a  point  at  which  the  Jacobian  is  not  zero  it  is  impossible  that  all  the 
partial  differential  coefficients  — L ,  — -1 . . . .  ~  can  vanish.  Choosing  a  number 

tyi  fy-2      fy« 

h'  (<  h),  at  each  point  of  the  closed  domain  E  given  by 
\yr-/3r\^h,  (r=l,  2,3,...i/), 
j  xr  —  ar  |  ^  A',  (r  =  1,  2,  ...  m),  and  (cl5  c2,  ...  cp) 

in  ^p,  one  at  least  of  the  above  partial  differential  coefficients  is  different 
from  zero.  For  any  point  of  the  closed  set  E,  a  closed  neighbourhood  of  the 
point  can  be  determined,  in  the  whole  of  which  one  and  the  same  differential 
coefficient  is  different  from  zero.  By  employing  the  Heine-Borel  theorem, 
we  see  that  a  finite  number  of  the  cells  that  constitute  these  neighbour- 
hoods exists,  such  that  each  point  of  E  is  interior  to  one  at  least  of  the 
cells  of  this  finite  set.  We  obtain  in  this  manner  a  division  of  the  set  E 
into  a  finite  number  of  parts,  such  that,  in  each  part,  one  of  the  differential 

T^W     7}Jf          riTf 

coefficients  ~  .  ~  - ,  ...  ~  is  different  from  zero  in  the  whole  of  that  part. 
32/i    fy2         fyv 

rtf 

Let  us  assume  that,  in  one  of  these  parts,  . -1-  >  0,  and  that  this  part  contains 

^2/i 
the  points  («1;  cr2, . . .  am,  /3j,  /3.2, ...  J3V;  Cj,  c2, . . .  cp)  in  its  interior,  provided  that 

(cn  c.2:...cp)  belongs  to  a  certain  closed  part  of  Xp  which  we  may  denote 
by  Xp*. 

We  now  consider  Fl(xl,  x2,  ...  xm,  y±,  y2, ...  yv ;  clt  c.,, ...  cp)  as  a  function 
of  xl,  x2,  ...  xm,  y1}  and  of  the  parameters  y.,,  y3, ...  yv,  d,  C2,  ...  cp.  Applying 
now  the  theorem  of  §  317,  we  see  that  positive  numbers  Al5  kl  (^  h')  can  be  so 
determined  that  one  and  only  one  value  of  yl  exists,  such  that  yl  —  /3l  <  /^ 


414  Functions  of  a  real  variable  [OH.  v 

corresponding  to  each  set  of  values  of  xlt  #2,...  xm  such  that  |#r  —  c^  <klf 
(r  =  1,  2,  3,  ...  ra),  and  provided  (y2,  y3,  ...  yv,  c,,  c2,  ...  cp)  is  in  a  certain  closed 
domain  of  v  +  p  —  1  dimensions.  This  value  of  yl  may  be  denoted  by 

2/i  ==  y  i  \^i  >  xz,  ...  xm  ,  ?/2  ,  2/3  ,  .  .  .  y  „  ,  Cj  ,  C2  ,  .  .  .  c^). 
On  substitution  of  this  value  of  y1  in  the  equations 

F2=0,FS  =  0,...FV  =  0, 
we  obtain  a  set  of  v  —  1  equations  which  may  be  denoted  by 

<pr  (si/i  ,  xz,  ...  xm  ,  y2  ,  ys  ,  .  .  .  yv  ,  GI  ,  C2  ,  .  .  .  Cp)  =  0, 
where  r  =  2,  3,  .  .  .  v.    These  hold  when  j  xr  —  ar   <  k1}  and  provided 

(.2/25  2/3>  •  •  •  2/"  >  GI,  Cg,  .  .  .  Cp) 

is  in  the  specified  domain.  Assuming  the  theorem  to  hold  for  n  =  v  —  1,  we 
see  that  there  exist  values  of  yz,  ys,  ...  yv,  given  by  equations 

yr  —  <j>r(®i>®a>>~®m',  GI  ,  ca  ,  .  .  .  Cp)  (r  =  2,  3,  .  .  .  i/), 
and  that  these  values  are  unique  for  every  set  of  values  of 

x1}  x.2,  .  .  .  xm,  d,  c.2,  ...  op,  such  that    xr  -  af\<  k2  (r  =  1,2,...  m), 
and  such  that  (c1;  c.2,  ...  c^,)  is  in  Xp°;  where  k2  is  some  positive  number.    The 
values  of  y.2,  ys,  .  .  .  yv  are  such  that    yr  —  J3r   <  hs,  some  fixed  positive  number  ; 
this  must  be  so  small  that  the  points  (y2,  ys,  ...  yv,  clt  c2,  ...  cp)  are  all  in  the 
specified  closed  domain.    This  holds,  subject  to  the  condition  that 

^  (^IL^3!^:  jy  i  o 
d(y2,y.,,...yv) 

at  the  points          («i,  «2,  ...  am,  /32,  @3,  ...  ftv;  c1;  c2,  ...  cp), 
where  (c1}  c2,  ...  cp}  is  in  Xp». 

The  function  /i  has  continuous  partial  differential  coefficients  with  respect 
to  all  the  variables  contained  in  it.    Thus 

9/T         dF./dF, 

o     —  —  3—7  a  —  ,  (r  »=  2,  d,  .  .  .  .i;) 

92/r         9yr/  9^ 

9/,_      dF./dF,     _- 

--  i     /^     \£  —  ^i  *•»•••  ^J 

a;g  d#s/  9t/j 

a/1__8Jp1/a^ 

^-    WJ'        "p 

NOW  fe.!f+ 


On  substituting  the  values  of  the  partial  differential  coefficients  in  the 
Jacobian,  we  obtain,  after  a  slight  transformation, 

d(F1,F9l...Fr)/dFlm 
1.2/2.  >.>y*)l  fyi  ' 
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and  thus  the  condition  is  that 


at  the  points  («,,  «2,  •  ••  «w,  &,  /32, ...  &,;  c1}  c2, ...  cp),  where  (c1(  ca, ...  c^)  is 
in  Xp°. 

It  has  now  been  shewn  that  there  is  a  unique  value  of  y,,  such  that 
I  y\—&i  |  <  ^i ,  for  each  set  of  values  of  y2,  y3, . . .  yv,  £cl}  x2, ...  xm,  cl}  c2,  . . .  cp, 
such  that  (yz,  y.A, ...  yv,clt  c2,  ...  cp)  is  in  a  certain  closed  domain  which  includes 
a  set  of  points  for  which 

2/2  =  &,  2/3  =  A, . . .  y,  =  /3, ;  and  that    xr  -ar\<  k^ 

It  has  also  been  shewn  that  a  unique  set  of  values  of  ya,  y3,  ...  yv,  such  that 
j  yr  —  /8r  <  hz,  (r  =  2,  3,  ...  v)  can  then  be  determined  for  each  set  of  values  of 
(a?!,  #2>  •••  #»»>  Ci,  c2,  ...  c^)  such  that  \  xr  —  ar  \  <  k2  and  that  (c1}  C2, ...  cp)  is  in 
jfiTp0.  We  can  choose  any  value  of  hlt  or  of  h?,  smaller  than  the  value  fixed  ; 
then  &j  and  A;2  will  in  general  have  to  be  diminished.  By  altering  the  value 
of  one  of  the  numbers  hjt  h2  we  may  make  them  have  equal  values,  say  h; 
and  for  k  we  then  take  the  smaller  of  the  corresponding  values  klt  k2.  It  now 
appears  that  there  exists  a  unique  set  of  values  of  yly  y2,  ...  yv,  such  that 
yr  —  (3r\<  h,  (r=l,  2,  ...  v),  corresponding  to  each  point 

(SDi)  x%, ...  a;m,  GI,  c2, ...  Cp) 
such  that  j  xv  —  ar   <k,  (r  =1,  2,  ...  m),  and  that  (c1}  C2, ...  cp)  is  in  Xp°. 

As  every  point  of  Xp  is  in  one  or  more  of  a  finite  number  of  such  sets  as 
Xp°,  corresponding  to  each  of  which  one  of  the  partial  differential  coefficients 

7)V 

— -  is  different  from  zero  for  the  whole  of  that  set,  the  above  reasoning  may 

jf 

be  applied  to  each  such  set  Xp°. 

We  have,  in  each  case,  a  pair  of  positive  numbers  h,  k,  determined  as 
above  ;  moreover  we  may  take,  instead  of  h,  any  number  less  than  h ;  k  being 
correspondingly  diminished.  If  we  take  for  8  the  smallest  of  the  numbers  A, 
it  is  clear  that  a  corresponding  number  e  may  be  determined  so  that  8  and  e 
can  be  used,  instead  of  h,  k,  for  all  the  parts  of  Xp.  It  has  thus  been  shewn 
that  the  theorem  holds  in  the  case  n  =  v,  if  it  is  assumed  to  hold  in  the  case 
n  =  v—l.  The  theorem  was  proved  in  §  318,  for  the  case  n  =  1 ;  and  therefore 
it  holds  generally. 

It  is  easy  to  see  that  the  above  general  theorem  may  be  modified  so  as  to 
apply  to  the  establishment  of  unique  values  of  y1}  y2,  ...  yn  which  satisfy  the 
n  equations 

•v  r  (&1  >  ^2 >  •  •  •  ®m  >  2/1 '  2/2 >   •  •  •  3/w  J    Cj ,  C2 ,  —  Cp)  =•  J )•  \Ci ,  C.I ,  ...  Cp), 

where  the  functions /,  (c1;  c2,  ...cp)  satisfy  the  conditions  that  they  have 
continuous  partial  differential  coefficients  in  the  domain  Xp ;  the  functions  Fr 
are  to  satisfy  the  same  conditions  as  before. 
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We  have,  in  fact,  only  to  consider  the  functions  Fr  —fr>  instead  of  the 
functions  Fr.  It  is  then  assumed,  as  before,  that  Fr  —fr  =  0,  when 

xr  —  ar,  yr  =  ftr,  for  all  values  of  (c1;  c2, ...  cp)  in  Xp. 

An  important  case  of  the  general  theorem  is  that  in  which  n  =  m,  and  in 
which  Fr  is  of  the  form  <j>r  (yl}  y2, . . .  yn  ;  d,  c2,  . . .  cp)  —  xr.  We  have  then  an 
extension  of  the  theorem  of  inversion. 

The  case  of  the  general  theorem  which  arises  when  c1}  c2, ...  cp  are  absent 
is  the  well  known  theorem  of  Dini  relating  to  implicit  functions*. 


MAXIMA  AND  MINIMA  OF  A  FUNCTION  OF  TWO  VARIABLES. 

320.  Let  us  suppose  that  a  function  f(x,  y)  is  defined  at  all  points  in  a 
two-dimensional  neighbourhood  of  the  point  (#0,  y0). 

If  the  function  be  such  that  f  (x0  +  h,  2/o  +  &)  —  /(%<»  2A>)  <  0,  for  all  values 
of  h,  k  which  are  not  both  zero,  and  are  such  that  ',  h  \ ,  k'\  are  both  less  than 
some  fixed  positive  number  8,  then  the  function /(#,  y}  is  said  to  have  a  proper 
maximum  at  the  point  (#„,  y0). 

In  case  the  fixed  number  S  can  only  be  so  determined  that  the  condition 
f(x0+h,y0  +  k}—f(a~0,y0)^0  is  satisfied,  the  function  is  said  to  have  an 
improper  maximum  at  the  point  (#•„,  y0). 

If  the  conditions  contained  in  these  definitions  be  replaced  by 
/Oo  +  /*,  2/o  +  &)-/(#„,  ?/o)  >  0,  and  /(>0  +  h,  y0  +  k)  -f(a;0,  y0)  ^  0 

respectively,  the  function  f(x,  y}  is  said  to  have,  in  the  first  case,  a  proper 
minimum,  and   in    the   second    case,   an  improper   minimum,   at   the  point 

0*o,  y0\ 

A  proper  or  improper  maximum  or  minimum  may  be  spoken  of  as  an 
extreme  of  the  function. 

At  an  extreme  (a-0,  y0\  f(xn  +  h,y0)-f(x(>,  y0),  f(x0-h,  y0)-f(ac0,  y,} 
both  have  the  same  sign,  or  are  zero,  for  all  sufficiently  small  values  of  A;  it 

follows  that,  if  J—^~-^-  exist,  it  must  be  zero.     A  similar  remark  applies 
OXQ 

.n  §/0»o,  yo) 

TJO  ^ 

cy<> 

These  conditions  are  necessary,  under  the  hypothesis  of  the  existence  of 
the  two  partial  differential  coefficients,  but  not  sufficient,  for  the  existence 
of  an  extreme  at  the  point  (#0,  2/o)- 

*  See  Jordan's  Cours  d' Analyse,  vol.  i,  §§  91, 92 ;  also  Osgood's  Lehrbuch  der  Funktionentheorie, 
vol.  i,  pp.  47-57.  On  the  general  theory  of  functions  defined  implicitly,  see  W.  H.  Young,  Proc. 
Lond.  Math.  Soc.  (2),  vol.  vn,  p.  397  ;  Hobson,  Proc.  Loud.  Math.  Soc.  (2),  vol.  xiv,  p.  147,  and 
Hedrick  and  Westfall,  Bull,  dc  la  soc.  math,  de  France,  vol.  XLIV,  p.  1. 
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If  we  write  x  =  ac0  +  r  cos  0,  y  =  y0  +  r  sin  0,f(x,  y)  =«  $  (r,  0),  it  is  clearly 
necessary  for  the  existence  of  an  extreme  of  f(x,  y}  at  (#0>  2/o)»  that  <£(?*,  6],  for 
each  constant  value  of  0,  should  have  an  extreme  at  r  =  0.  Thus,  for  an  assigned 
vajue  of  6,  a  positive  number  ae  can  be  determined,  such  that  one  of  the  four 
conditions  </>  (r,  0)  -/(#0,  y0)  <  0,  <£  (r,  (9)  -/(#„,  y0)  £0,<f>(r,0)  -:f(x0,  y0)  >  0, 
^>  (r,  6)—f(x0,  T/O)  =  0,  .according  as  the  point  is  a  proper  maximum,  an 
improper  maximum,  a  proper  minimum,  or  an  improper  minimum,  shall  be 
satisfied  for  all  values  of  r,  different  from  zero,  and  such  that  |  r  \  <  a.e.  Thus 
an  extreme  of  a  function  is  necessarily  an  extreme  for  values  of  the  function 
on  each  straight  line  drawn  through  the  point. 

This  condition,  though  necessary,  is  however  not  sufficient  ;  for  oto  may 
have  a  definite  value  for  each  value  of  0,  and  yet  the  lower  boundary  of  <xe,  for 
all  values  of  0,  may  be  zero.  In  this  case,  no  value  of  &  can  be  determined, 
as  required  in  the  definition  of  the  extreme  in  the  two-dimensional  domain. 
It  has  thus  been  shewn  that,  in  order  that  (x0,  yQ)  may  be  an  extreme  point 
for  the  function  f(x,  y},  it  is  necessary  and  sufficient  (1),  that  r  =  0  should  be 
an  extreme  point  of  <j>  (r,  0)  for  each  value  of  0,  and  (2),  that*  the  number  ad 
which  is  so  determined  for  each  value  of  0  that,  for  \  r  \  <  ae,  the  condition  as  to 
(/>  (r,  0)  —f(x0,  y0)  may  be  satisfied,  should  have  a  finite  lower  boundary,  when  all 
values  of  0,  (0  ^  0  ^  TT)  are  considered.  If  the  lower  boundary  of  ag  be  zero,  the 
point  is  not  an  extreme  point  of  the  function. 

When  the  lower  boundary  of  ae  is  d  (  >  0),  the  neighbourhood  of  (oc0,  y0\ 
which  must  exist  in  accordance  with  the  definition,  is  the  square  of  which  the 

^    f  •   4     (          d          , 

corners  are  the  tour  points  I  #0  ±  -/H  >  2/o  ±  ~ 

\  \/2 

EXAMPLE. 

As  an  example  of  a  function  which  possesses  no  minimum  at  a  point,  although  the  point 
is  a  minimum  for  each  straight  line  through  the  point,  we  may  take  the  function 


(y  -  ax2}  (y  —  6#    =y—y  ax 
where  a  and  b  have  positive  values. 

The  function  is  positive  outside  the  two  para- 
bolas ,y-a^2  =  0,     y-bzz  =  0, 

and  in  the  space  interior  to  the  inner  parabola; 
in  the  space  between  the  parabolas,  the  function 
is  negative.  Along  any  straight  line  QAR 
through  A  (0,  0),  the  function  exceeds  /(O,  0)  at 
all  points  interior  to  AP,  and  everywhere  in  PA 
produced  ;  thus  for  the  line  (JAR  the  function  has 
a  minimum  at  A.  The  point  (0,  0)  is  not  a  mini- 
mum of  the  function,  since  the  lower  limit  of  AP 
for  all  positions  of  QAR  is  zero  ;  and  thus  there 
exists  no  two-dimensional  neighbourhood  of  A  ,  in  which  the  function  is  never  less  than  at  A  . 

*  The  necessity  for  this  condition  has  been  disregarded  in  many  text-books.  The  insufficiency 
of  (1)  was  first  pointed  out  by  Peauo,  Calcolo  dijf'.,  Turin,  1884,  p.  29,  in  connection  with  the  example 
given  in  the  text.  See  also  Dantscher,  Math.  Annalen,  vol.  XLII,  p.  89,  and  Scheeffer,  Math.  A  nnalen, 
vol.  xxxv,  p.  541. 

H.  ^7 


FIG.  4. 
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321.  We  may,  without  loss  of  generality,  take  the  point  at  which  the 
conditions  for  the  existence  of  an  extreme  of  the  function  f  (x,  y)  are  to  be 
investigated  to  be  the  point  (0,  0).  It  will  be  assumed  that,  at  all  points  in  the 
neighbourhood  of  (0,  0),  f  (x,  y)  is  continuous  with  respect  to  x,  and  also  with 
respect  to  y.  The  following  theorem  contains  a  criterion  for  the  existence  of 
a  proper  maximum  (minimum)  at  the  point  (0,  0). 

The  necessary  and  sufficient  conditions  that  the  point  (0,  0)  may  be  a 
point  at  which  f  (x,  y)  has  a  proper  maximum  (minimum)  are  the  follow- 
ing*:— (1)  A  positive  number  8  must  exist  which  is  such  that,  if  x  be  any 
number  different  from  zero,  and  numerically  less  than  8,  the  upper  (lower) 
limit  off(x,  y),for  such  constant  value  of  x,  and  for  all  values  of  y  for  which 

—  x^y  ^  x,  being  f(x,  <£  (x)),  this  upper  (lower)  limit  is,  for  every  value  of 
x(—  8  <  a?  4=  0  <  8),  less  (greater)  than  /(O,  0). 

(2)  A  positive  number  8'  must  exist  which  is  such  that,  if  y  be  any  number 
different  from  zero,  and  numerically  less  than  8',  the  upper  (lower)  limit  of 
f  (x,  y),  for  such  constant  value  of  y,  and  for  all  values  of  x  for  which 

—  y^x£y,  being  f(^r  (y),  y},  this  upper  (lower)  limit  is,  for  every  value  of 
y  (—  8'  <  y  ={=  0  <  8'),  less  (greater)  than  f  (0,  0). 

It  will  be  observed  that,  since  f(x,  y)  is  assumed  to  be  continuous  with 
respect  to  x,  and  also  with  respect  to  y,  the  limit  f(x,  (f>  (x))  is  actually 
attained  for  some  value  </>  (x),  of  y,  in  the  interval  (—  x,  x),  and  the  limit 
f(^r  (y),  y}  is  actually  attained  for  some  value  -v/r  (y),  of  x,  in  the  interval 
(—  y,  y).  It  is  clear  that,  unless  both  the  conditions  stated  in  the  theorem 
be  satisfied,  /(O,  0)  cannot  be  a  proper  maximum  (minimum)  of  the  function. 
If,  for  example,  no  such  number  as  8  in  (1)  can  be  determined,  there  are 
points  in  every  neighbourhood  of  (0,  0)  at  which  f(x,  y)  is  =  (^)/(0,  0). 

The  conditions  are  sufficient.  For,  if  8,  8'  exist,  the  value  of  f(x,  y)  at  every 
point,  except  (0,  0),  within  the  neighbourhood,  the  corners  of  which  are  the 
four  points  (±  8",  +  8"),  is  less  (greater)  than  /(O,  0),  where  8"  is  the  lesser  of 
the  two  numbers  8,  8'. 

The  necessary  and  sufficient  conditions  that  the  function  f(x,  y)  may 
have  an  improper  maximum  (minimum)  at  (0,  0)  are  similar  to  the  above. 
In  this  case  f  (x,  0  (x))  must  be  less  than,  or  equal  to  (greater  than,  or  equal 
to)  /(O,  0),  for  all  the  values  of  x  in  the  interval,  and  f  (ty  (y),  y)  must  be  less 
than,  or  equal  to  (greater  than,  or  equal  to)  /(O,  0),  for  all  values  of  y  in  the 
interval.  Further,  corresponding  to  every  positive  number  8  <  8,  there  must 
be  a  value  of  x(<8),  for  which  f(x,  (f>(x))=f(Q,  0);  or  else  a  similar 
condition  must  hold  for  f(^r  (y),  y)',  or  in  both  cases,  the  condition  may  be 
satisfied. 

Other  methods  of  determining  whether  (0,  0)  be  a  point  at  which  there  is 
a  maximum  or  minimum  of  f  (x,  y}  will  be  dealt  with  in  Vol.  n. 

*  See  Stolz,  Wiener  Berichte  (Nachtrag),  vol.  c,  also  Grundziige,  vol.  i,  p.  213. 
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PROPERTIES  OF  A  FUNCTION  CONTINUOUS  WITH  RESPECT  TO 
EACH  VARIABLE. 

322.  Let  a  function  f(x,  y),  defined  for  all  values  of  x  and  y  in  a  con- 
tinuous domain,  be  everywhere  continuous  with  respect  to  y,  and  be  also 
continuous  with  respect  to  x  for  every  straight  line  parallel  to  the  #-axis, 
belonging  to  a  set  cutting  the  y-axis  in  an  everywhere  dense  set  of  points. 

Let  A  be  the  point  (x,  y},  and  let  BC  be  drawn  with  A  as  its  middle 
point,  parallel  to  the  y-axis,  and  of  length  2p.  If  &>  (p)  be  the  fluctuation  of 
f(x,  y}  in  the  interval  BC,  then  o>  (p)  is  a  continuous  function  of  p ;  and 
lim  ft>  (p)  =  0,  since  f(x,  y)  is  everywhere  continuous  with  respect  to  y.  Let  cr 

p~0 

be  a  fixed  positive  number,  and  let  $„  (x,  y}  denote  the  upper  limit  of  those 
values  of  p  for  which  &>  (p)  £  cr :  thus  w  (p)  £  cr,  if  p  ^  /S^  (x,  y) ;  and  w  (p)  >  a, 
if  p  >  $„  (x,  y). 

The  function  /?„.  (x,  y},  thus  defined  for  every  point  (x,  y},  is  everywhere 
positive ;  and  it  will  be  shewn  to  be  an  upper  semi-continuous  function  with 
respect  to  the  two-dimensional  domain  (x,  y},  in  accordance  with  the  definition 
in  §  230  and  §  234. 

Take  B0A0  =  A0C0  =  ftff(x0,  T/O);    and  also  B1B0 
fixed  positive  number.    The  fluctuation  of  f(x,  y) 
in  jBjG'i  is  greater  than  a;  let  it  be  cr  +  k.   If  ^  be 
a  fixed  positive  number  <  k,  two  points  M,  N  can  be 
found  in  Bfi^,  such  that 


where  e  is  a 
c 


IT 


f(M}-f(N} 

Moreover,  these  points  M,  N  can  be  so  chosen  as  to 
lie  on  two  straight  lines  parallel  to  the  #-axis,  which 
belong  to  the  set  along  each  of  which  f(x,  y)  is 
continuous  with  respect  to  x\  this  follows  from 
the  fact  that  this  set  of  straight  lines  cuts  BlCl  in 
an  everywhere  dense  set  of  points.  Since  f(x,  y}  is 
continuous  with  respect  to  x,  at  each  of  the  points 
M,  N,  two  segments  M'M",  N'N",  with  M  and  N 
as  their  middle  points,  can  be  determined,  so  as  to 
have  equal  lengths  28,  and  to  be  such  that 

\f(P}-f(M) 


M' 


M 


provided  P  be  any  point  in  M'M",  and  Q  be  any 
point  in  N'N". 

From  these  inequalities  and  the  former  one,  we  deduce  that 

/(P)  -  f(Q)  |  >  a. 


Q 


N" 


B! 

FIG.  5. 
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Take  the  square  of  which  A0  is  the  centre,  and  of  which  the  sides  are 
parallel  to  the  axes,  and  are  at  a  distance  from  ^40  less  than  the  smaller  of 
the  two  numbers  ^e  and  8.  If  A  be  any  point  in  this  square,  the  distance 
of  A  from  each  of  the  straight  lines  M'M",  N'N"  is  less  than  &.  (#„,  T/O)  +  e. 
Through  A  let  a  straight  line  be  drawn  parallel  to  the  y-axis,  and  mark  off 
on  it  the  segment  of  which  A  is  the  centre,  and  of  which  the  half-length  is 
/3<r  (a0,  2/0)  +  e  ;  this  segment  will  cut  M'  '  M"  and  N'N",  and  therefore  contains 
two  points  P,  Q  which  are  such  that 

\f(P}-f(Q}    >* 

Therefore  the  fluctuation  of  f(x,  y)  in  this  segment  is  >  a,  and  hence,  at 
the  point  A  (x,  y},  we  have  ^  (x,  y)  <  fi^  (x0,  T/O)  +  e.  A  square  having  been 
determined  with  its  centre  at  A0,  such  that  for  every  point  in  this  square 


it  follows  that  &„  (x,  y)  is  an  upper  semi-continuous  function  at  A0,  with 
respect  to  the  two-dimensional  domain  (x,  y}. 


323.  Let  us  now  consider  the  linear  set  C,  of  points  (x,  y}  defined  by 
y  =  <f>  (x),  where  (f>  (x)  is  a  continuous  function  of  x.  At  each  point  of  C,  the 
function  $a  (x,  y)  is  defined,  and  it  has  at  every  point  a  minimum  relatively  to 
the  set  C,  the  term  being  used  in  accordance  with  the  definition  given  in  §  224. 

If,  at  a  point  A0(x0,  y0),  of  C,  the  function  y3ff  (x,  y)  have  its  minimum  with 
respect  to  C  positive,  then  we  shall  prove  that  the  saltus  of  f(x,  y}  at  A0, 
with  respeot  to  the  two-dimensional  domain  (x,  y),  is  ^  2<r.  Let  7  denote  this 
minimum,  and  let  7!  be  a  positive  number  <  7. 

Let  an  interval  (x0  —  8,x0  +  8)  on  the  line  y  —  yQ  be  so  determined  that 
Q  in  this  interval 

[  (f>  (x}  —  <j>  (#„)    <|7j. 

This  interval  may,  if  necessary,  be  so 
reduced,  that  for  all  values  of  x  in  it, 


M. 


Describe  the  rectangle  R,  with  A0  as 
centre,  the  sides  parallel  to  the  axes  of 
x  and  y  being  2S  and  ^  respectively.  On 
every  segment  PQ,  of  R,  parallel  to  Oy, 
the  fluctuation  of  the  function  is  ^  cr ; 
for  there  is  on  PQ  a  point  A  of  the  set 
C,  and  the  segment  with  centre  A,  and 
length  2(3a  (A)>  2j^,  contains  the  whole 
segment  PQ. 

Taking  a  fixed  positive  number  e,  an  area  surrounding  A0  can  be  deter- 
mined, in  which  the  fluctuation  of  f(x,  y}  is  ^  2cr  +  e.    To  effect  this,  take  a 


FIG.  6. 
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point  Al  on  the  ordinate  through  A0,  and  in  the  rectangle  R,  such  that  A^  is 
a  point  of  continuity  of  f(x,  y)  with  respect  to  x;  then  on  the  straight  line 
y  =  yi,  take  a  segment  B1C1,  with  centre  Al}  and  of  length  28'  ^  28,  such  that 
the  fluctuation  of  /  in  BlCl  is  <  e.  Consider  the  rectangle  R  contained  in 
R,  such  that  the  sides  of  R'  are  of  lengths  28'  and  yl  parallel  to  the  axes,  its 
centre  being  at  An.  The  fluctuation  of  f(x,  y)  in  this  rectangle  is  <  2cr  +  e. 
For  if  M,  N  be  any  two  points  in  it,  let  Mlt  Nt  be  their  projections  on  BlGl  ; 
then 


\f(M)-f(Ml)  £<r,     \f(N)-f(Nl)   ^cr, 
from  these  inequalities  we  deduce  that 

\f(M)-f(N)\<2a  +  e. 

Since  this  holds  for  every  e,  the  saltus  of  f(x,  y),  at  A0,  is  ^  2<r.  If,  at  a  point 
A0,  the  saltus  off(x,  y)  be  >  2o-,  then,  at  A0,  the  minimum  of  $„  with  respect 
to  C  must  be  zero. 

Since  &a  is  positive  at  every  point  of  C,  and  is  an  upper  semi-continuous 
function  of  (x,  y),  it  follows  from  the  theorem  of  §  232,  that,  in  every  arc  D  of 
the  curve  6',  there  exists  an  arc  Dl  in  which  the  minimum  of  $a  is  positive. 
Let  us  take  a  sequence  crl,  a-2,  ...trn...  of  positive  decreasing  numbers,  of 
which  the  limit  is  zero.  It  is  then  clear  that,  in  every  arc  D,  there  exists  a 
point  where  /3<rn  has  its  minimum,  with  respect  to  C,  positive,  for  every  <rn. 
At  this  point  the  fluctuation  of  f(x,  y}  with  respect  to  the  two-dimensional 
domain  (x,  y)  is  ^  2crn,  for  all  values  of  n,  and  is  therefore  zero.  This 
point  must  be  a  point  of  continuity  of  f(x,  y)  with  respect  to  (x,  y). 

The  following  general  theorem*  has  thus  been  established: 

If  f(x,  y)  be  a  function  of  the  two  variables  x,  y,  which  is  everywhere 
continuous  with  respect  to  y,  and  is  continuous  with  respect  to  x  along  straight 
lines  parallel  to  the  x-axis,  which  cut  the  y-axis  in  an  everywhere  dense  set  of 
points,  then  in  every  portion  of  a  curve  y  =  <f>  (x)>  where  <j>  (x)  is  a  continuous 
function,  there  exist  points  at  which  f(x,  y)  is  continuous  with  respect  to  the 
two-dimensional  domain  (x,  y\ 

It  follows  from  this  theorem  that  points  of  continuity  exist  in  every  area, 
that  isf(x,  y)  is  at  most  a  point-wise  discontinuous  function. 

The  whole  of  the  reasoning  above  is  applicable,  if  only  those  points  of 
(x,  y)  are  taken  into  account  which  belong  to  a  perfect  set  G.  It  thus  appears 
that,  under  the  conditions  stated  in  the  above  theorem,  y  (a;,  y}  is  a  point-wise 
discontinuous  function  relatively  to  every  perfect  set  G,  of  points  in  (x,  y}. 
The  points  of  continuity  of  f(x,  y),  on  the  curve  y  =  <j>(x\  are  everywhere 
dense  with  respect  to  every  perfect  set  of  points  on  the  curve. 

*  Baire,  Annali  di  Mat.,  Ser.  111%  vol.  in,  p.  27. 
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EXAMPLES. 

1.  If  */(&*,  y)  be  a  function  which  is  everywhere  continuous  with  respect  to  each  of  the 
variables  x,  y,  then  the  points  at  which  the  saltus  of  f(x,  y\  with  respect  to  the  two- 
dimensional  continuum  (x,  y\  is  >  or,  form  a  set  of  points  such  that  the  projection  of  the  set 
on  either  axis,  by  lines  parallel  to  the  other  axis,  is  a  non-dense  set. 

2.  If  t  a  function  /  (x,  y,  z)  of  three  variables  x,  y,  z  be  everywhere  continuous  with 
respect  to  each  variable,  then  /  (x,  y,  z)  is  at  most  a  point-wise  discontinuous  function 
relatively  to   the   three-dimensional   continuum   (x,  y,   z).      Further,   on   every   surface 
x  =  <j>(y,  z),  where  0  is  continuous  with  respect  to  (y,  z),  the  function  f(x,  y,  2)  is  at  most 
a  point-wise  discontinuous  function  with  respect  to  (.y,  z).     The  set  of  points  at  which  the 
saltus  of  f(x,  y,  z)  >  o-  may  contain  all  the  points  of  a  continuous  curve. 

3.  Lett  (f>  (x,  y)  be  a  function  which  is  continuous  with  respect  to  each  of  the  variables 
x  and  y,  and  let  (0,  0)  be  a  point  of  discontinuity  of  $  (x,  y}  with  respect  to  (x,  y}.     Define 
f(x,  y,  z)  by  the  condition  f(x,  y,  z)  =  $  (x,  y)  ;  then  the  function  /(#,  y,  z)  is  continuous 
with  respect  to  each  of  the  three  variables,  but  every  point  on  the  z-axis  is  a  point  of  dis- 
continuity with  respect  to  (x,  y,  z). 

4.  Let  t  /  (x,  y,  z)  be  a  function  which  is  constant  along  any  straight  line  parallel  to 

the  straight  line  x=y  =  z,  and  is  such  that  f  (x,  y,  0)=^^       ,,  /(O,  0,  0)  =  0.      This 

C^+y2)* 
function  is  discontinuous  at  every  point  on  the  straight  line  x=y  =  z. 

5.  Let  \  /(O,  0)  =  0;  at  all  points  at  which  y  is  positive  and  x2/y  <  1,  let  f(x,  y)=$2/y, 
and  when  x2jy  >  1,  \eif(x,y}=y/x2.    Also  \etf(x,  —y)=f(x,  y)-    The  origin  is  a  point  of 
discontinuity,  at  which  the  saltus  is  1  ;  elsewhere  the  function  is  continuous  with  respect  to 
(x,  y).    The  function  is  however  continuous  with  respect  to  every  straight  line  ;  for  it  is 
continuous  along  the  axis  of  x,  where  it  has  the  value  0.     Consider  now  the  straight  line 
y  =  mx,  where  m=t=0;  it  lies  entirely  in  the  part  of  the  plane  in  which 

\f(x,y]  |  =x2/    y  |=    x/m  |, 
so  that,  as  the  point  (0,  0)  is  approached,  f(x,  y)  converges  to  0. 

6.  Let  \  («i,  &i),  (a2,  62)>  •••  (am  bn)  •••  denote  an  enumerable  set  of  points,  and  let 

' 


where  f(x,  y)  denotes  the  function  denned  in  Ex.  5. 

The  function  F(x,  y}  is  continuous  with  respect  to  any  straight  line,  and  is  continuous 
with  respect  to  (x,  y}  at  all  points  except  those  of  the  enumerable  set.  The  enumerable  set 
may  be  everywhere  dense  in  the  plane. 

324.  The  methods  developed  by  Baire  of  dealing  with  functions  of  two 
or  more  variables,  in  relation  to  the  distribution  of  the  points  of  discontinuity, 
have  been  applied  by  him  to  the  consideration  of  the  following  three 
problems  : 

(1)  What  must  be  the  nature  of  a  function  ^>(x],  defined  for  a^«^/3, 
in  order  that  a  function  /(a?,  y}  can  exist  which  is  defined  for  all  points  in  the 

*  Baire,  loc.  cit.,  p.  94.  t  Baire,  loc.  cit.,  p.  99. 

£  W.  H.  and  G.  C.  Young,  Quarterly  Journal  of  Math.,  vol.  XLI,  where  an  example  is  also  con- 
structed of  a  function  that  is  continuous  with  respect  to  every  straight  line  but  is  discontinuous 
with  respect  to  (x,  y)  at  points  of  an  everywhere  dense  unennmerable  set. 


. 
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square  a^ac^  8,  a  £y^.  @,  and  is  continuous  at  every  point  with  respect  to 
x  and  with  respect  to  y,  and  moreover  is  equal  to  <£  (x)  on  the  straight  line 
x  =  yi. 

(2)  What  must  be  the  nature  of  a  function  <£  (x),  defined  for  a  £  x  ^  @, 
in  order  that  a  function  f(x,  y)  can  be  defined  for  all  points  in  the  square 
«^#^jQ,  a  ^  y  ^  &  and  which  shall  satisfy  the  conditions  that  it  is  con- 
tinuous with  respect  to  (x,  y)  at  every  point  for  which  y  >  0,  is  continuous 
with  respect  to  y  at  the  points  of  y  =  0,  and  is  equal  to  <j)  (sc)  when  y  =  0  ? 

(3)  A  function  /  (x,  y)  is  defined  in  the  rectangle  OL^X  £  /3,  y^y  =  8, 
and  is  everywhere  continuous  with  respect  to  y.    Further,  there  is  a  set  of 
parallels  to  the  #-axis,  along  each  of  which  f(x,  y)  is  continuous  with  respect 
to  x  ;  these  parallels  intersecting  the  straight  line  x  =  o  in  a  set  of  points 
which  is  everywhere  dense  in  the  interval  (7,  8).    What  is  the  nature  of  the 
function  f  (x,  y)  on  a  continuous  curve  drawn  in  the  rectangle  ? 

The  problems  (1),  (2)  are  particular  cases  of  (3).  It  has  been  shewn  above 
that  a  necessary  condition  satisfied  by  f  (x,  y),  in  (3),  is  that  it  should  be  a 
point-wise  discontinuous  function  relatively  to  every  perfect  set  of  points. 
That  this  condition  is  also  sufficient,  has  been  demonstrated  by  Baire  in  his 
memoir  quoted  above.  A  proof  of  this  will  be  given,  for  the  case  of  problem 
(2),  in  Vol.  n,  in  connection  with  the  theory  of  functions  representable  as 
the  limits  of  sequences  of  functions. 


THE   REPRESENTATION   OF    A    SQUARE   ON    A    LINEAR   INTERVAL. 

325.  Let  a  point  of  a  square  whose  side  is  unity  be  denoted  by  (x,  y), 
where  0  ^  #  ^  1,  O^y^l;  and  let  t  denote  a  point  of  a  linear  interval  (0,  1). 
An  account  has  been  given  in  §  62  of  Cantor's  method  of  establishing  a  (1,  1) 
correspondence  between  the  points  of  the  square  and  those  of  the  linear  interval. 
Such  a  correspondence  denotes  functional  relations  x  =f(t),  ?/  =  <£(£)  between 
x,  y  as  dependent  variables,  and  t  as  an  independent  variable.  It  will  be  shewn 
however  that  no  (1,  1)  relation  between  the  two  sets  of  points  can  be  a 
continuous  representation  *  ;  i.e.  it  is  impossible  that  the  functions  f(t},  <£(£) 
can  be  both  continuous. 

Let  us  assume  that  such  a  continuous  representation  can  be  defined.  To 
any  closed  set  of  points  {t},  in  (0,  1),  there  will  correspond  a  closed  set  in  the 
plane  area.  For  if  t1}  t2,  ...  tn,  ...  be  a  convergent  sequence  of  points  t,  of 
which  k  is  the  limiting  point,  then  the  point  /(£„),  <f,(ta)  is  the  limiting  point 
of  the  set  of  points  (a?,  ,  y^,  (x2,  y2),  .  .  .  (xn,yn\  .  .  .  which  correspond  to  tlt  t2,  .  .  .tn,  .  .  . 

*  See  Netto,  "  Beitrag  zur  Mannigfaltigkeitslebre,  "  Crelle's  JL,  vol.  LXXXVI.  ;  also  Loria,  Giorn. 
di  Mat.,  vol.  xxv,  p.  97.  In  the  proof  given  by  these  writers  it  is  assumed  that  a  closed  curve  corre- 
sponds to  a  linear  sub-interval  of  (0,  1)  ;  this  is  not  necessarily  the  case,  for  a  non-dense  closed  set 
may  correspond  to  the  closed  curve. 


« 
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respectively;  therefore  to  a  closed  set  [t]  there  corresponds  a  closed  set  {{x,  y}}. 
Again,  to  a  convergent  sequence  (xl}  y^,  (x2,  y.2),  ...  of  points  in  the  plane  area, 
there  corresponds  a  set  of  points  tl}  t.2, ...  in  the  linear  interval,  the  latter  of 
which  has  a  limiting  point  tM  which  must  correspond  to  (xu,  yw} ;  and  since 
only  one  value  of  t  corresponds  to  one  set  of  values  of  {x,  y),  there  can  be  only 
one  such  limiting  point  ta.  Thus,  to  a  closed  set  in  the  plane,  there  corresponds 
a  closed  set  in  the  linear  interval.  Take  two  points  £1}  £2in  the  interval  (0, 1); 
these  points  correspond  to  two  points  P1 ,  P2  in  the  square  area.  To  the  closed 
linear  interval  (t1}  £2)  there  corresponds  a  closed  set  S  which  contains  the  points 
PI,  P2.  It  can  be  shewn  that  there  are  points  other  than  P1}  P2  on  the  frontier 
of  S.  Denote  by  G(S)  the  set  of  those  points  of  the  square  area  which  do  not 
belong  to  S.  Two  points  Q,  R  in  the  square  can  be  determined,  such  that  Q 
lies  on  the  straight  line  PiP2,  and  R  does  not  lie  on  this  straight  line ;  such 
that  neither  Q  nor  R  coincides  with  Pl  or  P2,  and  such  that  one  of  the  two 
belongs  to  S  and  the  other  to  (7  ($).  The  closed  set  consisting  of  the  straight 
line  QR  contains  points  both  of  8  and  of  C(S)',  those  points  of  S  which  lie  on 
it  form  a  closed  set,  and  there  must  be  one  such  point  of  S  at  least  which  is 
on  the  frontier  of  S;  such  a  point  may,  or  may  not,  coincide  with  Q  or  R. 
Since  then  S  contains  points  on  its  frontier  besides  P1  and  P2,  we  can  take  a 
point  i,  within  the  linear  interval  (^ ,  t2),  such  that  the  point  T  in  the  square 
which  corresponds  to  it  is  on  the  frontier  of  S.  Since  T  is  the  limiting  point 
of  a  sequence  of  points  of  C(S),  it  follows  that  i  mast  be  the  limiting  point 
of  a  sequence  of  points  all  of  which  are  external  to  the  interval  (t1}  t2);  and 
this  is  impossible.  It  has  thus  been  established  that : 

No  continuous  (1,  1)  correspondence  can  exist  between  all  the  points  in  a 
square  and  all  the  points  in  a  linear  interval. 

In  particular,  the  correspondence  shewn  by  Cantor  to  exist  must  be 
discontinuous. 

326.  The  reasoning  of  §  325  would  be  inapplicable  if  the  correspondence 
x  *=/(£)>  y  —  4>(t}  were  such  that,  to  a  given  point  (x,  y)  more  than  one  point 
t  may  correspond,  the  functions  f(t),  (f>  (t)  being  still  one- valued  continuous 
functions,  so  that  if  t  be  assigned,  (x,  y}  is  uniquely  determined.  In  this  case, 
the  limiting  point  of  the  set  of  points  external  to  the  interval  (tly  t2)  would 
be  not  i,  but  another  value  of  t  which  also  corresponds  to  the  point  T. 

Peano*  gave  the  first  continuous  correspondence  of  the  kind  just  indicated, 
thus  defining  what,  by  a  considerable  extension  of  the  use  of  geometrical 
language,  may  be  called  a  continuous  curve  which  passes  through  every  point 
of  the  square  at  least  once. 

Let  the  points  in  the  interval  (0,  1)  be  expressed  in  the  form 

t  =  'tti a2a3 . . .  an  . . . , 

*  "  Sur  une  courbe,  qui  remplit  toute  une  aire  plane,"  Math.  Ann.,  vol.  xxxvi,  1890 
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in  radix  fractions  in  the  ternary  scale,  so  that  each  a  is  either  0,  1,  or  2. 
Let  k  (a)  denote  the  number  2  -  a,  so  that  k  (2)  =  0,  k  (1)  =  1,  k  (0)  =  2  ;  and  let 
kn  (a)  denote  the  result  of  performing  this  operation  n  times,  so  that  kn  (a)  is 

a  or  2  —  a,  according  as  n  is  even  or  odd. 

» 
Let  x,  y  be  defined,  for  a  prescribed  t,  by 

x  =  'bJ)J)3  .  .  .  ,    2/  =  'c1c2c3..., 

the  ternary  scale  being  again  employed  ;  the  numbers  b,  c  being  defined  by 
the  relations 


Ci  =  kai  («2),     c2  =  F'+a°  (a4),  .  .  .  cn  =  F"+a'+  ••+« 
thus  bn  is  equal  to  a2n_!  or  to  2  —  a^-i,  according  as  az  +  ai+  ...  +0,^-2  is 


even  or  odd. 

The  numbers  t  may  be  divided  into  two  classes  : 

(1)  Those,  other  than  0  or  1,  which  are  capable  of  a  double  representation 

t  =  '^a^as  .  .  .  an2  2  2  .  .  .  =  •«!  a2  .  .  .  an  -f  1  0  0  0  ____ 

(2)  Those  which  have  a  single  representation  only. 

If  t  be  a  number  of  the  second  class,  a;  and  y  are  uniquely  defined.    If  t  be 
a  number  of  the  first  class 


t  —  'ftja-j ...  an2  22...  =  'a^a*  . ..  an  +  1  000  ... , 

let  '&1&2&3  •  •  •  >  'bib2'b3' . . .  denote  the  numbers  obtained  by  applying  the  definition 
of  x  to  the  two  modes  of  representation  of  t.  If  n  is  even,  say  2m,  it  is  clear 
that 

bi  =  bi,  &2  =  &2',  ...bm  =  bm'; 

also  bm+l  =  &a«+a«+-+a»»  (2),  6',M+1  =  #"+««+-+««n+i  (0), 

A  _  r«2+«1-f--"+a.2m+2  /9>  7/  _  7a2+rt4  +  ---+«2m+l  /fi\ 

ym+2  —  ^  \&J)  "m+2  — ^  i.W> 


hence  6m+1  =  b'm+1 ,     bm+2  =  6'm+2 ,  . . . ; 

and  thus  a;  has  the  same  value  whichever  of  the  two  forms  for  t  is  employed ; 

the  case  in  which  n  is  odd  may  be  similarly  treated. 

The  same  result  can  readily  be  shewn  to  hold  for  y.  Therefore,  corresponding 
to  any  assigned  t,  x  and  y  are  uniquely  determined. 

Next,  let  us  suppose  x  and  y  to  be  assigned.    We  have 

ai  =  blt  a2  =  kb>  (c,),,  og  =  k*  (6a),  «4  =  k/h+b*  (c2),  . . . 

n  _7,e,  +  c,+  ...+Cn-1/7    \  _  ;.&i  +  M-...+6n  /      \  . 

"an— i  —  A  \"n/5  t^an  —  ft  V^i/  > 

for,  if  j9  =  kr(q),  then  |>  +  5  is  an  even  number. 

In  case  x,  y  are  both  of  the  second  class,  t  is  uniquely  determined. 
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If  x  is  of  the  first  class,  and  y  of  the  second  ;  let 

x  =  •61  62  .  .  .  bn  2  2  2  .  .  .  =  •&!  62  .  .  .  6n  +  l  000..., 
2/  =  Cj  C2  .  .  .  cn  cn_[_1  .  .  .  , 
and  let  the  two  values  of  t  be  denoted  by  'a1a2a3  .  .  .  ,  'a/a/  .  .-.  . 

It  is  clear  that 


also         o»  =  &  (a'2n),  a27l+1  =  ***+-*»  (fcB),  a'2n+1  = 

thus  a2n+i,  a'oji+i  are  not  identical,  although  a2n,  a2n'  will  be  so  if  each  is  unity. 
It  is  thus  seen  that  t  has  two  distinct  values  corresponding  to  one  point  (x,y), 
when  #  is  a  number  of  the  first  class,  and  y  is  of  the  second  class.  It  can  be 
shewn  in  a  similar  manner  that  there  are  four  points  t  corresponding  to  a 
single  point  (x,  y)  such  that  x,  y  are  both  numbers  of  the  first  class. 

The  correspondence  is  continuous.  For  if  t,  t'  are  identical  as  regards  the 
first  2n  figures,  x  and  x  are  identical  as  regards  their  first  n  figures,  and  the 
same  is  true  of  y  and  y  '  . 

The  curve  which  has  thus  been  defined  is  a  continuous  curve  which  passes 
through  each  point  in  the  square  at  least  once;  there  is  an  everywhere  dense 
enumerable  set  of  points  through  each  of  which  the  curve  passes  twice,  and 
another  everywhere  dense  enumerable  set  of  points  through  each  of  which  it 
passes  four  times;  through  each  point  of  the  remaining  unenumerable  set  of 
points,  the  curve  passes  once  only. 

The  plane  measure  of  an  arc  of  Peano's  curve  which  corresponds  to  an 
interval  (t0,  ^)  is  not  zero,  i.e.  the  area  which  a  number  of  rectangles  enclosing 
all  the  points  of  the  arc  have  in  common  has  a  lower  limit  greater  than  zero. 

The  two  continuous  functions  f(f),  </>  (t),  which  define  x,  y  as  functions  of 
t,  do  not  possess,  for  any  value  of  t,  definite  differential  coefficients,  and  are 
perhaps  the  simplest  examples  of  continuous  non-differentiable  functions. 

327.  It  might  at  first  sight  appear  that  a  curve  having  the  same  properties 
as  that  of  Peano  might  have  been  defined  by  restricting  t  —  'a^a*  ...  to  be  such 
that  an  infinite  number  of  digits  other  than  0  are  present,  and  then  defining 

x,  y  by 

x  =  'ajttstt.,  .  .  .  ,  y  =  1a,2a4a6  — 

If  however  the  double  representation  of  x,  y  were  not  restricted,  as  in  the 
case  of  t,  there  would  be  no  value  of  t  corresponding  to,  say, 

£•  =  •1000...,     y  =  -2000.... 

If  (x,  y)  were  on  the  other  hand  so  restricted,  there  would  be  no  values  of  (x,  y) 
corresponding,  for  example,  to 

£  =  •111010101., 
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It  thus  appears  that  some  such  rule  as  that  given  by  Peano  is  necessary  to 
obviate  the  difficulty  caused  by  the  double  representation  of  a  certain  class  of 
rational  numbers,  in  a  given  scale. 

The  method  may  easily  be  extended  to  obtain  a  continuous  correspondence 
between  the  points  in  a  cube  and  those  in  a  linear  interval. 

A  somewhat  different  method  of  establishing  correspondence  between  the 
points  of  the  square,  and  those  of  the  linear  interval,  is  the  following*  : 

Let  ti  denote  one  of  the  perfect  set  of  points  defined  by 


when  every  a  is  either  0  or  2.   For  such  a  point  t1}  x  and  y  may  be  defined  by 

/yi    ^^    _         I        _!._ I 

~  2  V  2      2*     2*      '" 

/  Ct'2         Ct^         Cvg 

^,2\22t2I 

A  point  £  which  does  not  belong  to  the  perfect  set  is  interior  to  one  of  the 
complementary  intervals  (£/,  t")  of  the  set;  in  such  an  interval  we  may  define 
x,  y  as  linear  functions  of  t,  thus 

,     of -a?  ,4       ,x 

ff  —  of  JL.    if f  M 

•^  —  •*     i    j.  i        j.  n  \y  —  fcl  />  • 

tj         fcj 


where  (x',  y'},  (x",  y")  correspond  to  £/,  t"  respectively. 

328.    A  method  of  constructing  a  continuous  curve  which  fills  a  square  has 
been  given  in  a  geometrical  form  by  Hilbertf. 


1  234 

FIG.  7. 

See  Lebesgue,  Le<;ons  sur  I 'integration,  p.  44.         f  See  Math.  Annalen,  vol.  xxxvm,  p.  459. 
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Divide  the  interval  (0, 1)  into  four  equal  parts,  and  number  them  in  order 
as  1,  2,  3,  4.  Then  divide  the  square  into  four  equal  parts,  as  in  Fig.  7,  and 
number  them  1,  2,  3,  4,  to  correspond  with  the  segments  of  the  linear  interval. 
Next  divide  each  segment  of  the  straight  line  into  four  equal  parts,  and  each 
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of  the  four  squares  into  four  equal  parts  as  in  Fig.  8.  The  sixteen  squares  so 
formed  are  then  numbered  in  order,  so  that  each  square  has  one  side  in  common 
with  the  one  next  in  order ;  the  squares  then  correspond  with  the  segments 
numbered  in  the  same  way.  At  the  next  stage  there  are  (Fig.  9)  64  squares 
corresponding  to  64  segments  of  the  interval  (0, 1).  Proceeding  in  this  manner 
indefinitely,  any  point  of  (0, 1)  is  determined  by  the  intervals  of  the  successive 
set  of  sub-divisions  in  which  it  lies.  The  corresponding  point  in  the  square 
area  is  determined  by  the  succession  of  squares,  each  containing  the  next,  in 
which  it  lies.  The  curve  is  thus  determined  as  the  limit  of  a  sequence  of 
polygons  denoted  by  the  thickened  lines  in  the  figures.  The  curve  thus 
obtained  is  continuous,  but  has  no  tangent.  Hilbert  remarks  that,  if  the 
interval  (0, 1 )  be  taken  as  a  time  interval,  a  kinematical  interpretation  of  the 
functional  relation  between  the  curve  and  the  segment  is  that  a  point  may 
move  so  that  in  a  finite  time  it  passes  through  every  point  of  the  square 
area. 

Continuous  curves  of  this  kind  can  be  constructed  by  any  method  by 
which  an  everywhere  dense  enumerable  set  of  points  in  the  square  can  be 
made  to  correspond  with  a  similar  set  of  points  in  the  linear  interval ;  provided 
the  functional  relation  x  =f(t),  y  =  <j>  (t),  in  such  correspondence,  is  uniformly 
continuous.  For,  when  this  condition  is  satisfied,  the  functions  obtained  by 
the  method  of  extension  of  f(t),  (f>(t)  to  the  remaining  points  of  (0,  1)  as 
secondary  points  (see  §  287)  will  yield  a  correspondence  of  all  the  points  of 
the  square  with  those  of  the  linear  interval,  of  the  required  character. 

Another  method  differing  from  that  of  Hilbert  has  been  given  by  E.  H. 
Moore*  and  by  Schoenfliesf. 

Let  m  be  an  uneven  number  (in  the  figure,  m  =  3);  divide  the  linear  interval 
j(0,  1)  into  m?  equal  parts,  and  also  the  square  into  m2  equal  parts.  Let  these 
squares  be  passed  through  by  a  polygonal  line,  of  which  the  sides  are  diagonals 
of  the  squares,  as  in  the  figure ;  in  this  manner  the  squares  are  arranged  in 
order  1,  2,  3,  ...m2,  and  are  placed  into  correspondence  with  the  segments 
bearing  the  same  numbers.  At  the  same  time  the  end-points  of  a  diagonal  so 
traversed  are  made  to  correspond  with  the  end-points  of  a  segment  of  the  linear 
interval.  Thus  m2  -f  1  points  in  the  linear  interval  are  placed  into  correspondence 
with  points  in  the  square,  so  that  to  each  of  the  m2  + 1  points  of  the  linear 
interval  there  is  one  point  in  the  square ;  but  the  converse  is  not  the  case. 
Next,  divide  each  of  the  m-  linear  intervals  into  m2  equal  parts,  and  the 
corresponding  squares  into  ?/i2  equal  parts;  then  construct  as  before  a  polygon 
traversing  diagonals  of  all  the  m4  squares,  making  their  end -points  correspond 
to  the  end-points  of  the  corresponding  m4  parts  of  the  linear  interval.  Pro- 
ceeding in  this  manner,  we  gradually  place  points  in  the  square,  consisting 

*  Trans.  A  mer.  Math.  Soc.,  vol.  i,  p.  77. 

t  Bericlit  iiber  die  Mengenlehre,  vol.  i,  p.  121. 
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of  an  everywhere  dense  enumerable  set,  into  correspondence  with  a  set  in  the 
linear  interval  which  possesses  the  same  property;  and  the  functional  relation 
so  set  up  is  uniformly  continuous.  The  definition  of  the  functions  for  the  whole 
linear  interval  is  then  obtained,  as  explained  above,  by  the  method  of  extension. 


0123456789 

FIG.  10. 

The  case  in  =  3  corresponds  to  Peano's  analytical  method.  In  the  method 
of  Moore  and  Schoenflies,  the  curve  is  determined  as  the  limit  of  a  sequence 
of  polygons  inscribed  in  the  curve.  In  Hilbert's  method  the  polygons  which 
approximate  to  the  form  of  the  curve  are  not  inscribed  in  the  curve,  but  are 
otherwise  determined. 


CHAPTER  VI 

THE  RIEMANN  INTEGRAL 

329.  THE  fundamental  operation  of  the  calculus,  known  as  integration, 
regarded  from  one  point  of  view,  consists  essentially  in  the  determination  of 
the  limit  of  the  sum  of  a  finite  series  of  numbers,  as  the  number  of  terms  of 
the  series  is  indefinitely  increased,  whilst  the  numerically  greatest  of  the 
individual  terms  of  the  series  approaches  the  limit  zero.  The  laws  which 
regulate  the  specification  of  the  terms  of  the  series  must  be  supposed,  in  any 
given  instance,  to  be  assigned,  and  to  be  of  such  a  character  that  the  limit  in 
question  exists.  It  is  in  this  form  that  the  problem  of  integration  naturally 
presents  itself  in  ordinary  problems  of  a  geometrical  or  physical  character,  such 
as  the  determination  of  lengths,  areas,  volumes,  etc.  The  method  of  integration, 
so  regarded,  has  its  origin  in  the  method  of  exhaustions  employed  by  the  Greek 
geometers,  and  was  developed  later  in  forms  whose  theoretical  exactitude  de- 
pended at  various  epochs  upon  the  stage  which  the  development  of  Analysis 
in  general  had  reached.  In  the  hands  of  Cauchy,  Dirichlet,  and  Riemann, 
the  definition  of  the  definite  integral  gradually  attained  to  an  exact  arithmetic 
form  which  fitted  it  for  the  purposes  of  modern  analysis ;  and  in  fact  the 
definition  given  by  Riemann  leaves  nothing  to  be  desired  as  regards  precision. 
Riemann  not  only  formulated  a  rigorous  arithmetical  definition  of  the  integral 
of  a  bounded  function,  but  also  established  a  necessary  and  sufficient  con- 
dition for  the  existence  of  the  definite  integral  of  the  function.  This  definition 
of  the  definite  integral  was,  in  the  latter  part  of  the  nineteenth  century,  the 
one  that  was  employed  in  all  rigorous  mathematical  analysis,  but  in  the  present 
century  it  has,  for  the  purposes  of  theoretical  investigations,  been  largely 
superseded  by  the  more  general  formulation  of  Lebesgue.  In  accord- 
ance with  Lebesgue's  definition,  integrable  functions,  i.e.  functions  which 
possess  a  definite  integral,  form  a  class  which  is  markedly  wider  than,  and 
includes,  the  class  of  functions  that  are  integrable  in  accordance  with  Riemann 's 
definition.  Although  the  definition,  of  which  the  most  precise  formulation  is 
that  of  Riemann,  is  primarily  applicable  only  to  a  bounded  function  over  a 
bounded  interval,  it  was  extended  by  Cauchy,  Harnack,  de  la  Vallee  Poussin, 
and  others,  to  cases  in  which  the  function  to  be  integrated  is  unbounded,  and 
to  integration  over  unbounded  intervals.  It  was  also  extended  to  the  case  of 
double,  or  multiple,  integration. 

The  process  of  integration,  leading  to  the  indefinite  integral,  is  also  regarded 
as  the  operation  inverse  to  that  of  differentiation  ;  and  the  relation  of  this  mode 
of  regarding  integration  with  the  one  referred  to  above  has  been  formulated 
in  what  is  known  as  the  fundamental  theorem  of  the  Integral  Calculus.  Many 
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important  investigations  are  concerned  with  the  relation  between  the  two 
'modes  of  regarding  integration,  and  with  the  establishment  of  the  fundamental 
theorem,  including  an  examination  of  the  limitations  to  which  it  is  subject. 
It  is  in  this  connection  that  the  advantages  of  the  Lebesgue  Integral  and  its 
generalizations,  over  the  Riemann  Integral,  are  most  apparent. 

The  Riemann  Integral  is  not  only  of  interest  from  an  historical  point  of 
view,  but  it  still  possesses  great  intrinsic  importance  in  Analysis,  and  will 
continue  to  be  the  basis  upon  which  practical  applications  of  the  Integral 
Calculus  rest.  Accordingly  an  account  of  the  theory  of  Riemann  integration 
will  be  given  in  the  present  Chapter.  An  account  is  also  given  in  the  present 
Chapter  of  the  properties  of  the  Riemann  Integral,  and  of  some  of  its  extensions 
to  the  case  of  unbounded  functions,  or  of  integration  over  unbounded  domains. 
A  more  complete  account  of  these  extensions  is  however  included  in  the  more 
general  theory  of  Lebesgue  integration,  and  its  extensions,  which  will  be 
discussed  in  Chapters  VII  and  vin. 

THE  RIEMANN  INTEGRAL  IN  A  LINEAR  INTERVAL. 

330.  Let/(#)  be  a  bounded  function,  defined  for  the  closed  linear  interval 
(a,  b),  where  b  >  a;  so  that  there  exist  an  upper  boundary  U,  and  a  lower 
boundary  L,  of  the  functional  values  in  the  closed  interval.  Let  a  system  of 
nets,  with  closed  meshes,  be  applied  to  the  interval  (a,  b),  and  let  Sj(n),  S2(n), 
•  ••  &mn(n]  denote  the  breadths  of  the  mn  meshes  of  the  net  Dn,  of  the  system. 
Let  M  (Br{n})  denote  any  number  so  chosen  as  to  be  not  greater  than  the  upper 
boundary  of  the  function/'  (x)  in  the  closed  mesh  Sr{n},  and  so  as  not  to  be  less 
than  the  lower  boundary  of  f(x)  in  the  same  rnesh,  and  consider  the  sum 


If  'the  sequence  Slt  S2,  ...  Sn,  ...be  convergent,  and  have  the  same  number  S 
for  its  limit,  whatever  system  of  nets,  applied  to  (a,  b),  be  employed,  and  however 
the  numbers  M  (Srw)  be  chosen,  subject  only  to  their  limitation  in  relation  to  the 
upper  and  lower  boundaries  of  f(x)  in  the  meshes  Br{n),  then  the  function  f(x) 
is  said  to  have  a  Riemann  integral  in  the  interval  (a,  b),  and  the  number  S 
defines  the  value  of  its  integral.  The  integral,  when  the  limit  S  exists,  is  denoted 

rb 

by      /(a?)  dx. 


It  will  be  observed  that  J/(S)  is  not  necessarily  the  value  of  f(oc)  at  any 
point  in  the  interval  8 ',  for  all  that  is  necessary  is  that  it  should  not  be  greater 
than  the  upper  boundary,  nor  less  than  the  lower  boundary,  of  f(x)  in  the 
closed  interval  8.  In  this  respect  the  definition  is  a  slight  generalization  of 
that  given  by  Riemann*,  who  restricted  M(S)  to  have  the  value  of  f(x)  at 
some  point  in  the  interval. 

*  See  Riemann's  Gesam.  Werke,  2nd  ed.,  p.  239. 
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A  Riemann  integral  will  be  spoken  of  as  an  R-integral,  and  a  function 
f(oc)  which  has  an  .R-mtegral,  in  (a,  b),  will  be  said  to  be  integrable  (R)  in  (a,  b). 

The  definition  of  a  definite  integral,  of  which  Rieraann's  definition  is  a 
development,  was  given  by  Cauchy  for  the  case  of  a  continuous  function. 
Cauchy's  definition*  is  in  fact  what  arises  when  M(8)  is  in  every  case  restricted 
to  be  the  functional  value  at  one  end  of  the  interval  8.  Thus  it  may  be  ex- 
pressed by 

rb 

f(x)  dx  =  hm  [(ar,  -  a)f(a)  +  (x2  -  x:)  /(#,)  +  ...+  (b  -  #„_,)/  0«-i)], 


where  a,  Xi,  xz,  ...  #n_i,  b  are  the  end-points  of  a  set  of  sub-divisions  of  (a,  b), 
and  the  limit  is  determined  under  the  conditions  stated  above,  which  involve 
the  convergence  to  zero  of  the  length  of  the  greatest  of  the  sub-intervals 


THE  UPPER  AND  LOWER  RIEMANN  INTEGRALS. 

331.  The  investigation  of  the  necessary  and  sufficient  conditions  that  the 
bounded  function  f(x)  may  have  an  j?-integral  in  (a,  6),  in  accordance  with 
the  above  definition,  is  considerably  simplified  -by  the  introduction  of  the 
upper  and  lower  ^-integrals  of  the  function  /(#)  in  the  interval  (a,  b). 

If  (or,  /3)  be  any  interval  contained  in  (a,  b),  the  upper  and  lower  boundaries 
of  f(x)  in  the  closed  interval  (a,  /9)  may  be  denoted  by  £//,  La&  respectively. 
The  upper  and  lower  boundaries  of  f(x)  in  (a,  b}  are  accordingly  denoted  by 
Uab,  Lab  ;  or  simply  by  U  and  L.  If  the  interval  (a,  /3)  be  denoted  by  a  single 
letter  8,  we  .may  write  U  (8},  L  (8)  for  Uf,  La0  respectively. 

If  (a,  b)  be  divided  into  any  number  of  parts  denoted  by  8ly  82,  ...  8m,  taken 

r=m 

in  order  from  left  to  right,  the  sum  S  8r  U(8r)  has  a  definite  lower  boundary  \ 


r=m 


and  the  sum  %  8r  L  (8r)  has  a  definite  upper  boundary,  when  all  possible  modes 
?-=i 

of  dividing  {a,  b)  into  parts  are  taken  into  account.     These  lower  and  upper 
boundaries  are  defined  to  be  the  upper,  and  the  lower~\,  R-integrals  of  f(x)  in 


(a,  b),  and  are  denoted  by  \    f(x)dx,      f(x)dx  respectively. 

J  a  J  a 

In  each  of  the  sums,  8r  denotes  the  length  of  the  interval  described  by  the 

r  =  in  r  =  m 

same  letter.    Since  2  8r  U(8r)  ^=L(b  —  a),  2  8rL(8r)£  U(b  —  a),  it  is  clear 

r=\  r=l 

that  the  first  sum  has  a  finite  lower  boundary,  and  that  the  second  sum  has 

*  Jl.  de  VEcole  Polytechnique,  cab.  xix  (1823),  pp.  571  and  590. 

f  The  upper  integral  and  the  lower  integral  are  named  by  Jordan  "Pintegrale  par  exces"  and 
"  Pintegrale  par  defaut "  respectively ;  see  his  Cours  d'A nalyse,  vol.  i,  p.  34.  They  were  introduced 
by  Darboux,  Annales  de  Vecole  normal e,  (2),  vol.  iv,  and  also  by  Thomae,  Einleitung,  p.  12 ;  and  by 
Ascoli,  Atti  di  Lincei,  (2),  vol.  n,  p.  863. 
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a  finite  upper  boundary.  The  upper,  and  the  lower,  integrals  consequently 
always  exist. 

The  following  theorem  will  be  established : 

If  e  be  an  arbitrarily  chosen  positive  number,  a  number  d  can  be  so  deter- 
mined that,  for  any  net  such  that  the  breadths  of  all  its  meshes  are  <  d,  the 

r=m 

sum    2  8r  U  (8r)  exceeds  the  upper  integral  of  the  function  by  less  than  e. 

r=l 

rb  r=m 

Since   I    f(x)dx  is  the  lower  boundary  of  the  sum  2  8rU(8r),  for  all 

J  a  r  =  l 

r=m  cb 

possible  nets,  a  net  D  exists,  such  that  2  8r  U(8r)  <       /(#)  dx  +  ^e ;   where 

r=l  .'a 

the  summation  is  taken  for  the  m  meshes  of  the  net  D.  Let  d  be  chosen  to  be 
such  that  (m  +  1)  Ud<^e:  and  consider  any  net  for  which  all  the  meshes 
have  breadths  <  d.  Those  meshes  of  this  second  net  which  contain  end-points 
of  the  meshes  of  D  are  at  most  m  + 1  in  number,  and  the  part  of  the  sum 
2  8U(8),  taken  for  the  second  net,  which  corresponds  to  these  m  +  1  meshes 
is  <  |e.  All  the  other  meshes  of  the  second  net  are  interior  to  meshes  of  D, 
and  the  part  of  2  8U (8)  which  corresponds  to  these  meshes  is  less  than 

~ 
f(x)  dx  +  ^e.    Therefore  the  sum  2  817  (8),  for  all  the  meshes  of  the  second 


net,  is  < 


f(x}  dx  +  e  ;  and  this  is  the  case  for  every  net  the  breadths  of 


whose  meshes  are  all  <  d. 

For  any  system  of  nets  {J)n},  the  breadths  of  all  the  meshes  of  Dn  are  <  d, 
for  all  values  of  n,  from  and  after  some  fixed  integer.  We  thus  have  the 
following  theorem  : 

If  a  system  of  nets,  with  closed  meshes,  be  fitted  on  to  (a,  b),  and  2M  denote 
the  sum  8^  U(8^)  +  8.2™  U(82ln>)  +  ...  +  Smn™  U(Srnn{n>),  where  ^n\  8.^,  ... 
&mn{n>  denote  the  breadths  of  the  mn  meshes  of  the  nth  net  Dn,  of  the  system; 

Tb 

then  21?  22,  ...  2?l,  ...  is  a  sequence  of  numbers  which  converges  to      /(*')  dx, 

J  a 

whatever  be  the  system  of  nets  employed. 

rb 

The   corresponding  theorem  for       /(&')  dx  may  be  proved  in  a  similar 

J  a 

manner;  or  it  may  be  deduced  from  the  above  by  observing  that 

-6 

{-f(x)}dx  =  -      f(x)dx. 

J  a 

If,  in  the  sum  2n,  we  had  taken,  instead  of  U(8),  the  upper  boundary  of 
f(x}  in  the  open  interval  8,  we  should  have  obtained  the  sum 
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where  the  interval  8r(n}  is  (xr-\,  ®r),  and  UXJ~  +0  denotes  the  upper  boundary 
of  f(x)  in  the  open  interval  8r(n}. 

We  have  2n  =  2n,  hence  2  and  2,  the  lower  boundaries  of  2n,  2n,  satisfy 
the  condition  2  =  2. 

Let  Sn+p  be  compared  with  2n.  In  any  mesh  8',  of  Dn+p,  that  is  contained 
in  (#,._!,  xr),  or  S,  the  value  of  £/(&')  is  ^  ^r~,n;  unless  8'  has  an  end-point 

**V—  i  T  " 

at  #,._!  or  #rj  in  which  case  the  upper  boundaries  of  f(x)  in  the  closed  and  in 
the  open  interval  8  differ  from  one  another  by  not  more  than  U—L.  Hence 
we  have  2n+Jt,  —  in  =  2  (  U  —  L]  8mn,  where  8  is  the  length  of  the  greatest  mesh 
in  Dn+p.  As  p  is  increased  indefinitely,  8  is  diminished  indefinitely,  and  there- 
fore 2  £  2n.  As  this  holds  for  each  value  of  n,  we  have  2^2;  and  since  also 
2  =  2,  we  have  shewn  that  2  and  S  are  equal.  It  is  clear  that  the  corresponding 
result  will  hold  in  the  case  of  the  lower  integral.  It  has  thus  been  shewn  that  : 

In  the  definitions  of  the  upper  and  lower  integrals  of  a  function  the  upper 
and  lower  boundaries  of  the  function  in  an  open  interval  8  may  be  employed 
instead  of  the  upper  and  lower  boundaries  in  the  corresponding  closed  interval. 

332.  It  has  thus  been  shewn  that  a  bounded  function  /(#),  defined  for 
the  interval  (a,  b),  always  possesses  an  upper,  and  a  lower,  integral  in  that 
interval.  The  necessary  and  sufficient  condition  that  f  (x)  should  have  an 
^-integral  in  (a,  6)  is  that  the  upper,  and  the  lower,  integrals  in  the  interval 
should  be  equal. 

That  the  condition  is  necessary  follows  from  the  fact  that  all  the  numbers 
M  (8),  in  the  sums  Si,  S2,  ...,  may  be  made  identical  with  U  (8);  or  that  all 
may  be  made  identical  with  L(8).  That  the  condition  is  sufficient  follows 

r=mn  r  =  mn 

from  the  fact  that  Sn  lies  between    2    8r^U(8r^)  and    2   8,.WL  (8r{n});  and 

r=l  r=l 

thus  that,  when  the  two  latter  sums  have  the  same  limit,  that  limit  is  also 
the  limit  of  8n.  We  have  therefore  obtained  the  following  theorem  : 

The  necessary  and  sufficient  condition*  that  the  bounded.  function  /(#)  may 
be  integrable  (R),  in  (a,  b),  is  the  following  :  Let  F(8{n})  denote  the  fluctuation 
[7(8(n>)  —  L(^>(n))  of  the  function  in  the  interval  8{n},  which  may  be  either  closed 
or  open  ;  then  it  must  be  possible  to  define  a  system  of  nets  fitted  on  to  (a,  b), 
such  that,  for  the  net  Dn  ,  the  sum 


has  the  limit  zero,  as  n  is  increased  indefinitely. 

That  this  limit  exists,  and  is  equal  to  zero,  is  equivalent  to  the  statement 

*  See  Biemann's  Gesam.  Werke,  2nd  ed.,  p.  240. 

28—2 
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that,  corresponding  to  an  arbitrarily  chosen  positive  number  e,  a  net  Dn 
belonging  to  a  given  system,  exists,  such  that  the  absolute  value  of 


for  that  value  of  n,  and  for  all  greater  values,  is  less  than  e. 

The  necessary  and  sufficient  condition  for  the  existence  of  I  /  (x}  dx  may 

J  a 

be  stated  in  the  following  somewhat  more  convenient  form : 

If  any  system  of  nets  be  fitted  on  to  the  interval  (a,  b),  and  k  be  an  arbitrarily 
chosen  positive  number,  the  sum  of  the  lengths  of  those  meshes  of  Dn  in  which 
the  fluctuation  of  f  (x)  is  ~  k  must  converge  to  zero,  as  n  is  indefinitely  increased. 

To  see  that  the  condition  so  stated  is  sufficient,  we  observe  that,  if  s(n)  be 
the  sum  of  the  lengths  of  those  meshes  of  Dn  in  which  the  fluctuation  of  f(x) 

is  ^k,  then    2  "S,.(n)^(Sr(n>)  £  s(n)  (U -  L)  +  k(b-  a  -s(n)). 

r=l 

Since  s{n}  ^  0,  as  n  ~  oo  ,  the  upper  limit  of  the  sum  on  the  left-hand  side 
is  ^k(b  —  a).    Since  k  is  arbitrary,  the  sum  converges  to  zero. 
To  shew  that  the  condition  is  necessary,  we  observe  that 

r=mn 

2    BrWF($rw)  ~  ks(n)  +(b-a- s(n}) F  ^ ks(n} ; 

V=  1 

where  F  is  the  least  of  the  fluctuations  in  all  the  intervals  B(n}.  Unless  s(n> 
converges  to  zero,  it  is  impossible  that  2  Sr(n) F (Sr(n))  can  have  the  limit  zero. 

The  necessary  condition  for  the  existence  of  the  ^-integral  may,  in  view 
of  the  first  theorem  of  §  331,  be  stated  as  follows  : 

The  necessary  and  sufficient  condition  that  the  bounded  function  f(x)  may 
be  integrable  (R)  in  (a,  b),  is  that,  corresponding  to  an  arbitrarily  chosen  positive 
number  e,  a  positive  number  d  can  be  so  determined  that,  for  every  net  such  that 

r=m 

the  maximum  breadth  of  its  meshes  is  <  d,  the  sum    2    BrF(Sr)  is  less  than  e. 

r=\ 

333.  The  most  succinct  form  of  the  necessary  and  sufficient  condition  that 
a  bounded  function  is  integrable  (R)  is  the  following* : 

The  necessary  and  sufficient  condition  that  a  bounded  function  may  be 
integrable  (R),  in  the  interval  for  which  it  is  defined,  is  that  the  points  of  dis- 
continuity of  the  function  form  a  set  of  measure  zero. 

It  is  convenient  to  express  this  condition  in  the  form  that  the  function 
must  be  continuous  almost  everyivhere  in  the  interval. 

To  shew  that  the  condition  is  sufficient,  let  us  consider  the  closed  set  Gk 
at  which  the  saltus  w  (x},  oi'  f(x),  is  =  k,  where  A;  is  a  positive  number.  If  an 
interval  8  contain  a  point  of  Gk  within  it,  the  fluctuation  of  f(x)  in  8  is  £  k. 
If  a  point  of  Gk  is  the  common  end-point  of  two  intervals,  of  equal  length,  the 

*  Lebesgue,  Annali  di  Mat.,  (3),  vol.  vn,  p.  254. 
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fluctuation  of  /(#)  in  one  at  least  of  these  intervals  is  ^  \  k;  hence  the  part 
which  these  two  intervals  contribute  to  the  sum  S  BF(B)  is  ^  \  k8.  If  we  have 
a  net  with  equal  meshes  fitted  on  to  (a,  b),  the  contribution  of  all  those 
meshes  which  contain,  within  them  or  at  an  end-point,  a  point  of  Gk,  is  not 
less  than  the  product  of  ^k  into  the  sum  of  the  breadths  of  these  meshes. 
Unless  the  content  of  Gk  is  zero,  the  sum  of  the  breadths  of  these  meshes  is 
greater  than  some  fixed  positive  number,  for  all  the  nets  of  a  symmetrical 
system.  It  is  therefore  necessary  for  the  existence  of  the  U-integral  that  the 
content  of  Gk  should  be  zero;  and  this  must  be  the  case  for  every  positive 
value  of  k.  The  set  of  points  of  discontinuity  of  the  function  is  the  outer 
limiting  set  of  {Gtn},  where  {kn}  is  a  sequence  of  diminishing  values  of  k  that 
converges  to  zero.  It  follows  that  the  set  of  points  of  discontinuity  of  the 
function  must  have  measure  zero. 

To  shew  that  the  condition  is  sufficient,  we  observe  that,  if  the  content  of  Gk 
is  zero,  all  'the  points  of  Gk  are  contained  within  intervals  of  a  finite  set  the 
sum  of  whose  lengths  is  <  e.  The  intervals  complementary  to  this  finite  set 
have  a  total  measure  >  b  —  a  —  e,  and  at  every  point  in  each  of  them  ta  (x)  <  k. 
In  accordance  with  the  theorem  of  §  234,  each  of  these  complementary  intervals 
can  be  divided  into  a  number  of  parts,  in  each  of  which  the  fluctuation  is  <  2k. 
Let  this  be  done  for  each  of  the  complementary  intervals.  We  have  now  a 
net  fitted  on  to  (a,  b),  such  that  the  sum  of  the  breadths  of  those  meshes  in 
which  the  fluctuation  is  =  2k  is  <  e. 

For  this  net  2  £>F(&)  <  e  ( U  —  L)  +  2k  (b  —  a  —  e) ;  and  since  k  and  e  are  both 
arbitrarily  small,  a  net  can  be  determined  for  which  ^  BF(8)  has  an  arbitrarily 
small  value.  The  condition  of  integrability  is  therefore  satisfied  if,  for  every 
value  of  k,  Gk  has  content  zero,  that  is,  if  the  set  of  points  of  discontinuity  of 
the  function  has  measure  zero. 

334.    The  following  theorem*  will  now  be  established: 

If  f(x)  be  bounded  in  (a,  b),  and  &>  (x}  be  the  saltus  off(x),  at  x,  then 

7b  rb  "rb 

f  (x)  dx  —       f  (x)  dx=  I    u>  (x)  dx. 

.a  -a  J  a 

If  |  be  any  point  in  (a,  b),  there  exists  an  interval  (f  —  77,  £  +  r/')  containing 
£  as  an  interior  point,  such  that,  in  any  interval  interior  to  that  interval,  the 
fluctuation  of  f(x)  is  less  than  <w  (£)  +  e.  If  d  be  an  arbitrarily  chosen  number, 
and  X  be  the  smallest  of  the  three  numbers  ^  d,  ^  rj,  \  rj ',  the  interval  (£  —  X, 
£  +  X),  or  the  part  of  it  in  (a,  b),  is  of  length  not  greater  than  d,  and  the 
fluctuation  in  it  is  <  «(£)  +  e.  Taking  fixed  values  of  the  positive  numbers 
d,  e,  a  single  such  interval  corresponds  to  each  point  £,  of  (a,  b).  Employing 

*  This  theorem  was  given  by  de  la  Vallee  Poucsin,  see  his  Cours  <V  Analyse,  3rd.  ed.,  vol.  i, 
p.  254.  A  proof  of  the  theorem,  other  than  that  given  above,  was  provided  by  Pollard,  see  Messenger 
of  Math.,  vol..  xuv,  p.  141,  where  a  criticism  is  given  of  de  la  Vallee  Poussin's  proof. 
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the  Heine-Borel  theorem,  it  follows  that  there  exists  a  finite  set  of  overlapping 
intervals,  covering  (a,  6),  such  that,  if  8  be  any  one  of  them,  the  fluctuation  of 
f(x)  in  8  is  <  &)  (£)  +  e,  where  £  is  a  definite  point  interior  to  8. 

Consider  two  overlapping  intervals  (a,  /3),  (a',  /3'),  where  {3'  >  @,  of  this 
finite  set.  Let  £,  £'  be  the  two  definite  points  interior  to  these  intervals.  If 
neither  £  nor  f  '  is  in  the  part  («',  ft),  common  to  the  intervals,  we  can  replace 
the  two  intervals  by  the  non-overlapping  intervals  (a,  /3),  (/3,  ft).  In  the  first 
of  these  the  fluctuation  of  /(£)  is  <  to  (£)  +  e,  and  in  the  second  it  is 

<  o>  (£')  +  e.    If  (a',  /3)  contains  one  of  the  two  points  £,  £',  say  £',  we  replace 
the  intervals  by  (a,  a')  and  («',  /3'),  which  have  the  same  properties  as  in  the 
last  case.    If  («',  /3)  contains  both  the  points  £,  £',  let  to  (£)  >  to  (£')  ;  we  then 
take  (a,  £),  (£>  $')  in  place  of  (a,  /3)  and  (a',  ft}.     In  (a,  £)  the  fluctuation  is 

<  to  (£)  +  e,  and  in  (f,  /?')  it  is  also  <  <w  (£)  -f  e.   We  can  proceed  in  this  manner 
with  any  pair  of  intervals  that  overlap  one  another,  and  we  finally  obtain  a 
net,  fitted  on  to  (a,  b),  such  that,  in  any  mesh  8,  the  fluctuation  of  the  function 
is  <  a)  (£{)  +  e;  where  £5  is  a  definite  point  within,  or  at  an  end  of,  the  mesh  8; 
moreover  the  breadths  of  the  meshes  do  not  exceed  d.     If  co  (8)  be  the  upper 
boundary  of  to  (x)  for  all  interior  points  of  the  mesh  S,  there  is  such  a  point 
at   which  o>  (#)  >  w  (8)  —  p;    where   p  is   arbitrarily  chosen.      Therefore  the 
fluctuation  in  the  mesh  is  >  w  (8)  —  p,  and  hence  is  ~  w  (8),  since  p  is  arbitrary. 
It  follows  that,  for  the  net,  5  8  F(8)  ^  S  8  w  (8).   Taking  a  sequence  of  values 

76 

of  d  that  converges  to  zero,  S  8  &>  (8)  converges  to       GO  (x)  doc]  hence 

.'  a 
~rd  fb  ~b 

f(x)  dx  —  I    f(x)  dx^       to  (x)  dx. 

J  a  J_a  .  a 

Again,  the  fluctuation  in  8  is  <  to  (|)  +  e;  and  hence,  for  the  net, 


7b 

It  follows  that  Km  2  8  F(8)  cannot  exceed       to  (x)  dx  +  e  (b  —  a);  and  since  e 

c/~0  J  a 

is  arbitrary,  we  have 

7b  rb  ~*i-b 

f(x)  dx  —       f(x)  dx  ^       w  (x)  dx. 

J  a  J  a  J  a 

It  has  thus  been  proved  that 

7b  Cb  lb 

f(x}  dx  —  I    f(x}  dx  =  I    co  (x)  dx. 

J  a  J  a  .a 

The  theorem  of  §  333  can  be  deduced  from  this  result.    For  the  condition 
7-6  rb 

to  (x)  dx  =  0,  is  equivalent  to  the  condition       to  (x)  dx  =  0;  and  unless  the 

J  a  J  a 

set  of  discontinuities  of  the  function  has  measure  zero,  for  some  value  of 
k  (  >  0)  the  set  Gk  must  have  positive  content,  in  which  case  it  is  impossible 
that  the  integral  can  have  the  value  zero. 
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PARTICULAR  CASES  OF  FUNCTIONS  THAT  ARE  INTEGRABLE  (R). 

335.  The  following  classes  of  bounded  functions  satisfy  the  condition  of 
integrability  (R)  which  has  been  expressed  in  various  forms  above. 

(1)  All  functions  which  are  continuous  in  the  intervals  for  which  they  are 
defined. 

(2)  All  functions  with  only  a  finite  number  of  discontinuities,  or  with  any 
enumerable  set  of  discontinuities. 

(3)  Monotone  functions,  and  all  functions  with  bounded  variation. 

For,  as  has  been  shewn  in  §  243,  the  points  of  discontinuity  of  a  function 
with  bounded  variation  form  an  enumerable  set. 

(4)  Generally,  every  point-wise  discontinuous  function  which  is  such  that 
the  closed  set  of  points  for  which  the  saltus  is  =  k  has  content  zero,  whatever 
positive  value  k  may  have. 

Dini*  has  given  the  theorem  that  a  function  is  integrable  (R),  if  at  all 
points  where  the  discontinuity  is  of  the  second  kind,  it  is  so  for  all  such  points 
only  on  one  and  the  same  side  of  the  point;  and  at  these  points  the  function  may 
be  continuous  on  the  other  side,  or  may  have  ordinary  discontinuities  on  that 
side.  In  particular,  any  function  which  has  only  ordinary  discontinuities  is 
integrable  (R). 

To  prove  this  we  observe  that  it  has  been  proved  in  §  239  that,  for  such 
a  function,  the  set  of  points  for  which  the  saltus  is  =  k  has  content  zero, 
whatever  positive  value  k  may  have.  Therefore  the  condition  of  integrability 
is  satisfied. 

Riemann's  definition  of  an  integral,  and  the  condition  for  the  existence  of 
the  integral,  are  applicable,  without  essential  change,  to  the  case  of  a  function 
which,  for  particular  values  of  the  variable,  has  indeterminate  functional  values 
lying,  in  the  case  of  each  such  point,  between  finite  limits  of  indeterminancy. 
At  each  point  of  indeterminancy  of  the  function,  it  is  immaterial  whether  the 
function  be  capable  of  having  all,  or  only  some,  values  between  the  limits  of 
indeterminancy ;  thus  there  is  no  loss  of  generality,  if  the  function  be  regarded 
as  having  two  values  only  at  each  such  point,  viz.  the  two  limits  of  indeterminancy 
at  the  point.  In  estimating  the  fluctuation  of  the  function  in  a  prescribed 
interval,  the  upper  boundary  is  found  by  taking  the  upper  limits  of  indeter- 
minancy of  the  function  at  the  special  points  as  functional  values  at  those  points, 
whilst  the  lower  boundary  is  found  by  taking  the  lower  limits  of  indeterminancy 
at  the  special  points  as  the  functional  values  at  those  points.  As  in  the  case  of 
a  function  which  is  everywhere  single-valued,  the  saltus  at  any  point  is  defined 
as  the  limit  of  the  fluctuation  in  a  neighbourhood  of  the  point,  when  that 

*  See  Grundlagen,  p.  335. 
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neighbourhood  is  diminished  indefinitely.  The  conditions  of  integrability  are 
exactly  the  same  as  for  a  function  which  is  everywhere  single-  valued,  viz.  that 
the  function  be  bounded  in  its  domain,  arid  that  the  set  of  points  of  discontinuity 
of  the  function  have  measure  zero;  or,  in  other  words,  that  it  be  bounded,  and 
continuous  almost  everywhere,  in  the  domain. 

EXAMPLES. 

ty,\      (2x}  (nx} 

1.  Riemann's  function  f(x)  =  W  +  ^^2"  +  •  •  •  +  ^T  +  •••,  where  (x)  denotes  the  positive 

1,  22  ?l 

or  negative  excess  of  x  over  the  nearest  integer,  and  (#)  =  0  when  x  is  half-way  between 
two  integers,  has  been  shewn  in  Example  2,  §  240,  to  be  point-wise  discontinuous,  with  all 
its  discontinuities  ordinary  ones,  and  everywhere  dense  in  the  interval  (0,  1).  Since  all  the 
discontinuities  are  ordinary  ones,  and  the  function  is  limited,  f(x)  is  integrable  in  (0,  1). 

2.  Let  f(x)  be  denned  for  the  interval  (0,  1)  as  follows  :—  If  x  be  irrational,  let  /(#)=--  0  ; 
if  x=p/g,  where  p/q  is  in  its  lowest  terms,  let  f(x)  =  I/q;  also  let  /(0)=/(1)  =  0.    This 

function  is  an  integrable  point-wise  discontinuous   null-function;   thus  /    f(x)dx  =  Q. 

Jo 

There  is  only  a  finite  number  of  points  at  which  the  functional  value  exceeds  an  assigned 
positive  number. 

3.  Let  /(#)«=0,  for  all  rational  values  of  x;  and  /(.r)  =  l;  for  all  irrational  values  of  x. 
This  function  is  not  integrable  (R)  in  any  interval,  for  it  is  totally  discontinuous. 

4.  Let  f(x)  be  defined*  for  the  interval  (0,  1)  as  follows  :  —  For  -  <  x  £  1,  let  /(#)  =  !  ; 

22 

for  £2  •  <  x  =  2  '  let  f  W  =  2  5    f°r  2~3  <  X  =  P  '  let  ?  (^  =  2*  :  and  8enerally>  for 
~<xn,  let  /(*)  =  !;  and/(0)-0. 


[x  x  1 

This  function  is  integrable,  and   I    f(x)dx  =  -  -—  j  -  ^--^o—  r«>  where  x  is  between 

./  0  ^  O  .  Z   "* 


—  and 


GEOMETRICAL  INTERPRETATION  OF  RIEMANN  INTEGRATION. 

336.  Let  f(x)  be  a  bounded  function,  defined  for  the  interval  (a,  b), 
of  which  the  values  are  all  ^  0.  Associated  with  the  function  there  exists  a 
plane  set  of  points  (x,  y}  consisting  of  all  the  points  of  which  the  coordinates 
satisfy  the  conditions  a^x^b,  O^y^f(x).  In  accordance  with  Jordan's 
theory  of  measure  of  sets  of  points  (see  §  142),  this  set  has  an  exterior  extent, 
and  an  interior  extent ;  and  the  set  of  points  is  measurable  (J)  when  the  two 
have  the  same  value,  in  which  case  their  common  value  is  the  extent,  or 
measure  (J),  of  the  set.  The  extent  of  a  two-dimensional  set  of  points  may 
be  regarded  as  a  generalization  of  the  conception  of  area ;  thus  in  general,  the 
exterior  extent  and  the  interior  extent  may  be  spoken  of  as  the  exterior  area 
and  the  interior  area  of  the  space  bounded  by  the  axis  of  x,  the  two  straight 

*  Dini,  Grundlutjen,  p.  344. 
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lines  x=a,  #  =  &,  and  the  "curve"  defined  by  y  =  /(#).  This  set  of  points  G 
has  an  area,  in  the  ordinary  sense,  when  the  exterior  area,  and  the  interior 
area,  are  equal;  in  which  case  the  function  y  —  f(x)  is  said  to  be  quadrable 
in  the  interval  (a,  6). 

If  a  fundamental  rectangle  be  taken,  which  contains  the  plane  set  in  its 
interior,  we  may  fit  on  to  this  rectangle  a  system  [Dn\  of  nets,  with  closed 
meshes.  Consider  those  meshes  of  Dn  which  are  such  that  every  point  of  each 
mesh  is  an  interior  point  of  G.  If  the  segment  (a,  6),  on  the  #-axis,  be  divided 
by  means  of  the  boundaries  of  the  meshes  of  Dn  into  intervals  Slt  82,  ...  8^,  we 
see  that  the  sum  of  the  measures  of  those  meshes  of  Dn  which  consist  entirely 

n 

of  interior  points  of  G  is  28Z  (S)  +  T/W,  where  rfn  is  a  number  which  converges 

to  zero,  as  n  is  indefinitely  increased.  Similarly,  the  sum  of  those  meshes  of 
Dn  each  of  which  contains  at  least  one  point  of  G,  or  of  the  boundary  of  G 


is  2&{7(3)  +  pn,  where  pn  ~  0,  as  n  ~  oo  . 


i 


It  thus  appears  that       f  (x)  dx  is  the  exterior  extent  of  G,  and  that 

.'  a 

I   f(x)dx  is  the  interior  extent  of  G. 

J_a 

If       f(x)  dx  exists  as  a  definite  number,  the  plane  set  G  is  measurable 

J  a 

(J),  and  the  value  of  the  integral  measures  the  area  bounded  by  x  =  a,  x  =  b, 

It  has  been  shewn,  in  §  142,  that  the  condition  for  the  measurability  (J"), 
of  the  set  G,  is  that  the  frontier  of  G  should  have  the  plane  measure  zero. 
It  is  clear  that,  in  any  case,  the  plane  measure  of  the  three  rectilinear  portions 
of  the  boundary  of  G  is  zero ;  thus  the  condition  for  the  existence  of  the  integral 
is  that  the  set  of  boundary  points  which  consists  of  points  on  the  "curve" 
y  —  f(x\  or  °f  limiting  points  of  points  on  the  curve,  shall  have  plane  measure 
zero.  This  condition  is  equivalent  to  the  condition  that  the  linear  measure  of 
the  set  of  points  of  discontinuity  of  the  function  f(x)  in  the  linear  interval 
(a,  b)  is  zero.' 

In  case  the  bounded  function  f(x)  is  not  everywhere  S  0  in  (a,  b),  we  may 
take  f(tt}=fi(x)— /^(x)',  where  fl(x}=f(x\  for  all  values  of  x  for  which 
f(x)  ^  0,  and  /j  (x)  =  0,  for  those  values  of  x  for  which  f(x)  is  negative.  In 
case  the  two  sets  of  points  (x,  y)  for  which  a  ^  x  £b,  0  £  y^f^x),  and  a^x^b, 

rb 
0  ^  2/^/2  (#)  are  both  measurable  («/),  the  integral  /  f(x)dx  is  the  excess  of 

J  a 

the  measure  of  the  first  set  over  that  of  the  second  set ;  and  this  may  be  in- 
terpreted as  the  excess  of  that  part  of  the  area  defined  by  x  =  a,  x  =  b,  y  =f(x) 
which  is  above  the  #-axis,  over  that  part  which  is  below  it. 
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If  the  two  plane  sets  of  points  be  not  measurable  (/),  the  exterior  and 

76  /* 

interior  extents  of  the  first  set  are       /i  (us)  dx,   I    /  (x)  dx;  and  those  of  the 

.'a  J  a 

~7b  rb 

second  set  are        /„  (x)  dx,        f2  (x)  dx  respectively.     The   upper   integral 

-  Ja  ^ 

f 

f  (x)  dx  is  then  the  excess  of  the  exterior  extent  of  the  set  a  ^  x  ^  b, 

J  a 

0  g  y  ^  /!  (x),  over  the  interior  extent  of  the  set  a  ^  x  £  b,  0  ^  y  £  f2  (x) ; 

f6 
whilst  the  lower  integral       f(x)  dx  is  the  excess  of  the  interior  extent  of  the 

.'  a 

first  set  over  the  exterior  extent  of  the  second  set. 

The  condition  that  /  (x)  may  be  integrable  (R)  is  that  the  frontier  which 
consists  of  the  set  of  points  a  £  x  ^  b,  y  =  f(x),  when  closed  by  adding  the 
limiting  points,  has  its  plane  measure  zero. 

If  a  linear  set  of  points  H  be  denned  on  the  #-axis,  and  lie  in  the  interval 
(a,  b),  a  function  <£  (x)  may  be  defined  by  the  rule  that  (f>  (x)  =  1,  if  x  be  a  point 
of  H ,  and  $  (x)  =  0,  if  x  be  a  point  of  C  (H).  The  set  H  has  an  exterior  linear 

7b  rb 

extent,  and  an  interior  linear  extent,  which  are  given  by  I   <j>  (x)  dx,  I   </>  (x)  dx, 

J  a  J  a 

respectively,  as  may  be  seen  by  referring  to  the  definitions.  For  it  is  easily  seen 
that  the  exterior  and  interior  linear  extents  of  II  are  numerically  equal  respec- 
tively to  the  exterior  and  interior  plane  extents  of  the  plane  set  a  £  x  £  b, 
0  ^  y  £  <f>  (x).  When  H  is  measurable  (J)  the  function  <£  (x)  is  integrable  (R) 

rb 
in  (a,  b),  and  I  $>(x)dx  is  the  measure  (J)  of  H.    This  measure  may  be  re- 

J  a 

garded  as  a  generalization  of  the  notion  of  the  length  of  a  linear  interval. 
The  condition  that  the  linear  set  H  is  measurable  (/)  is  that  its  frontier, 
which  consists  of  those  points  of  H  that  are  limiting  points  of  C(H),  and  of 
those  points  of  C  (H)  that  are  limiting  points  of  H,  should  have  the  measure 
zero. 

PROPERTIES  OF  THE  DEFINITE  RIEMANN  INTEGRAL. 

rb 

337.    We  proceed  to  consider  the  properties  of  the  integral      /  (x)  dx,  of 

.'  a 

a  bounded  function  f(x),  defined  for  the  interval  (a,  b),  which  is  such  that 
the  condition  for  the  existence  of  the  jR-integral  is  satisfied. 

ra  rb 

(1)  The  integral      f(x)dx  is  defined  as  the  value  of—  I  f(x}dx. 

J  b  J  i 

(2)  If  f(x}  be  integrable  (R}  in  (a,  b),  so  also  is  \f(x)  ,  and 

rb 


f(x)dx 


\f(x)  \dx. 
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For  the  fluctuation  of  |/(#)  |  in  any  interval  8  cannot  exceed  that  of  f(x) 
in  the  same  interval ;  hence,  if  BF(8),  for  a  system  of  nets,  have  the  limit  zero, 
when  F(8)  is  the  fluctuation  of  f(x)  in  8,  it  has  also  the  limit  zero  when  F(8) 
denotes  the  fluctuation  of  )/(#)  ;  and  thus  the  latter  function  is  integrable 
(R).  Again  U,  the  upper  boundary  of  f(x)  in  8,  cannot  numerically  exceed 
V,  the  upper  boundary  of  \f(x)  \  in  the  same  interval ;  thus  1 2U&  \  ^  2,U'S, 
and  hence  the  absolute  value  of  the  limit  of  2J78  is  £  that  of  "SU'S. 

(3)  If  f(x)  be  integrable  (R)  in  (a,  b),  f(x  —  h)  is  integrable  (R)  in 

rb  rb+h 

(a  +  h,  b  +  h),  and  I    f  (as)  dx  =          f(x  —  h)  dx. 

J  a  J a+h 

This  follows  at  once  from  the  definitions  of  the  integrals  as  the  limits  of 
sums. 

(4)  //  the  values  of  the  integrable  function  f  (x)  be  arbitrarily  altered  at 
each  point  of  a  measurable  set  of  points  G,  the  new  function  (f>  (x}  so  obtained 
is  integrable  (R), provided  it  be  bounded,  and  also  the  measure  of  the  derivative  G' 
of  the  set  be  zero. 

For  the  only  points  of  discontinuity  of  <£  (x)  which  are  not  points  of  dis- 
continuity of  f(ac)  are  points  of  G,  or  of  G',  and  therefore  form  a  set  of  measure 
zero ;  hence  all  the  discontinuities  of  (/>  (x)  form  a  set  of  points  of  measure  zero, 
and  <f)  (x)  is  therefore  integrable  (R),  provided  it  be  bounded.  In  particular, 
the  theorem  holds  for  any  reducible  set  G. 

•Also,  if  <f)  (x)=f(x),  at  all  points  belonging  to  a  set  which  is  everywhere 
dense  in  (a,  b),  then,  provided  (f)  (x)  be  integrable  (R),  its  integral  is  identical  with 
that  of  f(x). 

For,  in  the  finite  sum  ^.8M(8),  we  may  take  the  value  of  M  (8)  in  any 
interval  8  to  be  one  of  the  values  which  the  two  functions  f(x),  <£  (x)  have  in 
common  in  that  interval ;  hence  the  sums  may  all  be  chosen  so  as  to  be  the 
same  for  the  two  functions.  Thus,  if  the  functions  be  both  integrable  (R), 
their  integrals  are  identical. 

(5)  A  function  f(x)  which  is  integrable  (R),  in  (a,  b),  is  also  integrable  (R) 
in  any  interval  (a,  /3)  contained  in  (a,  b). 

For  the  measure  of  the  set  of  points  of  discontinuity  of  f(x)  in  (a,  b)  being 
zero,  the  measure  of  the  set  of  those  points  of  discontinuity  which  are  in  (a,  ft) 
is  also  zero,  and  thus  the  function  is  integrable  (R)  in  (a,  ft). 

If  c  is  any  point  in  (a,  b),  we  have 

f  /  (x)  dx  =  Ff(x)  dx  +  I "  f(x}dx. 

la  J  a  J  c 

For  the  two  integrals  on  the  right-hand  side  both  exist ;  also  a  system  of 
nets  can  be  so  chosen  that  the  point  c  is  always  an  end-point  of  two  of  the 
meshes  of  each  net.  If  this  be  done,  the  sum  28J/(S),  for  (a,  b),  may  be 
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divided  into  two  parts,  one  of  which  contains,  all  the  intervals  on  the  left  of 
the  point  c,  and  the  other  all  those  on  the  right  of  that  point ;  thus 

2&fl/(8)  =  ^8M(S)  +  2281f  (8). 

The  limits  of  the  three  sums  are  the  three  integrals  of  f(x)  in  (a,  b),  (a,  c), 
and  (c,  b)  respectively ;  thus  the  theorem  is  established. 

(6)  //t*/i,/2)/3,  •••  fn  be  a  finite  number  of  bounded  functions,  each  of 
which  is  integrable  (R)  in  (a,  b),  and  if  F(f1}  f.2>  ...  fn)  be  a  continuous  function 
with  respect  to  (/i,  /2,  ...  fn\  then  the  function  F  is  integrable  (R)  in  (a,  b). 

For  the  only  points  of  discontinuity  of  the  function  F  (x)  are  those  of  the 
functions  fi(x),  /2  (#),  ...fn  (%)'•,  hence  the  set  of  points  of  discontinuity  of 
F  (x)  has  measure  zero;  and  thus  F(x)  is  integrable  (R),  since  it  is  also  a 
bounded  function. 

Important  particular  cases  of  the  general  theorem  are  the  following : 

(a)  If  / (x)  =  /j (x)  +  /2  (x)  +  ...  +fn  (x),  where  all  the  functions  fr  (x) 
are  integrable  (R),  then/ (a1)  is  integrable  (R). 

rb  r=n  rb 

It  can  also  be  shewn  that      f(x}dx=  2   I   fr(x)dx.  For,  in  any  interval 

J a  r=lJ a 

8,  the  upper  boundary  of  f(x)  cannot  exceed  the  sum  of  the  upper  boundaries 

~rb  n  ~rb 

of  the  functions  fr(x).    From  this  we  see  that  I  f(x)dx  =  2  I    fr  (#)  dx,  and 

J  a  I  J  a 

rb  n    Cb 

similarly  we  have  I  f(x)dx  =  ^  I   fr(x)dx. 

J_a  1  J_a 

(b)  If  f(x)=jl  (x)  .fz(x)  ...fn(x},  where  all  the  functions  fr  (x)  are 
integrable  (R)  in  (a,  b),  then  f(x)  is  also  integrable  (R)  in  (a,  b). 

(c)  If  f(x),  <f>  (x)  be  integrable  (R)  in  (a,  b),  and  \<j>(x)\  always  exceeds 

f(x) 
some  fixed  positive  number  A,  so  that  ~rj4  is  a  continuous  function  of  (/,  9), 

9  \x) 

f(x] 

then  ^7  \  is  integrable  (R)  in  (a,  b). 
<t>(®) 

(7)  //'  two  functions  f+(%),f~(%)  be  defined  as  follows: — Let  f+(x)=f(x) 
for  all  values  of  x  such  that  f(x)>0,  and  let  f+(x)  =  0,  when  f(x)^Q;  let 
f  ~(,-r)=  —  /"(,')  for  all  values  of  x  such  that  f(x}<  0,  and  f~(x)  =  0,  when 
f(x}  ^0;  then  if  f(x)  be  integrable  (R)  in  (a,  b),  the  functions  f+(x),  f~(x)  are 

rb  rb  rb 

integrable  (R)  in  (a,b),  and      f(x)dx=\  f+(x)dx-     f~(x)dx. 

J  a  -a  J  a 

For  the  fluctuation  of  f+(x)  in  any  interval  8  cannot  exceed  that  of  f(x) 
in  the  same  interval;  hence,  since  28^7(§),  for  f(x),  has  the  limit  zero,  the 
corresponding  sum  for/+(^)  has  the  limit  zero;  and  thus/+(V)is  integrable. 
In  a  similar  manner  it  can  be  shewn  that  f~{x)  is  integrable. 

*  Do  Bois  Raymond,  Hath.  Amialen,  vol.  xx,  p.  123.  See  also  W.  H.  Young,  Quarterly  Journal 
of  Math.,  vol.  xxxv,  p.  190. 
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Since  f(x)  =f+(x)  —/"(#),  we  see  from  (6)  (a)  that 

f  f(x)  dx  =  f  /+(*)  dx  -  {"/-(x)  dx. 

J  a  .'a  -'a 

It  should  be  observed  that  it  is  not  in  general  true  that,  if  f(x)  be  in- 
tegrable  in  (a,  b),  and  be  expressed  as  the  sum  /,  (x)  +  /2(X),  of  two  bounded 
functions,  then  /,  (x),  /2  (x)  are  also  integrable  in  (a,  b).  For  it  is  clear  that, 
f(x)  being  given,  we  may  take  for  fl  (x)  any  arbitrarily  defined  non-integrable 
function,  then  /2  (x)  is  also  determinate  and  non-integrable. 

(8)    If  f(x),  <j>  (x)  be  both  integrable  (R),  and  be  such  that  \  f(x)  \£\<f>((K)\ 

rb  rb 

for  every  value  of  x,  then    I  f(x)  dx  ^      |  <f>  (x)  \  dx. 

J  a  J  a 

In  particular,  if  <£  (x)  is  constant,  and  equal  to  P,  the  upper  boundary  of 


I  f& 
|  /  (x)    in  (a,  b),  then        f(x)dx\^P(b-  a). 

J  a 


rb 

For       {|  $(x)  \  —  \f(x)  \]  dx  is  =0,  since  in  every  interval  8  no  value  of 

.  a 

j  <f)(x)  \  —  \f(x)  |  is  negative,  and  thus  the  sums  of  which  the  integral  is  the 

rb  rb 

limit  are  all  ^  0.     Also  from  (2),  we  have        f(x)\dx£\  \f(x)  I  dx,  and  this 

J  a  .  a 

r6 

is  ^       <f>  (x)  |  dx.    The  particular  case  follows  by  assuming  <£  (x)  =  P. 

J  a 

If  U,  L  denote  the  upper  and  lower  boundaries  of  f(x)  in  (a,  b),  then 
L  (b  -  a)  £  \b  f(x}  dx  ^  U(b  -  a). 


For  ^8U(8),  2&L  (8)  each  lies  between  IT28  and  L"E8,  or  between  U  (b  —  a) 

and  L(b  —  a);  the  same  must  hold  of  the  common  limit,  which  is  the  integral 
i 

f(x)  dx. 
i 

(9)  If  v)i,  %,  •••  ^ny  ---be  an  enumerable  set  of  non-overlapping  intervals 
contained  in  (-a,  b},  in  descending  order  of  length,  then  the  sum  of  the  integrals 
of  f(x)  taken  through  r)l,  t]2,  ...  ijn,  ...  converges  to  a  definite  finite  limit,  as  n  is 
increased  indefinitely ;  f(-r)  being  a  function  which  is  integrable  (R)  in  (a,b). 

Let  us  denote  by  Sn  the  sum  of  the  integrals  of  f(x)  taken  through  the 
intervals  171,  ?;2, ...  rjn.  Since  TJI  +  ij2  +  ...  +r)n  increases  with  n,  and  is  always 
less  than  b  —  a,  it  has  a  definite  limit  as  n  is  increased  indefinitely ;  we  can 
therefore  choose  n  so  great  that  rjn+1  +  rjn+.2  +  . . .  +  rjn+m  <  e,  for  every  value  of 
m,  where  e  is  an  arbitrarily  chosen  positive  number.  With  this  value  of  »,  we 
see  that  Sn+m  —  Sn\<  e.  P,  where  P  is  the  upper  boundary  of  f(x)  i  in  (a,  b). 
If  77  be  an  arbitrarily  chosen  positive  number,  we  can  choose  e  such  that 
e  <  r)/P ;  thus  n  can  be  so  chosen  that  j  Sn+m—  Sn  '  <  fj,  and  hence  Sn  has  a 
definite  limit,  as  n  is  increased  indefinitely. 
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(10)  If  yi  (x),  f2 (x),  ...  fn (x), ...  be  a  sequence  of  functions,  defined  in  the 
interval  (a,  b\  and/(#)  be  such  that,  for  all  values  of  x,  in  (a,  b}, 

l/O)-/nO)   <e, 

where  e  is  an  arbitrarily  chosen  positive  number,  provided  n  is  ^  a  fixed  integer 
we,  dependent  on  e;  the  sequence  {/„(&)}  is  said  to  converge  uniformly  to 
f  (x),  in  the  interval  (a,  6). 

If  a  sequence  of  functions  {fn  (x)},  all  integrable  (R),  in  the  interval  (a,  b), 
converges  uniformly,  in  that  interval,  to  the  bounded  function  f  (x),  then  f  (x)  is 
integrable  (R),  and 

b  rb 

f  (x)  dx  =  lim  I  fn  (x)  dec. 


At  almost  every  point  of  (a,  b),  that  is,  almost  everywhere  in  (a,  b),  (see 
§  333),  all  the  functions  of  the  sequence  [fn(x}\  *we  continuous;  for  the  set 
of  all  those  points  at  which  any  of  them  are  discontinuous  has  the  measure 
zero.  Let  £  be  a  point  at  which  all  the  functions  fn  (x)  are  continuous ;  if 
(£  —  h,  %+h)  is  a  neighbourhood  of  f,  the  fluctuation  of  /Wg  (x)  in  (£  —  h,^  +  h) 
is  <  e,  if  k  be  sufficiently  small;  the  integer  ne  having  been  so  chosen  that 
\f(x)  —  fne(x)\<€,  in  (a,  b).  It  follows  that  the  fluctuation  of  f  (x}  in 
(£—/?,  £+  h)  is  <  3e.  Since  e  is  arbitrary,  £  is  a  point  of  continuity  of  f(x) ; 
therefore  /(#)  is  continuous  almost  everywhere  in  (a,  b),  and  is  therefore 
integrable  (R).  Also 

rb  rb 

'  f(x)  dx  -     /„,(«)  dx    <e(b-a}; 

J  a  J  a 

rb  rb 

and  therefore  I  f  (x)  dx  =  lim  /  fn  (x)  dx. 

J  a  «~cc  J  a 

jR-INTEGRALS   OF   FUNCTIONS   OF   TWO   OR   MORE    VARIABLES. 

338.  The  Riemann  definition  of  the  integral  of  a  bounded  function, 
defined  for  an  interval  (a,  b),  may  be  extended  at  once*  to  the  case  of  a 
bounded  function  f  (xw ,  #<2))  defined  in  a  rectangular  cell  (a(l),  a(2);  b(1\  6('2)), 
or  more  generally  to  the  case  of  a  bounded  function  f(x(l\  x{2\  ...  #(p)),  of 
p  variables,  defined  in  a  p-dimensional  cell  (a(l),  a(2),  ...  a(p] ;  b(l),  6(2),  ...  b(p)). 

In  the  definitions  given  in  §§  330,  331,  we  have  only  to  consider  systems  of 
nets  in  two,  or  in  ^-dimensions,  instead  of  linear  systems. 

Thus  the  upper  and  lower  integrals  f  of  f(x(l\  #/2))  in  the  cell  (a(l),  a(2); 
6(l),  6(2))  may  be  denoted  by 

*  This  extension  was  made  by  H.  J.  S.  Smith,  Proc.  Loud.  Math.  Soc.,  (1),  vol.  vi  (1875), 
p.  152,  and  also  by  Thomae,  Einleitung  in  die  Theorie  der  bextimmten  Integrate,  1875,  p.  33; 
also  Schlomilch's  Zeitschr.,  vol.  xxi,  p.  224. 

t  See  Jordan's  Cours  d" Analyse,  vol.  i,  p.  34.  An  elaborate  treatment  of  double  integration  has 
been  given  by  Stolz,  Grundziige,  vol.  in,  where  the  triangle  or  polygon  is  employed  in  relation  to 
the  measure  of  sets  of  points,  instead  of  the  rectangle.  On  this  matter  see  Schoennies,  Berichi, 
vol.  i,  p.  179. 
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7(4"),  W2)) 


which  represent  the  limits  of  the  sums  of  the  products  of  the  areas  of  the 
meshes  of  a  net  Dn  into  the  upper,  or  the  lower,  boundaries  of  the  function 
in  the  corresponding  meshes.  That  these  limits  exist,  and  are  independent  of 
the  particular  system  of  nets,  is  proved  exactly  as  in  §  332. 

When  the  upper  and  lower  integrals  have  equal  values,  f(x(1\  a/2')  is 
integrable  (R)  in  the  fundamental  cell,  and  their  common  value  is  denoted  by 


!),  a<2))  * 

The  necessary  and  sufficient  condition  that  f '(x{l},  #(2))  is  integrable  (R),  viz. 
that  the  plane  measure  of  its  points  of  discontinuity  should  be  zero,  or  in 
other  words,  that  the  function  should  be  continuous  almost  everywhere  in  the 
cell,  is  established  in  the  same  manner  as  in  §  333,  the  proof  requiring  only  a 
slight  modification. 

The  integral 


),  where  f(x(l},  x{2))  =  0, 

•),  a(2))  "' 

in   the  cell  is   the   measure   (/)   of   the    three-dimensional   set   of    points 
(#(1),  x®,  x{3))  defined  by 

it  being  assumed  that  this  measure  (J)  exists. 

A  function  f(xw,  x®)  which  takes  both  signs  in  the  fundamental  cell 
is  the  difference  of  two  functions  ft  (x(l),  #(2)),  f2(x(l),  #(2)),  both  of  which  are 
=  0.  If  the  frontiers  of  both  the  three-dimensional  sets 


a(1>  ^  «w  ^  6(1>  ;  a*2)  S  ^(2»  ^  6(2)  ;  0  <  #<3>  ^  /, 
a(1>  ^  a?w  ^  6<J>  ;  a<2)  ^  a'2'  ^  6(2)  ;  0  ^  «(3)  ^  fa 

have  their  three-dimensional  measures  zero,  the  function  f(x(l\  x('2))  is  in- 
tegrable (R)  in  the  cell,  and  the  integral  is  the  excess  of  that  of  /i  (#(1),  #(2)) 
over  that  of  /„  (x(l\  at®). 

The  integral 


may  be  considered  in  a  precisely  similar  manner,  and  the  condition  for  its 
existence  is  an  extension  of  that  for  the  cases  p  =  I,  2. 
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339.  Next,  let  G  be  a  bounded  set  of  points  in  plane  space  ;  it  is 
therefore  contained  in  the  interior  of  a  fundamental  rectangular  cell  (a(1),  a(2)  ; 
6(1),  6(a)).  Let  us  suppose  that  a  bounded  function  f(x(l\  #(2))  is  defined  for  all 
the  points  of  G.  Let  /  (x(l},  x(2))  be  defined  in  the  fundamental  cell  by  the 
conditions 

f(x(1\  x®)  =/0(1),  x®\  at  all  points  of  G, 
and  /(«<«,  x®)  =  0,  at  all  points  of  C  (G). 

If  we  consider,  for  the  function  /(a?(1),  x®),  the  sum  28{U($)-L  (8)},  for 
those  meshes  of  the  plane  net  Dn,  fitted  on  to  the  fundamental  cell,  which 
contain  a  point  of  G  and  also  a  point  of  C(G),  we  see  that,  for  all  such  meshes, 
U(8)  —  L(8)>  0,  unless  U(8)  and  L  (8)  are  both  zero,  since  there  is  at  least 
one  point  in  the  mesh  at  which  /(»»»,  a?<2')  =  0.  Thus  28  {17(8}  -  L  (8)}, 
taken  for  all  such  meshes,  is  >  0,  unless  f(x(1),  #(2))  =  0  at  every  point  of  all 
these  meshes. 


Unless  f(xw,  #(2))  =  0,  at  every  point  of  the  frontier  of  G,  with  the  possible 
exception  of  a  part  of  G  of  which  the  measure  (J)  is  zero,  the  limit  of 
28  {U(8)  —  L(8)},  as  n  ~  oo  ,  will  not  be  zero  unless  28,  taken  for  all  meshes 
that  contain  points  on  the  frontier  of  G,  converges  to  zero,  as  n  ~  oo  .  Thus  it 
follows  that  the  upper  and  lower  integrals  of  J  (x(l),  #(2))  in  the  fundamental 
cell  will  have  unequal  values,  and  thus  that  /(#'",  #(2))  is  not  integrable  (R)y 
unless,  either  the  set  G  is  measurable  (J),  or  f(xw,  #(2))  =  0  at  all  the  points 
of  its  frontier  with  the  possible  exception  of  a  part,  of  which  the  measure 
(J)  is  zero. 

We  define  the  .R-integral 


(G) 

of  /(#(1),  #<2))  over  the  set  6r,  to  be  the  value  of 


when  the  latter  integral  exists,  which  we  have  seen  is  only,  in  general,  the 
case  when  the  set  G  is  measurable  (/). 

In  the  definition  of  the  ^-integral  of  a  bounded  function  f(x(l),  x®~), 
defined  for  a  bounded  set  G,  it  will  accordingly  be  assumed  that  G  is 
measurable  (/)  ;  and  then  the  integral  exists  when  the  function  that  is  equal  to 
f(x(l\  #(2))  at  all  points  of  (7,  and  is  elsewhere  zero,  has  an  E-integral  in  a 
cell  that  contains  G. 

An  integral 


or  f      f(x(l},x®)d(a;W,a;®), 

J  (G) 
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is,  in  accordance  with  tradition,  spoken  of  as  a  double  integral,  although  it  is 
defined  as  the  single  limit  of  a  finite  sum,  and  accordingly  the  sign  of  integra- 
tion is  here  employed  only  once.  The  term  "  double,"  in  the  name  double 
integral,  must  be  taken  to  have  reference  to  the  two-dimensional  set  of  points 
for  which  the  function  is  defined.  A  similar  remark  applies  to  the  case  of  a 
£)-fold  integral.  The  above  notation  is  such  that  if 

(a;'1',  tf(2)),  or  (a?(1),  x('\  ...  x(&\ 
be  denoted  by  a  single  letter  x,  a  double,  or  p-fold,  integral  may  be  denoted  by 


rb 

f(x)dx, 

J  a 


or  by  If  (x)  dx, 

J  (G) 

independently  of  the  number  of  dimensions  of  the  space  in  which  the  points 
a,  6,  x  lie.  This  is  in  accordance  with  the  parity  of  the  properties  of  the 
^-integral  in  any  number  of  dimensions  with  those  of  the  -R-integral  in 
linear  space. 

340.  With  slight  adaptation,  the  properties  established  in  §  337  are 
applicable  to  the  case  of  functions  of  two  or  more  variables.  The  notation 
will  be  the  same  as  for  functions  of  one  variable,  provided  the  single  letter  x 
is  employed  as  typical  of  (a;(l\  x(-\  ...  x(p))  in  space  of  ^-dimensions. 

The  definition  in  (1)  of 

fa 

f(x)  dx 

b 
b 


rb 
as  the  value  of  —  I    f(x)  dx 

J  a 


holds  for  a  cell  (a,  b).  The  proof  of  the  property  (2)  is  unaltered  by  employing 
8  for  a  cell,  instead  of  a  linear  interval.  The  property  (3)  is  unaltered 
essentially  ;  it  becomes,  for  the  case  p  =  2, 


f  /V<!>  —    /I  W      T^  —   /3(2h   d  ( I'M      T^\ 
J    \^  I      ,    ^  I      )   U    \'L      ,    ^       ). 

The  property  (4)  holds  with  its  form  unaltered.    The  property  (5)  holds  for  a 
cell  (a,  f3)  contained  in  the  cell  (a,  b). 

If  c  is  a  point  in  the  cell  (a,  b),  the  cell  (a,  b)  is  divided  into  'lp  cells  by 
the  (p  —  l)-dimensional  planes  through  the  point  c,  and 

I*  f(x)dx 

J  a 

is  expressed  as  the  sum  of  the  integrals  over  the  2P  cells  into  which  (a,  b)  is 
so  divided. 

H.  29 
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The  properties  (6),  (7),  (8)  remain  unaltered,  and  (9)  holds  for  an 
enumerable  non-overlapping  set  of  cells  in  the  cell  (a,  6), 

A  more  general  form  of  (5)  which  holds  for  p-ple  integrals  is  the  following  : 

If  G  be  a  bounded  set  that  is  measurable  (J),  and  if  G  be  the  sum  of  two 
sets  G1  and  G2,  both  measurable  (J),  then 

f  f  f 

f(x)  dx=\      f(x)dx+\        f(x)  dx. 

•'  (G)  .'(<?,)  J  (G,) 

To  prove  this  theorem,  let  K  be  the  set  of  points  of  G  at  which  the  saltus 
of  f(x)  is  =  k;  then  K  has  two  components,  Kl  a  part  of  Glt  and  Kz  a  part 
of  (TO.  The  only  points  at  which  the  saltus  of  /(#),  considered  as  defined  in 
GI  only,  and  zero  elsewhere,  is  =  k,  consist  of  those  points  of  A"a  that  are 
interior  to  GI,  together  with  points  forming  a  set  KJ  on  the  frontier  of  G^ 
and  Go.  Since  the  content  of  this  frontier  is  zero,  the  measure  («/)  of  KI'  is 
zero;  and  the  measure  (./)  of  the  set  of  points  of  K^  interior  to  G-^  is  also 
zero,  since  K1  has  the  content  zero.  Since  the  set  of  points  at  which  the 
saltus  of  f(x)  regarded  as  defined  in  G1}  and  elsewhere  zero,  is  zero,  it  follows 
that  f(x)  is  integrable  (R)  in  6^;  and  similarly  it  can  be  shewn  to  be 
integrable  (R)  in  G2.  The  integral 

f      f(x}dx 

•J  (G) 

is,  by  definition,  the  limit  of  the  finite  sum 


taken  for  a  net  fitted  on  to  the  fundamental  cell,  or  interval.  The  meshes  B 
consist  (1)  of  those  which  contain  interior  .points  of  G!  only,  but  no  points  on 
the  frontiers  of  G^  and  6r2,  (2)  of  those  which  contain  interior  points  of  G2  only, 
but  no  points  on  the  frontiers  of  Cr,  and  G2,  arid  (3)  of  those  which  contain 
points  of  Gl  and  G2  which  are  not  interior  points  of  either  set.  but  are  points 
on  the  frontiers  of  Gl  and  G2.  The  above  sum  may  be  divided  therefore  into 
three  portions  containing  those  meshes  B  which  respectively  belong  to  (1),  (2), 
and  (3).  The  limit  of  the  first  of  these  sums  is 


/  (x)  dx, 
-'  (.GI 

r 
that  of  the  second  is 


and  that  of  the  third  is  at  most  numerically  equal  to  U  multiplied  by  the 
content  of  the  points  on  the  frontiers  of  Gl  and  G2',  where  U  denotes  the 
upper  boundary  of  f(x)\  in  G.  Since  the  contents  of  the  frontiers  are  zero, 
the  limit  of  the  third  part  of  the  sum  is  zero.  The  theorem  has  accordingly 
been  established. 
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INTEGRABLE   NULL-FUNCTIONS   AND   EQUIVALENT   INTEGRALS. 

341.  If  f(x)  be  integrable  (R)  in  the  interval,  or  cell,  (a,  b),  and  be  such 
that  in  every  interval,  or  cell,  contained  in  (a,  b),  its  integral  is  zero,  then 
f(x)  is  said  to  be  an  R-inteyrable  null-function. 

The  necessary  and  sufficient  condition  that  a  bounded  function  f(x)  may  be 
an  R-integrable  null-function  is  that  f(x)  =  0,  almost  everywhere  in  its  domain. 

Let  Gk  denote  the  set  which  contains  the  points  x  at  which 

/<«)!**, 

where  k  is  any  positive  number. 

To  prove  the  sufficiency  of  the  condition,  let  Dn  be  a  net  with  closed 
meshes  fitted  on  to  the  interval,  or  cell,  (a,  b),  and  let  28'  denote  the  sum  of 
the  measures  of  those  meshes  of  Dn  which  contain  at  least  one  point  of  Gk. 
If  the  condition  of  the  theorem  is  satisfied,  SS'  has  the  limit  zero,  as  n  ~  GO  . 
The  sum  S8M  (8)  employed  in  §  330  is  in  absolute  value 

<  P2S'  +  (A-  28')  k, 

where  A  is  the  measure  of  the  interval,  or  cell,  (a,  b),  and  P  is  the  upper 
boundary  of  f(x)  \  in  (a,  b).    If  28'  converges  to  zero 


is  less  numerically  than  the  arbitrarily  chosen  positive  number  e,  for  all 
sufficiently  large  values  of  n.  If  k  be  chosen  <  e/A,  \  28.M  (8)  j  <  2e,  for  large 
enough  values  of  n.  It  follows  that  lira  28Jlf(8)=0,  and  therefore  f  (x)  is 

n~oo 

integrable  (R)  in  (a,  b),  and  its  integral  has  the  value  zero.  The  same  argu- 
ment applies  to  any  integral,  or  cell,  (a,  /3)  contained  in  (a,  b). 

To  shew  that  the  condition  is  necessary,  let  it  be  assumed  that  f(x)  has, 
in  every  interval,  or  cell,  contained  in  (a,  6),  an  JK-integral  that  vanishes. 
At  any  point  xl,  at  which  f  (x)  is  continuous,  /(X)  must  be  zero.  For  let 
f(x1)  have,  if  possible,  the  positive  value  c;  then  a  neighbourhood  of  ^  can 
be  determined  such  that,  at  every  point  of  it,  f(x)  lies  between  c  +  e  and  c  —  e, 
where  e  is  an  assigned  positive  number  <  c.  The  integral  of  f(x}  over  this 
neighbourhood,  which  we  take  to  have  the  measure  \  is  =  X  (c  —  e)  >  0, 
contrary  to  the  hypothesis.  In  a  similar  manner  it  can  be  shewn  that/(^) 
cannot  be  negative;  therefore  y  (#,)  =  ().  Since  f(x]  is  continuous  almost 
everywhere  in  its  domain,  it  follows  that  f(x}  has  the  value  zero,  almost 
everywhere,  that  is,  with  the  possible  exception  of  points  belonging  to  a  set 
of  measure  zero. 

342.  Two  functions  f(x),  <j)  (x)-  both  integrable  (R),  which  have  their 
integrals  equal,  when  taken  over  any  interval,  or  cell,  contained  in  the  funda- 
mental interval,  or  cell,  (a,  b),  must  differ  from  one  another  by  an  ^-integrable 
null-function. 

29—2 
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If  f(x)  be  integrable  (R),  and  consequently  point- wise  discontinuous,  and 
ty(x)  be  that  function  denned,  as  in  §  241,  by  extension  of  the  function 
which  is  defined  only  at  the  points  of  continuity  of  f(x),  and  has  at  those 
points  the  same  functional  values  as  /(#)  itself,  then  i/r  (x)  is  integrable  (R), 
although  it  is  in  general  multiple-valued  at  the  points  of  discontinuity  of 
f(x).  It  has  been  explained  in  §  235  that  Riemann's  definition  is  applicable 
to  such  a  function  as  i/r  (#).  That  ty(x)  is  integrable  (.R)  follows  from  the  fact 
that  its  points  of  discontinuity  form  a  set  of  which  the  measure  is  zero.  The 
function  f(x)  —  ty  (x)  is  zero  at  the  points  of  continuity  of  /(a?),  and  is  dis- 
continuous only  at  the  points  of  discontinuity  of  f  (x),  which  form  a  set  of 
measure  zero.  Consequently  f(x\  —  -^r(x}  is  an  jtf-integrable  null-function, 
and  the  two  functions  have  equal  integrals  in  any  interval,  or  cell,  for  which 
f  (x)  is  defined.  It  has  thus  been  shewn  that: 

A  function  f(x)  that  is  integrable  (R)  in  an  interval,  or  cell,  is  the  sum  of 
an  R-integrable  null-function  and  of  the  function  ty  (x)  obtained  by  extension 
of  the  function  defined  by  the  values  of  f(x}  at  its  points  of  continuity. 


THE    FUNDAMENTAL   THEOREM    OF   THE    INTEGRAL    CALCULUS. 

343.  The  fundamental  theorem  of  the  Integral  Calculus  asserts  that  the 
operations  of  differentiation  and  of  integration  are  in  general  inverse  operations. 
Before  we  proceed  to  consider  the  conditions  under  which  this  is  the  case,  the 
following  theorem  will  be  established  : 

Iff(x)  be  a  bounded  function  which  is  integrable(R)  in  the  interval  (a,  b),  then 

rx     • 

I  f(x}  dx  is  a  continuous  function  of  x,  for  the  whole  interval  (a,  b),  and  it  is 

.'  a 

a  function  of  bounded  variation  in  (a,  b).  It  is  also  absolutely  continuous 
in  (a,  b). 

'  X 

It  has  already  been  shewn  that       f  (x}  dx  exists,  for  any  point  x  of  the 

.  a 

interval  (a,  b};  denoting  its  value  by  F(x),  we  have 

l'X±h 

F(x  ±h)-  F(x)  =  1      f(x)dx; 

J  X 

hence  by  (8),  of  §  337,  F(x  ±  h)  —  F(x)  \  ^  Ph,  where  P  is  the  upper  boundary 
of  \f(x)\  in  (a,  b}.  If  e  be  any  arbitrarily  chosen  positive  number,  and  we 
take  A!  <  e/P,  then  for  all  values  of  A  which  are  ^  A1?  we  have 


but  this  is  the  condition  of  continuity  of  F  (x}  at  the  point  x.    In  case  x  be  one 
of  the  end-points  of  (a,  b),  h  must  be  restricted  to  have  one  sign  only. 

To  prove  that  F  (x)  has  bounded  variation  in  (a,  b),  let  (a,  b)  be  divided 
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into  n  sub-intervals  by  the  points  a,  #,,  a-z,  ...  av_j,  b.  The  sum  of  the  absolute 
differences  of  the  values  of  F  (x)  at  the  ends  of  these  sub-intervals  is 

I  fi  (Xi  Cb 

II  f(x)dx    +        f(x)dx +  ...+  f(x)dx\ 

Ja  J  e}  J  a?M_, 

rb 
and  this  is,  in  accordance  with  the  theorem  (8)  of  §  337,  ^.  I    |  f(x)   dx ;  and 

•  a 

therefore  the  sum  is  less  than  a  fixed  positive  number.  Since  the  total  variation 
of  F(x)  in  (a,  b)  is  bounded,  it  follows  from  the  theorem  of  §  246  that  the  total 
fluctuation  in  the  interval  is  also  bounded. 

If  (ttj,  6j),  (a2}  62), . . .  (an,  bn), ...  be  a  finite,  or  infinite,  set  of  non-overlapping 
intervals  contained  in  (a,  6),  of  which  the  total  measure  is  <  rj,  we  have  for 
the  finite,  or  infinite  sum 

2  j  F(br)  -  F(ar)   £  UZ  (br -  a,)  <i,U, 
r-l 

where  U  is  the  upper  boundary  of  \f(x)  in  (a,  b).  The  number  rj  being 
taken  to  be  e/(7,  where  e  is  an  arbitrarily  chosen  positive  number,  we  see 
that  the  condition,  given  in  §  218,  for  the  absolute  continuity  of  F(x)  is 
satisfied. 


When  f(x)  is  integrable  in  (a,  6),  the  function  I  f(x)  dx,  which  has  been 

' a 
shewn  to  be  absolutely  continuous,  and  of  bounded  variation  in  (a,  b),  is  said 

to  be  the  indefinite  integral  corresponding  to  f  (x). 

Iff(ai)  be  any  function  defined  in  (a,  b),  a  function  cf>(x)  which,  at  every 
point  x  of  the  interval,  possesses  a  differential  coefficient  equal  to  f(x),  is  said 
to  be  a  primitive  of  f  (x). 

The  definition  is,  however,  frequently  extended  to  cases  in  which  <j>'  (x)  either 
does  not  exist,  or  is  not  equal  to  f(x),  at  points  belonging  to  an  exceptional  set, 
of  measure  zero;  the  condition  <j>(x)=f(x)  being  satisfied  at  all  points  not 
belonging  to  the  exceptional  set. 

l'X 

Taking  the  function  F(x)=  I    f  (x)  dx,  as  the  indefinite  integral  corre- 

J  a 

spending  tof(x),  the  following  properties  will  be  established: 

(A)  Under  certain  restrictions  F  (x)   possesses  a  differential  coefficient 
which  is  equal  to  f(x),  and  thus  F(x)  is  a  primitive  of /(*•). 

(B)  Also  it  will  be  shewn  that,  if  6(x)  be  a  function  which  possesses  a 
differential  coefficient  f  (as),  then  f(x)  may  have  an  indefinite  integral  F(JC), 
in  an  interval  (a,  x),  which  integral  differs  from  <f>(x)  by  a  constant  only;  and 
thus  that  the  indefinite  integral  of  f(x)  is  determinate  except  for  an  additive 
constant. 

It  will  appear  that  there  are  cases  of  exception  to  both  theorems.    When 
F(x)  is  an  indefinite  integral,  it  may  happen  that  at  certain  points  F(x)  does 
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not  possess  a  differential  coefficient ;  and  when  <f>  (x)  is  a  function  which 
possesses  a  differential  coefficient,  it  is  not  always  the  case  that  the  latter  is 
integrable  (R),  and  when  integrated  yields  the  function  <£  (x}  except  as 
regards  an  additive  constant. 

344.    If  f(x)  be  continuous  in  the  interval  (a,  b),  and   F(x)  denote  the 

Cx 

indefinite  integral  I    f(x)dx,  then,  at  every  point  in  (a,  b),  F(x)  possesses  a 

•  '  a 

differential  coefficient  which  is  equal  to  f(x). 

For  since  f(x)  is  continuous,  an  interval  (x  —  h1}  x  +  Aj)  can  be  found  such 
that  \  f  (x  ±  6h-i)  —  f  (x)   <  e,  for  all  values  of  6  numerically  less  than  1.    It 

follows  that  F(x  ±  //)  -  F(x)  =  I   '   /(&•)  dx 

J  X 

lies  between  h  [/(#)  +  e]  and  h  [/(#)  —  e], 

provided  h  <  A: .    Hence,  since 

F(x±h)-F(x) 

h 

lies  between  f(x}  +  e>  f(x)  ~  e> 

for  h  <  /z1;  it  follows  that  f(x)  is  the  differential  coefficient  of  F (x).  At  the 
points  a,  b,  the  function  F(x)  possesses  derivatives  on  the  right  and  on  the  left 
respectively,  and  their  values  are  /(a),  /(&). 

If  <f>  (x)  be  a  function  which  at  every  point  of  (a,  6)  has  a  differential 
coefficient,  which  is  a  continuous  function  f(x),  then 

fx 

(f)(x)  —  (f>  (a)  =      f  (x}  dx. 


For  let  I    f(x)  dx  be  denoted  by  F(x\  then  the  function  0  (x)  —  F(x)  has 

J  a 

at  every  point  a  differential  coefficient  which  is  zero,  and  therefore,  by  the 
theorem  of  §267,  the  function  (f>(x)  —  F(x)  is  constant;  it  is  clear  that  this 
constant  must  be  (f)  (a) ;  and  thus  the  theorem  is  established.  In  this  theorem 
and  elsewhere,  a  derivative  at  a  on  the  right,  and  a  derivative  at  b  on  the  left, 
are  included  in  the  term  differential  coefficient. 

345.    If  a  given  bounded  function  f(x)  that  is  integrable  (R)  be  not  every- 
where continuous  in  the  integral  (a,  b),  the  proof  given  above  is  applicable 


to  prove  that,  at  any  point  of  continuity  of  f(x),  the  function  I    f(x]  dx  has  a 

J  a 

differential  coefficient  equal  to  f(x),  and  thus  has  a  differential  coefficient  almost 
everywhere  in  (a,  b). 

That  F(x)  has  a  differential  coefficient  almost  everywhere  has  been  shewn, 
in  §298,  to  follow  from  the  fact  that  F  (x)  is  of  bounded  variation.  It  has 
here  been  proved  independently,  for  the  special  case  of  an  indefinite  integral. 
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At  a  point  of  ordinary  discontinuity  of  /(a?),  the  same  proof,  when  modified 
by  taking  only  positive  values  of  h,  or  only  negative  values  of  h,  and  using 
f  (x  +  0)  or  f(x  —  0),  in  the  two  cases,  instead  of  f(x),  will  shew  that  F(x)  has 
at  such  a  point  derivatives  on  the  right  and  on  the  left,  and  that  these  are 
f(x  +  0),  f(x  -  0)  respectively.  At  a  point  at  which  f(x}  has  a  discontinuity 
of  the  second  kind,  the  proof  fails  altogether;  at  such  a  point  therefore  F(x) 
need  not  possess  a  differential  coefficient,  nor  definite  derivatives  on  the 
right  and  on  the  left,  but  may  have  all  its  four  derivatives  D+F (x),  D+F(x), 
D-F(x\  D-F(x)  of  different  values. 

If  f(x)  be  integrable  (R),  and  consequently  point-wise  discontinuous,  and 
i/r(#)  be  the  function  formed  by  extension  of  the  functional  values  of  f(x~)  at 
its  points  of  continuity,  as  explained  in  §  342,  we  have  f(x)  =  %  (#)  +  ^  (#), 
where  ^  (x}  is  an  integrable  null-function  ;  and  therefore  f  (x)  and  ^-(x)  have 
the  same  indefinite  integral  F(x).  The  derivatives  of  F(x)  are  independent 
of  the  function  ^  (x),  and  depend  only  upon  ^r  (x),  which  is  determined  by  the 
values  of  f(x)  at  its  points  of  continuity. 

rx-t-h 

Since  F  (x  +  h)  —  F(x)  —  \       ^r  (x}  dx,  and  since  the  values  of  -fy  (x)  in  the 


interval  (x,  x  +  h)  all  lie  between  i/r  (x  +  0)  +  ^    and    ty(x+  0)  —  e2,  where 


L   -  F(x  +  h)-  Fix}  ..      , 

1(  e2  converge  to  zero  as  h  does  so,  we  see  that  —  j—  lies  between 

IV 

(x  +  0)  +  d  and  ^(x  +  0)  -  ea,  hence  D+F(x),  D+F(x)  both  lie  between* 


•\fr(x  +  0)  and  ^r(x+  0).  By  taking  h  negative,  we  see  that  D~F(x),  D_F(x} 
both  lie  between  ty  (x  —  0)  and  ty  (®  —^)-  In  case  -^  (x)  be  continuous  on  the 
right,  F(x)  has  a  derivative  on  the  right,  -^(x  +  Q);  and  in  case  ty(x)  is 
continuous  on  the  left,  F  (x)  has  a  derivative  \lr(ae-0)  on  the  left.  It  may 
happen  that  -v/r  (x)  is  continuous  at  a  point  of  discontinuity  of  f(x)  ;  at  such 
a  point  F(x}  has  a  differential  coefficient  equal  to  the  value  of  \Jr  (x).  Even 
when  •&  (x)  has  a  discontinuity  of  the  second  kind,  it  is  possible  that  F  (x) 
may  have  a  differential  coefficient,  or  a  derivative  on  the  right  or  on  the  left, 
or  both. 

If  f  (x}  be  integrable  (R),  and  F  (x)  be  the  corresponding  indefinite  integral, 
any  one  of  the  four  derivatives  DF(x),  of  F(x),  is  integrable  (R),  and  has 
F  (x)  for  its  indefinite  integral. 

For  DF(x)  differs  from  /(&')  only  at  a  point  of  discontinuity  of  f(x),  arid 

*  It  is  stated  by  Schoenflies,  see  Bericht  iiber  die  Mengenlehre,  p.  208,  that  the  derivatives  of 
F(x)  are  equal  to  ^  (x  +  0^,  fj^+O).  \(/(x-0),  ^(^-0}.  This,  however,  is  not  necessarily  the 
case.  It  has  been  shewn  by  Hahn,  Monatshefte  der  Math.  u.  Phi/sik,  vol.  xvi,  p.  317,  that  f  (x) 
may  be  so  chosen  that  the  corresponding  integral  function  has,  at  a  particular  point,  derivatives 
on  the  right  having  arbitrarily  given  values  lying  between,  or  equal  to,  the  values  of  ^(.e  +  0), 
\//  (a'  +  O)  at  the  point;  and  in  particular  that  f(x)  niay  be  so  constructed  as  to  have,  at  the  point, 
a  definite  derivative  which  has  an  assigned  value  between  the  two  limits. 
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at  such  a  point  DF(x)  lies  between  the  upper  and  lower  boundaries  of  -vjr  (x)  ; 
thus  f(x)  —  DF  '  (x)  is  an  integrable  null-function.    Therefore 


=  I*  D+F(x)dx  =  !X  D~F(x}dx 

J  a  J  a 

=  T  D_F(x}dx  =  r  f  (x)  dx  =  F  (x}. 

J  a  la 


It  has  been  shewn  that  the  indefinite  integral  of  a  function  that  is 
integrable  (R)  has  a  differential  coefficient  at  the  everywhere  dense  set  of 
points  of  continuity  of  the  discontinuous  function  ;  there  may  however  also  be 
an  everywhere  dense  set  of  points  at  which  this  continuous  function  does  not 
possess  a  differential  coefficient. 

346.  It  has  been  shewn  that,  if  the  continuous  function  $>(x)  possesses 
everywhere  a  differential  coefficient  f(x]  which  is  everywhere  a  continuous 
function,  then 


</>  (a,-)  -  <f>  (a)  =      /(or)  dx  =  F(x). 

J  a 

This  is  a  particular  case  of  the  following  more  general  theorem: 

If  <j>(x)  be  a  function  continuous  in  the  interval  (a,  b),  and  if  one  of  its  four 
derivatives  D+^>(x\  D+(f)(x),  D~<f>(x),  /)_<£(#)  be  a  bounded  R-integrable 
function  in  (a,  b),  then  each  of  the  other  three  derivatives  is  also  bounded  and 
integrable  (R)  in  (a,  b),  and  </>  (x)  —  $  (a)  is  the  integral  of  any  one  of  the  four 
derivatives  through  the  interval  (a,  x). 

If  (a,  x}  be  divided  into  a  number  of  parts  (a,  Xj),  (x1;  x2),  ...  (xn_^,  x},  it 

has   been  shewn  in  §  280,  that  ™-  lies  between  the  upper  and 

ocr  —  #/•_  i 

lower  boundaries  of  any  one  of  the  four  derivatives  Jj(f>(x)  in  the  interval 
(a-V-i,  #r).    It  follows  that  </>  (x)  —  $  (a)  lies  between  two  sums 


(#1  —  a)  U  (a,  xlt  D)  +  (x2  —  A'J)  U  (#!,  x.,,  D)  -f  .  .  .  +  (b  —  a!n-i 

(xl  -  a)  L  (a,  xl,  D)  +  (xz  -  x^L  (xl,  x.2,  D)  +  ...  +  (b  -  #n_j)  L  (xn^,  b,  D), 

where  U(xr_l,  xr,  D),L  (^V-D  #V.  ^)  ar>e  the  upper  and  lower  boundaries  ofDcfr  (,r) 
in  the  interval  (xr_l}  xr);  and  it  is  known  that  these  are  the  same  for  all  four 
derivatives.  The  limits  of  the  above  sums,  when  the  intervals  are  diminished 
indefinitely,  so  that  the  greatest  of  them  converges  to  zero,  are  the  upper  and 
lower  integrals  of  any  one  of  the  four  functions  ./)</>  (x).  If  it  be  known  that 
any  one  of  these  derivatives  is  integrable  (R)  in  (a,  x},  then  the  upper  and  lower- 
integrals  are  equal,  and  the  other  three  are  also  integrable,  the  common  value 
of  the  integral  being  </>  (x)  —  </>  (a).  Thus 


<j)  (x)  dx  =  f  ^  D+(b  (x)  dx  =  /£-</>  (.r)  dx  =  \D^  (x)  dx. 
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It  should  be  observed  that,  as  has  been  shewn  in  §  280,  the  four  derivatives 
are  all  equal  to  one  another  at  a  point  at  which  one  of  them  is  a  continuous 
function ;  and  thus  at  such  a  point  there  is  a  differential  coefficient.  If  one 
of  the  derivatives  be  integrable,  there  is  therefore  a  set  of  points  of  measure 
equal  to  that  of  the  interval  (a,  6),  at  which  all  four  derivatives  have  equal 
values,  and  at  which  therefore  a  differential  coefficient  exists. 


347.  In  case  D<f>  (x)  be  a  bounded  function  which  is  not  integrable  (R), 
the  above  proof  shews  that  <£  (x)  —  <£  (a)  lies  between  the  upper  and  lower 
integrals,  in  (a,  x),  of  any  one  of  the  four  functions  D(j)  (x}.  This  includes  the 
case  in  which  <£  (x)  has  a  differential  coefficient  which  is  bounded  but  not 
integrable  (R) ;  in  that  case  <$>(x)  —  $  (a)  lies  between 


v  r  x 

</>'  (x)  dx  and    I    <£'  (x)  dx. 

',  J  <i 

Tx+h 

Since  I        cf>'  (x}  dx  is  in  absolute  value  less  than  h  .  U,  where   U  is  the 

-'  X 

upper  boundary  of  <f>'  (x)  in  (a,  6),  it  follows,  as  in  §343,  that  I    <£' (x}dx  is  a 

J  a 

rx 
continuous  function  of  x ;   similarly  it  may  be  seen  that    I    $'  (x)  dx  is  a 

J  a 

continuous  function  of  x.    At  a  point  of  continuity  of  <£'  (x),  both 

~rx  rx 

I    (f)'  (x)  dx,    I    (f)'  (x)  dx 

J  a  J  a 

have  the  differential  coefficient  <£'  (x),  as  may  be  seen  by  a  process  precisely 
similar  to  that  in  §  344.  Thus  the  upper  and  lower  integrals  of  (/>'  (x)  possess 
properties  similar  to  those  of  the  integral  of  <f>'  (x),  when  it  exists,  and  both  of 
them  may  be  regarded  as  primitives  of  (/>'  (x),  in  the  extended  sense. 

The  function  D</>  (x),  when  not  integrable  (R),  may  be  a  non-integrable 
point-wise  discontinuous  function,  or  it  may  be  totally  discontinuous. 

If  f  (x)  be  any  non-integrable  bounded  function,  the  following  theorems 
may  be  established  by  proofs  similar  to  those  in  §343  and  §  345: 


The  upper  and  lower  integrals  I    f  (as)  due,    I    f(x}dxare  continuous,  and 

J  a  J  a 

of  bounded  variation  in  (a,  6). 

At  any  point  of  (a,  b),  at  'which  f  (x}  is  continuous,  the  upper  and  lower 

1X  /"''' 

integrals   I    f(x}dx,  I    f(x)  dx  each  possess  a  differential  coefficient  which  is 

J  a  J_a 

equal  to  f  (x). 
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348.  An  important  general  class  of  continuous  functions  for  which  the 
four  derivatives  are  not  integrable  (R),  even  when  a  differential  coefficient 
exists,  or  when  derivatives  on  the  right  and  on  the  left  always  exist,  is  the 
class  of  everywhere-oscillating  functions.  Those  functions  which  become 
everywhere-oscillating  functions  when  a  linear  function  is  added  have  the 
same  property. 

If  a  derivative  DF(a)  be  such  that,  in  every  interval,  it  has  no  finite 
upper  boundary  or  no  finite  lower  boundary,  it  is  certainly  not  integrable  (R); 
it  is  therefore  only  necessary  to  consider  an  interval  in  which  the  function 
DF(x)  is  bounded.  Let  (a,  x)  be  such  an  interval,  and  let  us  suppose  that 
F(x)  —  F(a)  is  not  zero. 

In  every  interval  (xr-i,  xr)  contained  in  (a,  ac),  U  (#V-i>  ®r,  D)  the  upper 
boundary  of  DF(x)  is  positive,  and  L(xr^1,xr,  D)  the  lower  boundary  of 
DF(x}  is  negative;  thus  the  two  sums 

(xl  —  a)  U(a,  xl,  D)  +  (x.2  —  #,)  U  (aslt  as2,  D)+  ... 


(#!  —  a)  L  (a,  x1,  D)  +  (#2  —  x^  L  (xl,  x.,,  />)+... 

+  (b  —  #„_!)  L  (#„_!,  b,  D), 

are  such  that  the  first  is  essentially  positive,  and  the  second  essentially  negative, 
the  non-  vanishing  number  F(x}  —  F(a)  lying  between  them. 

It  follows  that  the  limits  of  these  two  sums,  as  the  number  of  sub-divisions 
of  (a,  x]  is  increased  indefinitely,  must  be  different  from  one  another,  since 
they  cannot  have  zero  as  their  common  value  ;  thus 


DF(x)dx,         DF(x)dx 

^a 

are  distinct  from  one  another. 

It  has  thus  been  proved  that  a  continuous  function  which  is  everywhere- 
oscillating  in  (a,  b)  cannot  have  a  derivative  which  is  integrable  (R)  in  (a,  b), 
even  if  it  have  everywhere  a  differential  coefficient,  or  definite  derivatives  on  the 
right  and  on  the  left. 

The  function  DF  '  (x},  or  /(#)>  m  tne  case  °*  snc^  function,  may  be  a 
point-wise  discontinuous  function  such  that  the  measure  of  the  set  of  points  of 
discontinuity  is  greater  than  zero,  or  it  may  be  a  totally  discontinuous  function. 

A  continuous  monotone  function,  which  is  not  reducible  to  a  function  with 
an  infinite  number  of  oscillations  by  the  addition  of  a  linear  function,  has  at 
every  point  definite  derivatives  on  the  right  and  on  the  left,  each  of  which  is 
either  continuous  or  is  an  .R-integrable  discontinuous  function,  since  either 
derivative  has  only  ordinary  discontinuities.  Thus  such  a  function  has 
72-integrable  derivatives,  provided  these  derivatives  are  bounded  in  the 
interval. 
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In  case  the  continuous  function  F  (x)  have  a  differential  coefficient,  or  a 
derivative  which  is  not  everywhere  finite,  or  is  not  bounded  in  the  interval, 
this  derivative  is  not  integrable  in  the  sense  in  which  we  have  hitherto  defined 
integration.  This  case  will  be  considered  in  connection  with  the  theory  of 
improper  integrals. 

349.  The  preceding  investigations  provide  answers  to  the  questions  which 
arise  as  regards  the  validity  of  the  two  propositions  (A)  and  (B)  of  §  343, 
which  together  constitute  the  fundamental  theorem  of  the  Integral  Calculus, 
asserting  that  the  operations  of  differentiation  and  of  integration  are  in 
general  reversible.  The  definition  of  'a  definite  integral  has  hitherto  been 
restricted  to  that  of  Riemann,  and  is  applicable  to  bounded  functions  only. 
The  extensions  of  that  definition  to  the  case  of  unbounded  functions,  which 
will  be  considered  later,  and  also  the  more  general  definition  of  integration 
due  to  Lebesgue,  to  be  considered  in  Chap,  vn,  will  lead  to  corresponding 
extensions  of  the  scope  of  the  fundamental  theorem. 

f* 
As  regards  the  theorem  (A),  that  the  indefinite  integral  F(x)  =  \    f(x)  dx 

a, 

of  a  bounded  jR-integrable  function  possesses  a  differential  coefficient  equal, 
at  a  point  x  of  (a,  b),  to  f(x\  it  has  been  shewn  that  the  theorem  holds 
without  restriction  in  case  f(x)  is  a  continuous  function ;  but  that,  if  f(x)  be 
not  continuous,  the  theorem  still  holds  as  regards  every  point  of  continuity  of 
f(x).  It  follows  that  the  points  of  (a,  b)  at  which  F  (x}  either  possesses  no 
differential  coefficient,  or  possesses  one  which  is  not  equal  to  f(x\  form  a  set 
of  measure  zero,  which  may  however  be  everywhere  dense  in  (a,  b). 

The  theorem  (B)  that,  if  <£  (x}  possess  a  differential  coefficient  f(x),  then 

f* 
the  corresponding  indefinite  integral  F{x)  =      f(x)  dx  differs  from  </>  (x)  only 

J  a 

by  a  constant,  holds  if  f(x)  be  a  continuous  function,  arid  more  generally,  if 
f(x)  be  bounded  and  integrable  (R).  In  case  <£  (x)  does  not  at  all  points  possess 
a  differential  coefficient,  the  more  general  theorem  is  applicable  that,  if  any 
one  of  the  four  derivatives  of  <£  (x)  be  bounded  and  integrable  (R),  then  the 
integral  function  corresponding  to  that  derivative  differs  from  0  («.•)  by  a 
constant  only.  The  theorem  fails  either  in  case  <£  (x)  be  not  a  function  with 
bounded  derivatives,  or  in  case  it  be  a  function  with  bounded  derivatives,  but 
those  derivatives  do  not  satisfy  Riernann's  condition  of  integrability. 

The  problem  of  the  determination  of  a  continuous  function  which  shall 
have  a  given  function  f(x)  for  its  differential  coefficient,  at  every  point  at 
which  f(x}  is  continuous,  may  be  considered  here  in  the  case  in  which  /(.?')  is 
restricted  to  be  bounded  in  the  interval  (a,  b)  for  which  it  is  defined.  This 
problem  of  the  determination  of  such  a  primitive  of /'(•*')  is  regarded  as  having 
a  unique  solution  provided  functions  exist  which  satisfy  the  condition,  and 
further  provided  any  two  such  functions  differ  from  one  another  by  a  constant 
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only,  that  constant  having  one  and  the  same  value  for  the  whole  interval 
(a,  b).  In  the  first  place,  the  problem  cannot  be  determinate  unless  f(x)  be 

7*  fx 

integrable  (R)',  for  either  of  the  two  functions  I    /(#)  dx,  I    f(j.')dx  satisfies 

J  a  -'a 

the  condition  of  the   problem,  and   these  functions   do  not  differ  from  one. 

another  by  a  constant,  as  they  both  vanish  at  the  point  a,  and  are  elsewhere 

rx 

unequal.    Next,  if  f(x)  be  integrable  (R),  the  function      f(x)  dx  satisfies  the 

J  a 

condition  of  the  problem,  but  the  solution  is  not  necessarily  unique.  In  case 
however  the  points  of  discontinuity  of  the  jR-integrable  function  f(x)  form  an 
enumerable  set,  the  theorem  of  §267  shews  that  the  solution  is  determinate; 
for  any  two  functions  which  have  equal  finite  differential  coefficients  at  all 
points  of  (a,  b)  except  those  of  an  enumerable  set,  differ  from  one  another  by 

fx 
a  constant.     In  this  case  I    f(&)  dx  +  C  is  the  function  required.     When  the 

J  a 

points  of  discontinuity  of  the  integrable  function  f(x)  form  an  unenumerable 
set,  with  a  perfect  nucleus,  although  that  set  must  have  zero  measure,  the 


problem  has  not  a  unique  solution.    For,  although   I     f(x)dx  is  a  function 

J  a 

which  has  the  required  property,  another  solution  is  obtained  by  adding  to 
it  any  continuous  function  which  has  all  the  intervals  complementary  to  the 
perfect  component  of  the  unenumerable  set  as  lines  of  invariability;  that  such 
functions  exist  has  been  established  in  §  269.  There  exists  however  only  one 


function,  viz.      /'(&')  dx,  with  bounded  derivatives,  which  satisfies  the  condition 

.  a  ^ 

of  the  problem;  for  it  has  been  shewn  in  §286,  that  any  two  functions  which 
have  bounded  derivatives,  one  of  which  derivatives  is  prescribed  at  all  points 
not  belonging  to  a  certain  set  of  measure  zero,  differ  from  one  another  by  a 
constant. 

Similar  remarks  apply  to  the  more  general  problem  of  the  determination 
of  a  function  which  shall  have  one  of  its  four  derivatives,  say  the  upper  one 
on  the  right,  equal  to  a  given  function  f(x)  at  every  point  of  continuity  of 
f(x).  This  problem  has  a  solution  whenever  f(x)  is  bounded;  in  virtue  of  the 
theorem  of  §  267,  the  solution  is  unique  when  _/(.?;)  is  integrable  (R),  and  the 
points  of  discontinuity  of  f(x)  form  an  enumerable  set.  When  f(x)  is 
integrable  (R),  and  the  set  of  points  of  discontinuity  is  unenumerable,  there 
exists,  in  virtue  of  the  theorem  of  §  286,  only  one  solution  for  which  the 
derivatives  are  bounded.  As  before,  if  the  restriction,  that  the  required 
function  is  to  have  bounded  derivatives,  be  not  imposed,  the  solution  of  the 
problem  is  indeterminate. 
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EXAMPLE. 

The  following  example  was  given*  by  Volterra,  as  the   first  case  of  a  continuous 
function  possessing  a  bounded  differential  coefficient,  not  integrable  (R). 

Let  G  be  a  perfect  non-dense  set  of  points  in  the  interval  (a,  b),  and  such  that  its 
content  is  greater  than  zero.     Let  («,  /3)  be  an  interval  complementary  to  the  set  G,  and 

let  d>  (x.  a)  =  (x  —  a)2  sin  —  :— ,  and   therefore  d>'  (x,  a)  =  2(x  —  a)  sin       cos-  The 

X-a.  X-a  X-a 

function  </>'  (x,  a)  vanishes  at  an  infinite  number  of  points  in  (a,  /3) ;  let  a  +  y  be  the  greatest 
value  of  x  which  does  not  exceed  ^(<«  +  /3),  for  which  <p'  (x,  a)  vanishes.  Let  F  (x}  —  Q  at 
every  point  of  G,  and  in  each  interval  (a,  #)  complementary  to  G,  let  F  (x)  =  $  (x,  «),  for 
values  of  x  such  that  a^x  ^a  +  y;  let  F(x)  =  <j)  (a  +  y,  a),  for  values  of  x  such  that 
a  +  y  ^  x  <.  ft-y  \  and  let  F  (x)  =  -  $  (A;  /3),  for  ft  —  y£x£fi.  The  function  F  (x)  is 
continuous,  and  has  everywhere  a  finite  differential  coefficient  which  is  limited  in  the 
interval  (a,  b).  It  is  easily  seen  that  /"  (x}  vanishes  at  every  point  of  G.  The  function 
F'  (x)  has  a  discontinuity  of  measure  2  at  each  point  of  the  set  G,  which  is  not  of  zero 
content,  and  therefore  F'  (x)  is  not  an  integrable  function. 

FUNCTIONS   WHICH   ARE   LINEAR   IN   EACH    INTERVAL   OF   A   SET. 

350.  The  existence  of  continuous  functions  which  are  linear  in  each 
interval  of  an  everywhere  dense  set  of  intervals  has  been  already  referred  to 
in  §  274.    It  has  been  shewn  in  §  269  how  a  function  f(x)  can  be  constructed 
which  is  continuous,  and  has  as  lines  of  invariability  the  intervals  comple- 
mentary to  a  non-dense  perfect  set  of  points.     It  is  clear  that  the  integral 

function  I    f(x)  dx  is  linear  in  each  of  the  intervals,  and  being  also  continuous, 

J  a 

it  is  a  function  of  the  type  referred  to.     A  more  general  function  which  is 

continuous,  and  is  linear  in  each  interval   of  the  set,  may  be  obtained  by 

rx 
adding  to  I    f(x)dx  any  continuous  function  for  which  the  intervals  of  the 

set  are  lines  of  invariability. 

INTEGRATION    BY    PARTS. 

351.  If  u,  v  denote  functions  of  x,  defined  for  the  interval  (a,  b),  and  which 

have  continuous  differential  coefficients    ,        -  in  that  interval,  the  formula 

dx   dx 

d  (uv)         dv        da 
dx  dx        dx 

leads  at  once,  by  integration  over  the  interval  (a,  b),  to  the  well-known  formula 

b      dv   ,        [l>     da  , 
u  -v-  dx  +  I     v  -r  dx  —I  uv 


for  integration  by  parts,  due  to  Leibnitz. 

*  Giorn.di  Hattaglini,  vol.  xix,  1881. 
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The  simplest  generalization*  of  this  formula  is  that,  if  f(x),  g(x)  are 
bounded  and  integrable  (R)  in  the  interval  (a,  6),  then 

rb  C  rx  \  rb  (   rx  )  rb  rb 

f(x)  4  I     g  (x)  dx  [  dx  +      g  (x)  \  I    f(x)  dx\dx=\    f(x)  dx  I    g  (x)  dx. 

Ja  (J  a  j  Ja  [J  a  }  Ja  Ja 

To  prove  this  formula,  we  employ  the  identity 

r=p        r=p  r=p  r=p 

2n          >^      A?     —        >      /v     o  [        >       /Q    n 

i/j*      *w    A-'T*  —     *~*    Wyo  f i     i^    —     f^jfOf. 

r=\         r=\  r=l  r=\ 

where  sr  denotes  ^  +  a2  +  . . .  +  ar,  and  6-'r-i  denotes  &  +  J3.2+  ...  +  @r-\- 

In  a  net  Dn  fitted  on  to  the  interval  (a,  b),  let  ar  —  ^>rM^  (8r),  where  Ml  (8r) 
is  a  number  not  greater  than  the  upper  boundary  of/(#)'in  the  mesh  8r,  nor 
less  than  the  lower  boundary  of  f(x)  in  8r;  let  /3r  be  8rM2(8r),  where  M2(8r) 
is  defined  similarly  in  relation  to  g  (x).  We  have  then,  if  p  =  mn, 

•*?  £     Tif    /  £   \       ^         S>     Tt/f    /  S;  \  V     '       £     71/T    /^  \      V 

2,       orMl{or)     2<      OtM2\ot)\  +    2,        o,. M2(or)    2* 
r=l    [_  t=l  J         r=\    L  t  =  l 

=  t?n8rM1(8r)^"n8rM2(8r)  (1). 

r  b  r  =  mn 

Let  n  be  so  great  that       f(x)dx  differs  from  both     S    8rU1(8r)  and 

Jo  r  =  1 

r=m»,  rb 

2,    8rLl(8r)  by  less  than  77;  and  so  great  that      g  (x)  dx  differs  from  both 

"2,    Sr{72(<V)  an(l     2     8rL.,(8r)  by  less  than  77;  where  77  is  an  arbitrarily 

chosen  positive  number.    U1  (8r),  L^  (8r)  are  the  upper  and  lower  boundaries  of 
f(x)  in  the  mesh  8r,  and  U2  (8r),  L.2  (8r)  those  of  g  (x). 

Then  it  is  clear  that,  for  every  value  of  r, 

2  8tUl(8t)—  I     f(x)  dx  <  77,  and  I   rf(x}dx—  2   8tLl(8t)<tj, 

^  =  1  J  a  J  a  t  =  \ 

hence   I   '  f(x)dx  and  2  8tM1(8t)  differ  from  one  another  by  less  than  77. 

J  a  t  =  l 

A.  similar  result  holds  with  relation  to  the  function  g(x);  the  mesh  8r  being 
If  1/j  (8,.)  =/(#,._i ),  and  M2  (8r)  =  g  (^r_a),  we  see  that 

t  =  r 


r=l 

j'=i«n  p- 

differs  from    2     8rf(xr_l)  g  (x)  dx  by  less  than  77   S    Sr  j  y  (*V-i)  i  or  by 

j-  =  l  Jrt  ?•  =  ! 

less  than  A  (b  —  a)  77,  where  J.  is  the  upper  boundary  of   f(oc)  \  in  (a,  b).    A 
similar  result  holds  with  respect  to  g  (x). 

*  See  Thomae,  Zeitschr.  f.  Math.,  vol.  xx,  pp.  475-478;  also  Hardy,  Messenger  of  Math., 
vol.  xxx,  pp.  185-187,  and  vol.  XLVIII,  p.  90. 
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As  17  is  arbitrarily  small,  we  now  see  that,  when  n  *>  oo ,  the  formula 
obtained  by  proceeding  to  the  limit  of  the  expressions  in  (1)  gives  the  above 
formula  for  integration  by  parts. 

The  formula  is  equivalent  to  a  somewhat  more  complex  formula  which  was 
given*  by  Du  Bois  Reymond. 

Let  a,  #  be  fixed  points  in  the  interval  (a,  b).     Writing 

F  (x)  =  [*/(«)  dx,  G  (x)  =  T  g  (x)  dx, 

J  a  J  a 

the  above  formula  becomes 

(  f(x)  G(x)dx+t  g  (x}  F  (x)  dx=[  f(x)  dx  !  g  (x)  dx. 

J  a  -fa  Jo  J  a 

If  we  subtract  from  each  side  the  expression 

G  (/3)  \b  f(x)  dx  +  F  (a)  f b  g  (x)  dx, 

J  a  J  a 

and  express  each  of  the  integrals  on  the  right-hand  side  as  the  sum  of  two 
integrals,  taken  from  a  to  a  and  from  a  to  b,  in  the  first  case,  and  from  a  to  /3 
and  0  to  6,in  the  second  case,  we  find,  after  a  little  reduction,  Du  Bois  Reymond's 
formula  for  integration  by  parts: 

rb  (  rx  ]  rb  f  rx  •) 

/(«)  1       9  OB)  dx  \  dx  +       g(x}\\    f(x)  dx  \  dx 

Ja  (J  j3  -'a  [Jm 

'x  fx  ~~|  b 

f(x)dx\    g(x)dx 


This  formula  reduces  to  the  earlier  one,  if  we  take  a  =  /3  =  a. 

The  case  of  functions  f(x(l],  x(2}),  g  (x(l),  x{'2)),  both  integrable  (R)  in  a  cell 
(a(1),  a(2);  b(1),  6(2)),  may  be  considered  by  applying  the  formula  (1),  when  the 
meshes  8  are  those  of  a  net  of  a  system  fitted  on  to  the  cell.  The  form  of  the 
resulting  expression  when  we  proceed  to  the  limit  will  depend  upon  the  order 
in  which  the  meshes  of  the  net  Dn  are  arranged  in  the  series.  If  we  count  the 
meshes  from  left  to  right,  first  taking  the  lowest  row  along  the  xw  axis,  then 
the  next  row  also  from  left  to  right,  and  so  on,  it  is  easily  seen  that  the 
formula  obtained  is 


f(6(D,  6(2))  f   f(Mi),  .r(2))  "1 

f(xW,  X®)  a  (tf(1),  x®)  d  (x®,  #(2)) 

J  (aW,  a(2))  J  L^(«(1).  «(2)) 


, 

d  x*    x®  <",  a-'2>    d 


This  is  one  of  the  formulae  analogous  to  the  formula  for  integration  by 
parts  in  the  case  of  functions  of  one  variable. 

*  Abhandlungen  d.  Miincli.  Akad.,  vol.  xn,  p.  129. 
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CAUCHY  S    DEFINITION   OF   AN   IMPROPER   INTEGRAL. 

352.  The  definition  of  the  12-integral  of  a  function  f(x),  in  a  linear 
interval  (a,  b),  presupposes  that  f(x)  is  bounded  in  (a,  6).  A  method  was 
given  by  Cauchy  of  extending  the  definition  to  cases  in  which  f(x)  is  un- 
bounded in  the  neighbourhoods  of  points  of  a  finite  set. 

Let  us  suppose  that  a  point  c,  where  a<  c  <  b,  is  such  that,  in  its  arbi- 
trarily small  neighbourhood,  the  absolute  values  of  the  function  have  no  upper 
limit,  and  let  us  suppose  further  that  c  is  the  only  point  of  this  kind,  and  that 
the  function  /(a?)  is  integrable  in  any  sub-interval  of  (a,  6)  which  does  not 
contain  c  within  it,  or  at  an  end.  The  two  integrals 

rc-e  rb 

f(x)dx,  f(x)dx 

.'a  J   c  +  e' 

both  exist,  whatever  sufficiently  small  positive  values  be  assigned  to  e,  e'. 

It  may  happen  that,  as  e,  e'  are  diminished  independently,  so  as  to  converge 
in  each  case  to  the  limit  zero,  the  two  integrals  also  converge  to  definite  limits  ; 
if  this  be  Hie  case,  we  define  the  sum 


/•«-«  _      rb 

lim  I       f(x)  dx  +  lim          f(®)  dx 

e~0  J    a       '  e'~0  J    c  +  e' 


to  be  the  improper  integral  of  f(x)  in  the  interval  (a,  b),  and  we  denote  this 
improper  integral  by 

•b 

.  f(x)  dx, 

.    a 

using  the  same  notation  as  in  the  case  in  which  f(x)  is  integrable  (R)  in  (a,  6). 
The  condition  that 

/•C-e 

lim         /(#)  dx 

e~0  J  a 

should  exist,  is  that,  corresponding  to  each  arbitrarily  small  number  8  which 
may  be  chosen,  a  number  ^  can  be  found,  such  that 

/'C-e, 

f{x)  dx  <  8, 


e- 


whatever  value  d  may  have,  subject  to  the  condition  0  <  9  <  1. 

A  similar  condition  must  be  satisfied  in  order  that 

rb 

lim          /  (x)  dx 

e'~0  J   ,f+e' 

may  exist. 

It  may  happen  that,  although  the  two  limits 
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do  not  exist,  yet  if  we  take  e'  =  e,  the  sum 

r  /(*)<** +f  f(x)dx 

J  a  J  c+t 

may  have  a  definite  limit ;  when  this  is  the  case,  the  limit  defines  Cauchy's 
principal  value  of  the  integral  of/(#),  in  (a,  b). 

It  thus  appears  that  a  principal  value  may  exist  wheri  the  function 
possesses  neither  an  integral  nor  an  improper  integral  in  the  interval  (a,  b). 

In  case  the  point  a  itself  be  a  point  of  infinite  discontinuity,  then  the  limit 

fb 

f(x}dx, 

J  a+e 

for  e  ~  0,  when  it  exists,  is  defined  to  be  the  improper  integral 

\*f(x)dx 

J  a 

of  f(x)  in  the  interval  (a,  b). 

A  similar  definition  applies  in  case  the  point  b  is  a  point  of  infinite 
discontinuity  of  the  function. 

If,  in  the  interval  (a,  6),  there  are  two  points  of  infinite  discontinuity 
d,  C2  (where  a<c1<c2<  b),  let  c  be  any  point  between  Cj  and  c2.  Then  in  case 
the  four  improper  integrals 

/•PI  rc  rcz  rb 

I   f(x)dx,        f(x)dx,    I  f(x)dx,     I   f(x)dx 

J  a  J  ci  Jo  J  02 

all  exist,  their  sum  is  defined  to  be  the  improper  integral  of  f(x)  in  (a,  6), 
and  is  denoted  by 

'6 

f(x)dx; 


and  it  is  clearly  independent  of  the  value  of  c.  The  definition,  in  case  one 
of  the  points  c1}  c2  is  an  end-point  of  the  interval  (a,  6),  is  of  the  same 
character  ;  or  both  of  them  may  be  end-points.  If  cl  =  a,  c.2  =  b,  then  if  the 

two  improper  integrals 

rc  rb 

f(x}dx,        f(x)dx 

J  a  J  c 

exist,  their  sum  defines  the  improper  integral 

i 
f(x)  dx. 

The  definition  of  an  improper  integral  can  now  immediately  be  extended 
to  the  case  in  which  there  are  any  finite  number  of  points  of  infinite  discon- 
tinuity in  the  interval.  If  these  -be  cn  c.2,  c3,  ...  cn,  taken  in  order  from  left 
to  right,  and  the  improper  integrals 


H. 


30 
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all  exist,  their  sum  is  defined  to  be  the  improper  integral 

•6 

f(x)dx. 


The  definition  can  be  extended  to  the  case  in  which  there  is  an  infinite 
number  of  points  of  infinite  discontinuity  off(x),  forming  a  set  G,  of  the  first 
species,  and  of  the  first  order.  In  that  case  G'  consists  of  a  finite  number  of 
points  d,  c2,  ...  cn. 

If  the  improper  integrals 

rci-e,  /v2-*2  rc3-f3 

f(x}dx,    I         f(x)dx,  f(x}dx,..., 

•  'a  J  ci+e/  ./  e2+f2' 

each  of  which  falls  under  the  last  case,  all  exist,  and  have  values  which  converge 
to  definite  numbers  as  e1;  e/,  e2,  e2'>  •••  converge  to  zero,  independently  of  one 
another,  the  sum  of  their  limits  is  taken  to  define  the  improper  integral 

•b 

f(x)  dx. 

It  is  clear  that  this  definition  admits  of  extension  to  the  case  in  which  G 
is  of  the  first  species,  and  of  any  order. 

It  is  easily  seen  that  an  integral  which  exists  in  accordance  with  this 
definition  is  such  that  the  integral  also  exists  in  the  interval  (a,  x),  where  x  is 
any  point  in  (a,  b),  and  that  it  is  a  continuous  function  of  x.  Moreover 

f(x}dx= 

This  definition  of  an  improper  integral  was  given*  by  Du  Bois  Reymond, 
for  the  case  in  which  G'  is  finite.  The  definition  has  been  extended  f  by 
Schoenflies  to  the  case  in  which  G  is  enumerable,  but  possesses  transfinite 
derivatives. 

353.  When  the  integral  of  a  function  f(x),  with  a  finite,  or  an  enumerable, 
set  of  points  of  infinite  discontinuity,  exists,  it  may,  or  may  not,  be  the  case 
that  \f(x)  j  has  an  integral  in  the  same  interval. 

The  improper  integral 

\b  f(x}dx 

J  a 

is  said  to  be  absolutely  convergent  in  the  interval  (a,  b),  in  case 

\f(x}\dx 

exists  ;  otherwise  it  is  said  to  be  conditionally,  or  non-absolutely  ,  convergent. 

The  integral  is  said  to  be  absolutely  convergent  at  a  point  c,  of  infinite 
discontinuity  of  the  function,  when  both  the  integrals 


C  -t 

*  Crelle's  Journal,  vol.  LXXIX.  t  See  Bericht,  vol.  i,  p.  185. 
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exist,  provided  e  be  sufficiently  small.    If  either  of  these  does  not  so  exist, 
the  integral 

rb 

f  (as)  das 

J  a 

is  non-absolutely  convergent  at  the  point  c. 

An  absolutely  convergent  integral  is  absolutely  convergent  at  each  of  the 
points  of  infinite  discontinuity  of  the  function.    The  converse  also  holds. 

rb  rb 

If    I    \f  (ad)  \das  exists,  then       f(x)dx  exists.    For,  if  c  be  a  point  of 

J  a  J  a 

infinite  discontinuity  of  /(#),  we  have 

-C+e 


/  (x)  dx 


f(x) 


for  all  sufficiently  small  values  of  e,  whatever  value  e'  (<  e)  may  have.    The 
corresponding  condition,  on  the  other  side  of  c,  is  also  seen  to  be  satisfied.    The 


condition  of  convergence  of  I    f  (x)  dx,  at  the  point  c,  is  therefore  satisfied. 

J  a 

EXAMPLES. 

fc  1  1 

1.    The  integral    /    -  sin  -  dx  exists,  but  is  non-absolutely  convergent.     For  consider 

J  o  X  X 


1  .  1 

-  sin  - 

X          X 


We  have 


taken  over  the  set  of  intervals 


)  where  n  =  ly  2,  3,  ...  m. 

' 


-  sin- 

X          X 


hence  the  integral  taken  over  the  set  of  intervals  is 


n=m 

n 

71=1 


I    .     1 
-  sin  - 


thus,  as  TO  increases  indefinitely,  the  value  of  the  integral  does  so,  and  therefore 

cannot  be  integrable  in  the  interval  (0,  1),  in  accordance  with  Cauchy's  definition.     On  the 

other  hand   I     -  sin  -  dx  converges  to  a  finite  limit,  as  e~0,  so  that  the  integral  /    -  sin  -dx 

J  e    X          X  J  §X          X 

exists  in  accordance  with  Cauchy's  definition,  but  it  is  not  absolutely  convergent. 

2.  Let  f(x)  denote  a  function  which  is  integrable  (R)  in  every  interval  (a,  6),  where 
0<  a<b  ;  and  let  f(x)  be,  in  the  neighbourhood  of  the  point  0,  of  the  form  — ^-- ,  where 
k  is  positive,  and  <f>  (x}  ia  a  bounded  function,  denned  for  the  interval  (0,  c). 

We  have 


30—2 


where  A  is  some  positive  number. 
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/Y  &  !x\ 
If  0<£<1,  it  is  clear  that    I      •    .     dx  is  arbitrarily  small,  for  a  sufficiently  small 

J  e       & 

Cc 

value  of  e  (>e),  and  therefore  the  improper  integral  I    f(x)dx  exists,  being  convergent 

J  o 
at  the  point  #=0.    If  £^1,  the  improper  integral  does  not  exist. 

3.     Let  /(#)  be,  in  the  neighbourhood  on  the  right  of  the  point  0,  of  the  form 

(x] 

^  '  — ,  where  p  is  positive,  and  d>  (&•)  satisfies  the  same  condition  as  in  Ex.  2. 

l+p 


We  have 

00) 


-  {[log  *]-*>-  [log  €']-»}; 

re 

and  thus  the  improper  integral   I   f(x]  dx  exists,  being  absolutely  convergent. 

Jo 

4.     I  —    —  dx,  taken  through  any  interval  which  contains  a  point  of  infinite  discontinuity 

J      x 
of  tan  x,  does  not  exist. 

„  f2      tan  x  ,        2  ,       sin  e 

For  —  dx  >  —  log  -; — r , 

J  TT          x  TT     °  sin  e 

and  this  is  arbitrarily  great,  for  a  sufficiently  great  value  of  e/e' ;  thus  the  integral  does  not 
converge  at  the  point  x=\ir.  The  integral  possesses  however  a  principal  value  at  the 
point  \TT.  For  the  sum  of  the  integrals  taken  through  the  intervals  (in-  -  <?,  TTTT  -  e')  and 
is 


— g 2  dx  <  IT  (JTT'"'  -  e2)  ~ l  sin  e, 

and  this  converges  to.  0,  with  e. 

-      1    -  * 

5.    The  function  cos  (ex}  +  -  exs\n  (ex)  oscillates  between  indefinitely  great  positive  and 

negative  values,  in  the  neighbourhood  of  the  point  #=0.    For  every  value  of  x,  except  #  =  0, 

,  i 

the  function  =  -y-  {x  cos  (ex}}. 

[e    d  -  -  - 

Also  I    -T-  {x  cos  (ex}}  dx  =  (  cos  (e<?)  -  e'  cos  (ee/), 

_/ «'  aa; 

where  e  >  e'  >  0.    It  thus  appears  that  the  integral  of  the  function  converges  at  the  point 
,v  =  0;  and  therefore  the  function  is  integrable  in  an  interval  containing  that  point. 

RIEMANN  INTEGRALS  OVER  AN  UNBOUNDED  INTERVAL. 

354.  The  definition  of  the  integral  of  a  bounded  integrable  function  given 
in  §  330  is  applicable  only  to  the  case  in  which  both  the  limits  a,  b  are  definite 
points,  and  in  which  therefore  the  interval  of  integration  is  finite. 

Let  .T!,  #2>  •••  #n.  •••  be  a  sequence  of  increasing  numbers  having  no  upper 
limit ;  it  may  then  happen  that  the  sequence  of  ^-integrals 

CXi  /*#2  f'Xn 

t'(<r\rlv  \       f('r\dr  f(t'\rl^r 

I   \w )  Lvwj  I       J   \J~> )  UrtVj    ...  I         /   \w  t  \Aj*h,    .  .  . 

.'a  -JO,  -a 
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has  a  definite  limit  A,  independent  of  the  particular  sequence  [xn]  chosen.' 
When  this  is  the  case  f(x)  is  said  to  have  an  integral  (R) 

> 
f(x)  dx, 

in  the  unbounded  interval  (a,  oo  ),  the  value  of  this  integral  being  A.  It  has 
been  presupposed  that,  in  every  interval  (a,#),the  function/(#)  is  integrable  (R). 

If  the  integrals 

rb  rb  rb 

f(x)dx,          f(x}dx,...  f(x)dx,..., 

J  X,  J  X*  -1  Xn 

where  x1}  x2,  ...  xn,  ...  is  a  sequence  of  decreasing  values  of  x  which  has  no 
lower  limit,  all  exist,  and  the  sequence  of  integrals  have  a  limit  B,  independent 
of  the  particular  sequence  chosen,  the  limit  B  is  denoted  by 

rb 

f(x)  dx. 


If  the  two  integrals 

r*  ro 

/O)  dx,  f(x}  dx, 

JO  J    -00 

as  thus  defined,  both  exist,  their  sum  is  denoted  by 

,« 

J(x)dx. 

J     -'JO 

The  three  numbers 

f  f(x)dx,      I"    f(x}dx,      r    f(x)dx, 

J  a  J  —  oo  J  —  oo 

being  the  limits  of  integrals,  and  not  themselves  in  the  proper  sense  of  the 
term  integrals,  belong  to  the  class  of  improper  integrals. 

In  each  case  it  is  necessary,  but  not  sufficient,  for  the  existence  of  these 
improper  integrals,  that  f(x)  be  integrable  (H)  in  every  finite  interval  con- 
tained in  the  intervals  (a,  oo  ),  (—  x  ,  6),  or  (—00,00);  and  it  will  at  present  be 
assumed  that  f(x)  is  bounded  in  every  such  finite  interval,  and  has  therein  a 
proper  integral. 

re 

In  case  the  integral  I       f(®)  dx  have  a  definite  limit,  as  c  is  indefinitely 

J  —c 

increased,  that  limit  is  said  to  be  the  principal  value  of 


This  principal  value  may  exist,  even  when  the  integral 

.« 

f(x)  dx, 

J     —00 

as  defined  above,  does  not  exist ;  but  in  case  the  latter  does  exist,  its  value  is 
equal  to  its  principal  value. 
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The  necessary  and  sufficient  conditions  for  the  existence  of  the  integral 

c 

f  (x)  dx, 


are  (1)  that  the  integral  exists  in  every  interval  (a,  x),  where  x>a,  and  (2)  that, 
corresponding  to  every  arbitrarily  chosen  positive  number  e,  a  value  £,  of  x,  can 
be  found  such  that 

I    f(x)  dx  <  e, 
J  f 

for  every  value  of  £',  such  that  £'  >  £. 

A  similar  condition  applies  to  the  case  of 

•b 

f(x)  dx. 

D 

355.    It  was  shewn  in  §  332  that  the  necessary  and  sufficient  condition  that 
the  bounded  function  f(x)  be  integrable  (R)  in  the  interval  (a,  b)  is  that,  for 

m  -  mn 

a  particular  system  (Dn)  of  nets,  fitted  on  to  (a,  b),  the  sum    2    8m(n}F(Sm(n}) 

m—l 

should  converge  to  zero,  as  n  ~  oo  .  It  was  also  shewn  that,  if  this  condition 
be  satisfied  for  one  system  of  nets,  it  is  satisfied  for  all  such  systems. 

We  have  to  enquire  how  far  a  corresponding  condition  applies  to  the  case 
of  an  ^-integral  through  an  infinite  interval. 

r°°  rb 

Since       f(x)  dx,  when  it  exists,  is  the  limit  lim      /  (x)  dx,  we  see  that 

J  a  6~oc  J  a 

the  integral  is  given  as  the  repeated  limit  lim  lim  XSm(M)  U(Sm(n)),  or  as 
lim  lim  SS/n(n)i  (S,n(n)).  The  question  then  arises  whether,  or  under  what  con- 
ditions, the  order  of  the  repeated  limits  may  be  reversed  without  altering  their 
values.  When  this  can  be  done,  we  have 


where  the  summations  are  taken  for  the  meshes  of  a  net  fitted  on  to  the  in- 
finite interval  (a,  oo  ). 

In  the  first  place  it  is  necessary  that  the  two  sums  should  not  be  divergent. 
The  following  theorem  will  be  established  : 

If,  for  the  function  f(x),  bounded  in  the  interval  (a,  oo  ),  a  system  of  nets 
fitted  on  to  the  interval  exists,  such  that  2  Bm(n}U(SmM),    I  8m(n)L(8mM)  both 

m  =  l  m=l 

exist,  as  definite  numbers,  £n,  S,l(  which  converge,  as  n~  x>  ,to  one  and  the  same 

.» 
number  A,  then       f(x)dx  exists,  and  its  value  is  A. 
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00 

From  the  condition  stated  in  the  theorem  it  follows  that  S  BmMF(8min)) 

ro-l 

exists,  and  converges  to  0,  as  n  ~  oo  .    It  will  be  shewn  that  this  condition  is 

rb 
sufficient  to  ensure  that  I   f(x)dx  exists  in  every  finite  interval  (a,  b),  where 

J  a 

b  >  a.  If  6  is  not  an  end-point  of  a  mesh  of  the  system,  we  may  modify  the 
system  of  nets  by  dividing  into  two  each  mesh  that  contains  the  point  6. 
The  first  sum  will  thereby  be  not  increased,  and  the  second  sum  will  not  be 

00 

diminished;  accordingly  the  condition  that  S   $m(n)  F  (8m(n})  exists  and  con- 

m-l 

verges  to  zero,  as  n  ~  oo  ,  will  still  hold  good.    It  then  follows  that,  when  we 

m  —  mn 

take  the  finite  sum    2   Bm(n}F(8m(n)),  where  6  is  an  end-point  of  Srn7J(n),  this  sum 
»-i 

rb 

will  converge  to  zero,  as  n~  oo  ;  and  thus       f(x)dx  exists  as  an  ^-integral. 

J  a 

Again  we  have,  for  any  finite  value  m,  of  m, 

m  =  m 

2  S 

m=l 

and  thus  <  77,  where  77  is  an  arbitrarily  chosen  positive  number,  and  n  is  chosen 

00 

so  large  that  S  <$m(n}  F  (Sm(n))  <  77.    The  number  m  may  be  chosen  so  large  that 

m  =  l 

BmMU(Sm(n])<'n;  hence 


for  m  >  m1}  some  fixed  integer. 

We  may  also  suppose  n  to  have  been  chosen  so  large  that 
^  8  (n]U(8  (n]}  -  A  <  v  • 

«    °m     t/  \Vm     )       -ci  ^  '/  ? 

in—  1 

r«+f 
we  see  then  that  I      f(x)dx  differs  from  A  by  less  than  877,  for  all  values  of 

J  a 

m 

|  which  are  equal  to  a  +  2  Bm("\  where  m  =  m^.    If  X  be  any  number  greater 

m=l 

than  the  least  of  these  values  of  £,  two  values  of  £  exist,  between  which  X 

fX 

lies.    If  £j  be  the  smaller  of  these,  I    /"(a?)  cte  lies  between  Ud  and  Zrf,  where 

-  fi 
rf  is  the  maximum  of  the  meshes  dn  of  D.n  ;  and  (7,  L  are  the  upper  and  lower 

•x 
boundaries  off(x)  in  (a,  oo  ).    It  now  follows  that      /"(*')  ^  differs  from  .4  by 
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less  than  3?;  +  £,  where  £  is  the  greater  of  the  two  numbers  Ud,  Ld.    Since 

rx 
77,  %  are  both  arbitrarily  small,  it  follows  that   I    /  (x)  dx  converges  to  A,  as 

J  a 
.« 

X  ~  oo  ;  and  thus  that  the  ^-integral  I    f  (x)  dx  exists. 

.'  « 

GO 

It  should  be  observed  that  the  convergence  of   2  Bm(n)  F  (8m(n))  to  zero,  as 

m=l 

n  «*  oo  ,  for  a  particular  system  of  nets,  is  not  by  itself  sufficient  to  ensure  the 

/•°°  rb 

existence  of  I    f(x)  dx,  but  only  that  of  I   f(x)  dx,  for  every  finite  value  of 

J  a  J  a 

b  (>  a\  In  this  respect  an  integral  (R)  over  an  infinite  interval  differs  from 
one  over  a  finite  interval  ;  since,  in  the  latter  case,  the  convergence  of  the 
finite  sum  ZS(n)  F  (8('l))  to  zero,  as  n~oo,  for  a  particular  system  of  nets,  is 
sufficient  to  ensure  the  existence  of  the  integral. 

356.    The  converse  theorem  will  now  be  proved,  that  : 

r°° 
If       f(x)  dx  have  a  definite  value,  a  particular  system  of  nets  can  always 

J  a 

00 

be  determined  for  (a,  oo  ),  such  that    2  $m(n)U(&m(n})  exists  for  each  net  Dn  of 

m—l 

the  system,  and  converges  to  the  value  of  the  integral,  as  n  ~  oc  . 

Let  us  consider  a  set  of  points  a,  x1}  xz,  .  .  .  xn,  .  .  .  which  diverges  as  n  ~~  oo  . 
A  net  can  be  determined  for  the  interval  (a,  tfj,  such  that 


similarly,  a  net  can  be  determined  for  (#1(  #,),  such  that 


and  generally,  a  net  can  be  determined  for  (#„_!,  xn),  such  that 


Thus  a  net  can  be  fitted  on  to  (a,  xn),  for  which 


Now  xn  can  be  taken  so  large  that 


f '*'"/•/        N       7  f*       "/ 

I     J  (X)  dx  —   I      f(L 

J  a  J  a 


<  rj ;  therefore  a 


n  =  m  r™ 

net  can  be  fitted  on  to  (a,  oo  ).  such  that    S  SU(8)  differs  from  I    f(x)  dx  by 

n  =  1  J  a 

less  than  an  arbitrarily  small  number,  for  all  sufficiently  large  values  of  m. 

,-  00 

Therefore    S  BU  (8)  converges  to  a  limit  which  differs  from        f(x)dx  by 
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not  more  than  e.  By  taking  a  monotone  sequence  of  values  of  e  which  con- 
verges to  zero,  we  obtain  a  system  of  nets,  fitted  on  to  (a,  oo  ),  such  as  is 
required. 

It  may  also  be  possible  to  define  a  system  of  nets  Dn  such  that 
taken  over  (a,  6),  does  not  converge,  as  b  ~  oo  ;  and  thus  the  theorem 


w~x  JM  =  I  n~<x>  n=\ 

only  holds,  provided  the  nets  are  such  that  the  sums  exist  for  each  net  of  the 
system,  from  and  after  some  fixed  value  of  n. 

It  can  also  be  shewn  that  : 


When  the  integral  I    f(x)  dx  has  a  definite  value,  then  for  any  system  of 

J  a 

CO 

nets  fitted  on  to  (a,  oo  )  which  is  such  that   S  8mMF(Sm(n])  exists,  and  converges 

m=l 

00 

to  zero,  as  n~<x> ,  the  set  of  numbers   2  &m(n)U(8m(n))  exists,  and  converges  to 

m  =  \ 

the  value  of  the  integral. 

Since,  for  every  value  of  m, 

m  m 

ra+S.Sm(")  m  m+2Sm(«)  m 

f(x)dx^  2«*W &•(«»«)*  I  f(x)dx  +  28m^F(SmW), 

J  et  J  a  1 

in 

we  see  that  the  number  2&m"l)  U(8m(n))  has,  as  m  ~  oo  ,  limits  of  indeterminancy 
between  [  f(x)dx  and  [    f(x}dx+  2  Sm(H) F  (Sm(n)).     Therefore  as  n  ~  oo , 

•J  a  J  a  m  =  l 

00  /•& 

'ZSm(n)U  (8m(n))  converges  to       f(x)dx.    For  a  system  of  nets  which  is  not 

J  a 

such  that 

2  8  (n)F(8  (n)} 

— '    um     -1    \um     ) 
m  =  l 

m 

converges  to  zero,    2  Sm(H}  U (&m{"})  does  not  in  general  converge,  as  m~  x . 

m  =  l 

rx  rb 

357.    The  definitions  of  I  f(x}dx,        f(x)  dx  may  be  extended  to  the  case 

J  a  J  —oo 

in  which  f(x)  has  points  of  infinite  discontinuity.    If  the  improper  integral 
rx 

1  f(x)  dx  exist  for  every  value  of  X  which  is  >  a,  and  if  it  converge  to  a 

J  a 

definite  limit,  as  X  increases  indefinitely,  then  that  limit  defines 

f  f(x}dx. 

J  a 

r        f* 

The  integrals     f(x)dx,  I      f(x)dx,  when  they  exist,  possess  many  of  the 

J  .r  J    —  m 
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properties  of  a  proper,  or  an  improper,  integral       f(x)  dx.    These  integrals  are 

J  a 

continuous  functions  of  the  finite  limit  x.    For 

rx  ra  rx 

I      f  (x)  dx=  I      f  (x)  dx  +  I   f(x)  dx, 

7—00  J  —  oo  J  a 

rx 

where  a  <  x\  and  since      f  (x)  dx  is  a  continuous  function  of  x,  so  also  is 

.  a 
rx 

f(x)  dx. 

J   —as 

rx  rx 

When  the  integral  I    f(x)dx  exists,  the  integral  I   f(x}dx  is  continuous 

J  a  J  a 

for  all  values  of  x  in  the  interval  (a,  oo  ),  including  x  =  oo  ,  as  it  is  there  con- 
tinuous in  the  extended  sense  of  the  term,  when  the  point  x  is  regarded  as 
belonging  to  the  domain. 

rx 

If  the  integral  I   f(x)  dx  exist  for  every  finite  value  of  x  in  the  interval 

J  a 

(a,  oo  ),  and  if  <f>  (x)  be  a  function  which  is  finite  and  continuous  for  every  such 
value  of  x,  and  be  such  that 


then,  provided  (/>  (x)  be  continuous  for  x  =  x  ,  the  function  f(x~)  is  integrable 

,00 

in  (a,  oo  ),  and  <£  (oo  )  —  $  (a)  =  I   f(x)  dx.    If  the  function  <£  (x)  have  a  deri- 

J  a 

vative,  say  D+(f>  (x),  which  is  integrable  in  every  interval  (a,  x),  of  (a,  oo  );  then 
if  the  integral  be  a  proper  one,  or  be  such  an  improper  one  that  the  relation 

rx 
(j)  (x)  —  (f)  (tt)  =       1)^  (f)  (x)  dx 

J  a 

subsists,  then,  provided  the  limit  <£  (oc  )  exist,  we  have  also 
<6  (oo  )  —  0  (a)  =  I    D+<j)  (x)  dx. 

J  a 

A  similar  statement  applies  to  each  of  the  other  derivatives  of  </>  (x).    In  the 

r  x 

case  in  which  <£  (x)  -  $  (a)  differs  from      D+f  (x)  dx  by  an  integrable  null- 

J  a 

function,  this  holds  also  for  the  limit  x  =  oo  . 


358.     An    integral       f(x)dx   is    said  to   be  absolutely  convergent   when 

.'  a 

r* 

the  integral  |    ,  f(x)  j  dx  exists;    otherwise  it  is  said  to  be  conditionally  or 

•'  a  .« 

non-absolutely  convergent.    If  /    j /(•*')  j  dx  exists,  then  also  |    f(x)  dx  exists;  for 

J  a  .-a 

f(x)  dx    =         f(x)\  dx,  and  hence  the  convergence  of  the  latter  integral 

J  .r, 


x\ 
follows  from  that  of  the  former  one. 
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If*f(x)  andf(x)  <j>  (x)  be  both  integrable  in  every  interval  (a,  x),  contained  in 

.00 

(a,  oo  ),  and  if     f(x)  doc  be  absolutely  convergent,  and  if  <f>(x)  be,  from  and 

J  a 

after  some  faced  value  of  x,  numerically  less  than  some  fixed  number,  then  the 

,00 

integral  I  f(x)  <£  (x)  dx  exists,  and  is  absolutely  convergent. 

J  a 

,*> 

For,  since  I  f(x)  dx  is  absolutely  convergent,  we  can  find,  corresponding 

J  a 

rf+h 
to  a  fixed  positive  number  cr,  a  number  £  >  a,  such  that          \f(x)  \  dx<v,  for 

all  positive  values  of  h;  we  have  then 

r£+h  r£+h 

I       f(x)$(x)dx    £K\      \f(x)\dx^Ka, 

where  K  is  the  upper  boundary  of    0  (x)  ,  and  is  by  hypothesis  finite.    It  is 

,00 

thus  seen  that  I    f  (x)  <£  (x)  is  convergent.    Also  since 

J  a 

f(x)  \  dx  ^  Ka, 


we  see  that  the  convergence  is  absolute. 

359.    An  important  set  of  tests  of  the  absolute  convergence  of  an  integral 

0 

f(x)  dx  is  the  following  : 


If  f(x)  be  integrable  in  every  interval  (a,  x},  then  \  f  (x)  dx  converges  to 

J  a 

a  definite  finite  value,  provided  f(x)  converge  to  zero,  as  x  is  increased  indefi- 
nitely, in  such  a  manner  that  one  of  the  expressions  f  (x} .  x1+k,f(x)  x  (log  x)l+k, 

f  (x)  x  log  x  (log  log  x)l+k, ......  f  (x)  x  log  x  .  log  log  x (log  log  ...  log  x)1+k, 

converges  to  zerd,  as  x  is  indefinitely  increased,  k  denoting  some  fixed  number 
greater  than  zero. 


The  integral       f(%)  dx  is  not  convergent,  in  case  f(x)  be  of  invariable  sign, 

J  a 

from  and  after  some  fixed  value  of  x,  and  provided  also  any  one  of  the  above 
expressions  remains  numerically  greater  than  some  fixed  number  (>  0),  as  x  is 
increased  indefinitely,  when  k  has  the  value  zero. 

We  see  that,  in  the  first  case,  I        \f(x}  \  ax  is  numerically  less  than  one 
of  the  expressions 

fx+h  dx      n[x+h__da^_        n  [X+h  dx 

J  x     M1+k  '       J  A'     &  (log  x)l+k  '       J  x      x  log  x  (log  log  x)^+k ' 

*  Kiemann's  Ges.  Werke,  p.  229 ;  also  Pringsheim,  Muth.  Annalen,  vol.  xxxvn,  p.  591. 
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where  G  is  a  constant,  dependent  on  X,  which  converges  to  zero  as  X  is 
indefinitely  increased.    These  expressions  have  the  values 

j_.     ii   G[ 


k  \_Xk     (X+h}k\  '    k  \_(\ogX)k 


k  [(log  log  X)k     (log  \ogX  + 
hence,  k  being  positive,  it  is  clear  that  X  may  be  so  chosen  that 

•X+h 

\f(x}\dx 
is  less  than  an  arbitrarily  fixed  number,  and  thus 


f(x)  dx 
is  convergent.    In  the  second  case,  k  being  now  zero,  we  .see  that 


X+h 


is  numerically  greater  than  one  of  the  expressions 


c 


x+h 


,--j/r  (™ 

I         (ix  n  ax  ,f 

'  ^  — i >     '-' 

Jx      ^           Jx      aslogx 


dx 


I  x      x  log  x  '        J  x     x  log  x  log  log  x ' 
or  than  one  of 


and  these  expressions  increase  indefinitely,  as  h  is  increased.    It  follows  that 

r00  . 

I   /  (x)  dx  is  in  this  case  divergent. 
J  a 

CHANGE    OF   THE    VARIABLE    IN    A    SINGLE    INTEGRAL. 

360.  Let  f(x)  be  a  function  that  is  bounded  in  the  interval  (a,  b).  We 
now  assume  that  x  is  a  continuous  monotone  function,  ^  (£),  of  another  vari- 
able £,  defined  for  an  interval  (a,  /3),  of  £,  and  such  that  a  =  ty  (a),  b  —  ^  (/3). 
The  following  theorem  will  be  established: 

7/  jD^r(£),  o/;e  q/  ^Ae  derivatives  of  ty(j;),  be  integrable  (R)  in  the  interval 
(a,  /3),  then 


\f(.c)dx=\  f{+(&}Dlr(ftdZ,  and      f  (*)d* 

J  a  Jo.  -'a 

Let  a  system  of  nets  (Z)nj  be  fitted  on  to  the  interval  (a,  /3).    Since  the 

•\W£  )  —  -Wr(A  ) 
incrementary  ratio  —  -^-  —  ^-^  —  has,  in  any  interval,  the  same  upper  and  lower 

~ 


boundaries  as  _D-^(f)  in  the  same  interval  (see  §  280),  to  a  mesh  B{n)  of  Z)n 
there   corresponds  a   mesh  8(ra),  of  a  net  Z)n,   fitted  on  to  (a,  b),  such  that 
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S(n)  £  k8(n),  where  k  is  the  upper  boundary  of  Dty  (£)  in  (a,  ft).  If  u(n)  denote 
the  upper  boundary  of  fty  (y)}  Dty(y)  in  the  interval  8(n\  we  see  that  u(n) 
lies  in  the  interval  bounded  by  U(8(n})uM,  and  L(8(n))l(n),  where  w(n>,  J(n) 
are  the  upper,  and  the  lower,  boundaries  of  D-fy  (f)  in  the  mesh  8(n>.  Also  8(n} 
lies  in  the  interval  bounded  by  UM  S(n)  and  l(n)  8(n];  we  may  therefore  write 

S    («)  _  5    (w)   U,    (n)  _  $    (n)  /,.     (n)  _  /    (n)\)    wV,prf>  0  <  /9    («)  <  1 
Om       —  om       [Mm  c/m       ^M-m  (jm     ^j,  wuere  V  =  um    '  =  1. 

We  have  then 


=  2 


the  summations  being  taken  for  the  mn  meshes  of  the  net  Dn,  or  of  the  net  Dn. 
Now  we  have 

2  S(w)  [u^  U  (BM)  -  u(n}]  £  2  8(n)  U(B(n))  (u™  -  Z<»>)  ^  [72  8( 
and  2  <9(?"  8(n  (u(n)  -  l(n))  U(8(n})  £  2  S(n}  U(8(n))  (u(n]  -  l(n]} 

Since  Dty(%)  is  integrable  (R)  in  (a,  ft),  it  follows  that  2  8M(uM  -  1M)  is 
arbitrarily  small,  when  n  is  sufficiently  increased;  it  thus  appears  that 


is  arbitrarily  small.    Therefore  as  n  ~  oo  ,  the  two  sums 


_ 

5,      X    (n)  TT(X    (»)\          V      S    (n)  ,-7    (n) 
•«       Om        L/  ^Owl       ^,          A       Omv       ftmv  ' 

>n  =  1  m  =  1 

converge  to  the  same  limit.    Thus  the  theorem  is  proved  for  the  case  of  the 
upper  integrals;   for  the  lower  integrals  it  can  be  proved  in  a  similar  manner. 

We  deduce  at  once  the  following  theorem  for  the  transformation  of  a 
single  integral: 

If  D-fy(£),  one  of  the  derivatives  of  the  monotone  function  ^(%),  be  inte- 
grable (R)  in  the  interval  (a,  ft),  of  %,  and  if  either  of  the  R-integrals 


exists,  the  other  also  exists,  and  they  have  the  same  value;  f(x)  being  a  bounded 
function. 

Since  ^r(^)  has  a  finite  differential  coefficient  A//  (£),  almost  everywhere 
in  the  interval  (a,  ft),  and  ty'  (£)  is  integrable  (R)  in  the  interval,  provided 
one  of  the  derivatives  DT/T(|)  is  integrable  (R),  we  have 


provided  either  of  the  integrals  is  known  to  exist  as  an  ^-integral. 

In  case  ty  (£),  although  not  monotone,  is  such  that  it  is  monotone  in  each 
of  a  finite  number  of  intervals  (a,  7^,  (7^  7.,),  .  .  .  (<yn,  ft),  where  7,,  7.,,  ...  do  not 
necessarily  all  lie  between  a  and  ft,  these  intervals  of  £  may  be  considered 
separately,  and  the  above  result  can  be  consequently  extended  to  such  a  case. 
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The  following  theorem*  for  the  transformation  of  a  single  -R-integral  is 
a  particular  case  of  the  above  theorem: 

If  the  function  f(x)  is  integrable  (R)  in  (a,  b),  and  if  a  monotone  function 

rf 
ty  (£)  is  defined  by  x  =  -fy-  (f )  =  c  +  I    </>(£)  c££,  where  ty  (f )  varies  from  a  to  /3 


as  x  varies  from  a  to  b,  and  <£(£)  has  an  R-integral  in  (a,  /3),  then 


fV  («)*?-/*/{*  (£)}*(*)«*£• 

.'a  ^  a 


361.  It  may  happen  that,  when  -fy  (£)  is  a  monotone  function  of  £,  the 
interval  (a,  6),  of  #,  corresponds  to  the  infinite  interval  (a,  oo  ),  of  £.  We  now 
assume  that  ^r  (oo  )  is  the  limit  of  ^  (£),  as  f  ~  x>  ,  i.e.  that  i/r  (£)  is  continuous 
at  oo  .  If  the  conditions  of  the  first  theorem  above  are  satisfied  for  every 
interval  (a,  b  —  e),  of  x,  with  the  corresponding  interval  (a,  /3'),  of  £,  we  have 

~ 


Since  this  holds  for  every  value  of  e,  we  have,  on  proceeding  to  the  limit  e  =  0, 


/  <*>«** 

in  accordance  with  the  definition  in  §  355,  of  the  integral  on  the  right-hand 
side.  Similar  considerations  apply  to  the  case  in  which  the  value  a  of  x 

corresponds  to  £  =  —  20  . 

rb 
If  it  be  desired  to  transform  the  integral   I    f  (x}  dx,  by  means  of  the 

J  a 

relation  y=$  (x),  where  (f)  (x}  is  a  single-valued  function  of  x,  then,  unless 
<j>(x)  be  monotone  in  the  interval  (a,  b),  the  inverse  function  Dfy(y}  will 
not  be  everywhere  single  -valued. 

If  it  be  assumed  that  $  (x)  is  monotone  in   (a,  b),  and   that  a  =  0  (a), 
fi  =  (j}(l>')>  a  derivative  D$(x\  of  0  (x),  is  reciprocal  to  a  derivative 


of  "^(y).    If  it  be  assumed  that  is   integrable    in  (a,  /9),  and    that 

JL/ro  \OC  ) 

f(x) 
the  same  holds  for  -:        -  considered  as  a  function  of  •?/,  or  else  that  fix} 


is  integrable  in  (a,  b),  then  we  may  use  the  transformation 

dy. 
* 


f6 
If  <f>  (x)  be  not  monotone,       f(x)  dx  cannot  in  general  be  transformed  into 

J  a 

a  single  integral  in  y.    If,  for  example,  <£ (x)  increases  from  x=a  to  x  =  k, 
and  then  diminishes  from  x  —  k  to  x  =  b,  we  must  take 

J    J  (.3- 


Lebesgue,  Annales  de  Toulouse,  (3),  vol.  i,  p.  44. 


360-362]        Change  of  the  variable  in  a  single  integral  470 

and  the  integrals  on  the  right-hand  side  cannot  in  general  be  combined  into 
one  integral  through  the  interval  (a,  £),  because  in  the  two  integrals  the 
integrand  has  different  values  for  the  same  value  of  y. 

Thus,  for  example,  if  y  =  sin  x, 


1< 


j  .  , 

-  dy  +     -  ,          -  ay, 


, 
Vl  -  y2  Jo       VI  -  f 

the  value  of  cos  x  in  the  second  integral  on  the  right-hand  side  of  the  first 
equation  being  negative,  and  the  values  of  sin"1  y,  cos"1  y  being  in  the 
interval  (0,  \TT}: 


REPEATED   INTEGRALS. 

362.  The  actual  evaluation  of  a  double  integral  over  the  fundamental 
rectangle,  of  which  the  sides  are  x  =  #„,  x  —  xlt  y  =  y0,  y  =  y\,  is  usually  made 
to  depend  upon  the  evaluation  of  successive  single  integrals  taken,  first  with 
respect  to  one  of  the  variables,  and  then  with  respect  to  the  other.  The 
expression 


fxi       [v\  f&i  [vi 

I    dx\    f(®,y)dy,  or  f(x,y)dydx, 

•    *o          J   2/t,  •>  XQ    •>  ?/o 

in  which  f(x,  y}  is  supposed  to  be  integrated  first  with  respect  to  y,  for 
a  constant  value  of  x,  and  then  with  respect  to  x,  is  called  a  repeated 
integral.  Similarly,  the  expression 


in  which  the  integrations  are  performed  in  the  reverse  order,  is  called  a 
repeated  integral.  The  question  of  the  existence  of  these  repeated  integrals, 
and,  in  any  given  case,  their  relation  with  one  another,  and  with  the  double 
integral,  will  be  here  investigated.  It  will  be  observed  that  the  double 
integral  has  been  defined  as  a  single  limit;  whereas  the  repeated  integrals, 
when  they  exist,  are  each  obtained  as  the  results  of  repeated  limits.  We 
have  then  to  investigate  whether,  or  under  what  conditions,  a  double  integral 
is  capable  of  representation  as  a  repeated  limit,  of  one  of  the  forms  indi- 
cated. It  cannot  be  assumed  a  priori  that  the  existence  of  the  double 
integral  necessarily  implies  the  existence,  for  each  value  of  x,  of  the  single 
integral 

-'1 

f(x,  y}dy 
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as  a  definite  number.  Neither  is  the  existence  of  this  single  integral,  as  a 
definite  number,  necessary  for  the  existence,  as  a  definite  number,  of  the 
repeated  integral 


/fa  y)  rfy. 

In  fact,  if  we  assume  that  the  upper  and  lower  integrals 
I    f(x,y)dy,    \f(x,y}dy 

J  2/u  _£_.Vo 

have  different  values  for  some  of  the  values  of  x,  it  may  happen  that  the  two 
repeated  limits 


2/0 

have  identical  values. 


ryi 
das\    f(x,y)dy 

J    iln 


The  repeated   integral  will   consequently  be  regarded,  in   this   case,  as 
existing;  and  thus  it  may  be  defined  as 


h        ri/i 
das  I    /fay)%, 

»o         i  2/o 


where  the  upper  or  lower  integral  with   respect  to  y  is   to   be   taken  in- 
differently, provided  the  repeated  limit  exists  as  a  definite  number. 


rv\       r#i 

In  a  similar  manner  I     dy  I     f  (x,  y)  dx, 

•J  2/0       I  *o 


when  it  has  a  definite  value  independent  of  whether  the  upper  or  the  lower 
integral  with  respect  to  x  be  used,  will  be  regarded  as  the  repeated  integral, 
first  with  respect  to  x  and  then  with  respect  to  y. 

363.  It  was  first  established  by  P.  Du  Bois  Reymond*  that,  when  the 
limited  function  f(x,  y}  has  a  double  ^-integral  in  the  fundamental  rectangle, 
then  the  two  repeated  integrals  exist,  and  are  each  equal  to  the  double 
integral.  We  shall  first  give  a  proof  f  of  this  theorem  which  exhibits  its 
relation  with  the  theory  of  sets  of  points. 

The  following  preliminary  theorem  will  first  be  established : 


If  I  f(x,  y)  d(x,  y},  taken  over  the  fundamental  rectangle,  have  a  definite 
value,  then  the  values  of  x  for  which  the  single  integral  I  f(x,  y)  dy,  taken  with 


x  constant,  has  a  definite  value,  define  a  set  of  points  on  a  side  of  the  rectangle, 
of  linear  measure  equal  to  the  length,  of  that  side. 

*  Crelle's  Journal,  vol.  xciv,  1883,  p.  277. 

f  The  investigation  is  founded  on  that  of  Schoenllies,  Bericht,  vol.  i,  p.  193. 
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It  follows  from  this  theorem,  that  the  set  is  everywhere  dense  in  the 
interval,  and  of  cardinal  number  c.  Moreover,  the  points  at  which 

jf(x)y)dy,     lf(x,y)dy 

differ  from  one  another  form  a  set  of  measure  zero. 

On  the  assumption  of  the  existence  of  the  double  integral,  the  set  K  of  all 
the  points  at  which  the  saltus  of  f  (x,  y)  is  =  k,  where  k  is  an  arbitrarily  chosen 
positive  number,  is  a  closed  set,  of  plane  content  zero.  If  a  straight  line  be 
drawn  parallel  to  the  y-axis  through  the  point  x  of  the  side  of  the  rectangle, 
then  the  component  of  K  on  this  straight  line  will  be  denoted  by  Kx,  and  its 
linear  content  by  /  (Kx).  It  has  been  shewn  in  §  143  that,  cr  denoting  a 
prescribed  positive  number,  the  linear  content  of  that  set  of  points  x,  on  the 
side  of  the  rectangle,  for  which  /  (Kx)  ^  <r,  is  zero  ;  and  thus  that  /  (Kx), 
considered  as  a  function  of  x,  is  an  integrable  null-function,  for  each  value 
of  k.  The  function  %  (#)>  =  lim  /  (Kx),  is  also  an  integrable  null-function: 

k~0 

for  the  set  of  points  at  which  ^  (x)  does  not  vanish  is  made  up  of  those  sets 
of  points  at  which  I  (Kx^\  I(KX®\  ...  I(K^\  ...  do  not  vanish;  where 
K(l\  K(2\  ...  K(n},  ...  correspond  to  a  diminishing  sequence  of  values  of  k 
converging  to  the  limit  zero  :  and  since  each  of  these  sets  has  zero  measure, 
it  follows  that  the  set  of  points  at  which  ^  (x)  does  not  vanish  has  zero  measure. 
At  any  point  x1}  at  which  %  (x)  vanishes,  /  (KXl)  vanishes,  for  every  value  of  k. 

It  should  be  observed  that,  at  a  point  of  Kx,  it  is  not  necessarily  the  case 
that  /  (x,  y},  considered  as  a  function  of  y,  with  x  constant,  has  its  saltus  ^  k  ; 
in  fact,  this  saltus  may  be  less  than  k,  or  may  be  zero.  However,  all  the 
points  at  which  the  saltus  of  f(x,  y\  taken  with  x  constant,  is  ^  k,  are 
certainly  included  in  the  set  Kx. 

For  any  fixed  valuj  of  x,  the  upper  and  lower  integrals 


f 

f(x,y}dy,    ]f(x,y)dy 


both  exist,  and  the  two  have  equal  values  at  any  point  of  the  everywhere 
dense  set  of  points  x,  at  which  ^  (x)  vanishes.  The  preliminary  theorem 
above  stated  has  thus  been  established. 

Let  F(x)  denote    I  f(x,  y)  dy,  where  F  (x)  consequently  has  a  single 
determinate  value  at  each  point  x,  at  which 


T 

I 


f(x,  y}  dy 

are  equal.    At  any  point  of  that  set,  of  measure  zero,  at  which 

\f(x,y)dy,     \f(x,y)dy 

J  J 

H.  31 
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have  different  values,  F  (x)  is  regarded  as  indeterminate  ;  and  the  upper  and 
lower  integrals  are  the  upper  and  lower  limits  of  indeterminancy.  It  will 
now  be  shewn  that  the  function  F  (x},  so  defined,  is  an  integrable  function. 

Let  x  be  a  point  on  the  side  of  the  rectangle  (0,  0;  1,  1),  such  that  the  compo- 
nent Ky>,  of  K,  on  the  line  x=x\  has  content  <  o-.  A  finite  number  of  intervals 
el,  62,  ...  em  can  be  determined  on  the  straight  line  x  =  x',  neither  abutting 
on,  nor  overlapping,  one  another,  such  that  their  sum  ^  +  e2  +  .  .  .  +  em  >  b  —  <r, 
where  b  is  the  length  of  the  side  of  the  rectangle  parallel  to  the  y-axis,  and 
such  also  as  to  contain,  in  their  interiors  and  at  their  ends,  no  points  at  which 
the  saltus  off(x,  y)  is  =  k.  For  each  point  of  one  of  these  intervals  e  there 
exists  a  rectangle,  with  the  point  at  the  centre,  such  that  the  fluctuation 
off(x,  y)  in  that  rectangle  is  <  k.  The  breadths  of  these  rectangles  for  all 
points  of  e  must  have  a  finite  minimum,  for  otherwise  there  would  exist  a.  point 
of  e  which  would  belong  to  K.  It  follows  that,  for  the  point  x,  an  interval 
(x  —  a,  x  +  /3)  can  be  determined,  such  that  the  straight  lines  x  =  x'  —  a, 
x  =  x  +  j3  intersect  all  the  rectangles  corresponding  to  all  the  points  of  the 
intervals  e1}  e.2,  ...  em.  If  Xj  ,  x2  be  any  two  points  in  the  interval  (x  —  a,  x'  +  0), 

we  have  F  (a-,)  -  F  (a?a)  \<bk+tr(U-  L). 

Now  a  finite  number  of  separate  intervals  81}  82,  ...  £,.  can  be  determined 
on  the  side  x  of  the  rectangle  (length  =  a),  such  that  S:  +  82  +  .  .  .  +  8r  >  a  —  ij, 
where  rj  is  a  prescribed  positive  number,  and  such  that  each  point  of  each  of 
the  intervals  8  is  a  point  x',  for  which  an  interval  (x  —  a,x'  +  /3)  can  be  deter- 
mined as  above.  By  applying  the  Heine-Borel  theorem  we  see  that 


will  all  be  covered  by  a  finite  number  of  these  intervals  (x  —  a,  x  +  /3).  It 
thus  appears  that  the  a?-side  of  the  rectangle  can  be  divided  into  a  finite 
number  of  parts  9 


T!  ,  T2  ,   ...   Tp, 

and  Xi  ,  \2  ,  ...  \q  , 

such  that  TI  +  T2  +  .  .  .  +  Tp  >  a  —  77, 

and  \  +  \2+  ...  +  \q<ij, 

so  that  the  fluctuation  of  F  (x)  in  any  one  of  the  parts  T  is 

<bk  +  <r(U-L). 
Let  k=e/2b,  o-  =  e/2(U-L); 

then  we  see  that  F  (x}  is  such  that  the  o;-side  of  the  fundamental  rectangle 
can  be  divided  into  a  finite  number  of  parts,  such  that  the  sum  of  those  parts, 
in  which  the  fluctuation  of  F  (x)  is  =  e,  is  less  than  the  arbitrarily  chosen 
number  77.  It  follows  that  F(x)  is  integrable  along  the  side  of  the  rectangle. 
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It  has  now  been  shewn,  on  the  assumption  of  the  existence  of  the  double 
integral,  that  the  repeated  integral 


/*  f       f 

I  F(x)  das,  or    \dx    f(x,  y)  dy, 

J  J  J 


taken  through  the  fundamental  rectangle,  has  a  definite  value.  Moreover, 
this  value  is  equal  to  that  of  the  double  integral.  For,  let  the  fundamental 
rectangle  be  divided  up  by  means  of  straight  lines  parallel  to  the  ?/-axis, 
through  the  end-points  of  each  interval  of  the  two  finite  sets  {T}  and  {A,}.  Any 
one  of  the  rectangles  so  constructed,  with  T  as  base  and  with  height  b,  can  be 
divided  into  parts,  by  means  of  straight  lines  parallel  to  the  #-axis,  such  that, 
in  each  one  of  a  number  of  these  parts  the  sum  of  whose  heights  is  >  b  —  a-, 
the  fluctuation  of  f(x,  y}  is  <  k.  The  fundamental  rectangle  has  now  been 
divided  into  a  finite  number  of  parts,  such  that  the  sum  of  the  products  of  each 
part,  multiplied  by  the  upper  boundary  of  f(x,  y}  in  that  part,  exceeds  the  sum 
of  the  products  of  each  part,  multiplied  by  the  lower  boundary  of  the  function 
in  that  part,  by  less  than  abk  +  (acr  +  by)  (  U—  L],  which  is  arbitrarily  small. 

f    T  r 

Also  I  dx  J  f(x,  y}  dy,  and  J  f(x,  y}  d  (x,  y), 

both  lie  between  the  two  sums  of  products,  and  therefore  differ  from  one 
another  by  less  than  abk  +  (a<r  +  bn)  (  U  —  L).  The  equality  of  the  double 
integral  and  the  repeated  integral  is  thus  put  in  evidence  by  the  mode  of 
sub-division  of  the  rectangle  which  has  been  adopted. 

Similar  reasoning  applies  to  the  repeated  integral  in  which  the  integration 
is  taken  first  with  respect  to  x,  and  then  with  respect  to  y. 

It  has  thus  been  established  that,  if  the  double  integral  through  the  funda- 
mental rectangle  exist,  then  the  two  repeated  integrals  also  exist,  and  are  each 
equal  to  the  double  integral. 

All  the  points  at  which  ^  (x)  vanishes  are  points  of  continuity  of  the 
function  F  (x)  ;  but  there  may  also  be  other  points  at  which  F  (x)  is  con- 
tinuous; because  the  existence  of  a  saltus  of  f(x,  y)  at  a  point  (x,  y)  is 
consistent  with  f(x,  y}  being  continuous  with  respect  to  x,  and  also  with 
respect  to  y,  at  the  point. 

The  function  F(x]  may  be  replaced  by  -fy  (x),  the  most  nearly  continuous 
function  related  to  it  (see  §  242).  We  thus  have 


I  f(x>  y}  d  (%>  y} =  \  ty  (x) ax- 


If  it  be  assumed  that  the  set  of  points  G,  for  which  f(x,  y}  is  defined, 
is  measurable  in  accordance  with  Jordan's  definition  of  a  measurable  set  (see 
§  142),  then  the  double  integral  of  the  bounded  function  may  be  replaced  by  that 
of  the  function  f  (x,  y},  defined  for  all  points  of  a  rectangle  which  contains  G, 

31—2 
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by  the  convention  that/(#,  y)  shall  vanish  at  all  those  points  of  the  rectangle 
which  do  not  belong  to  G.  If  the  set  G  be  such  that  each  straight  line 
parallel  to  the  y-axis  contains  points  of  G  which  fill  up  a  finite,  or  an 
indefinitely  great  number  of  continuous  intervals,  or  more  generally,  if  the  set 
of  such  points,  for  each  value  of  x,  be  linearly  measurable  (  J),  then  the  integral 

f(x,  y}  dy, 

taken  along  the  whole  segment  of  the  line  between  the  sides  of  the  rectangle, 
may  be  replaced  by  the  same  integral  taken  through  the  component  of  G  on 
the  same  segment.  In  particular,  if  the  points  of  G  on  the  straight  line 
through  the  point  x  consist  of  all  the  points  in  the  linear  interval 


we  may  replace  \  f(x> 

/•/*<*) 

by  f(x,y}dy; 


and  therefore  in  this  case, 

r  rxi        rA(x) 

f(x,  y}  d  (x,  y)=\     das  \        f(x,  y}  dy. 

JO  -'*„  JyiW 

364.  A  simple  proof*  of  the  fundamental  theorem  of  §  363  will  be  given, 
which  depends  upon  the  fact  that,  for  any  bounded  function  /(as,  y},  if  the 
operation  of  taking  the  upper  integral  first  with  respect  to  y,  and  then  with 
respect  to  #,  be  performed,  the  result  cannot  exceed  the  upper  double  integral  ; 
and  that,  similarly,  the  result  of  successively  taking  the  lower  integrals  with 
respect  to  x  and  to  y  cannot  be  less  than  the  lower  double  integral  :  thus 

f  f    f  T   T 

/(«,  y)  d  (x,  y}^\dx\  f(x,  y}  dy  £\dx\  f(x,  y}  dy 

J  J  J  J  J 

=  J  /0<  y)  d  (x>  y\ 

the  integrals  being  all  taken  over  the  fundamental  rectangle. 
If  f{x,  y}  be  integrable,  so  that 


f(x,  y}  d  (x,  y)  =  j  f(x,  y)  d  (x,  y), 

*  This  method  of  proof  was  first  employed  by  Harnack;  see  his  edition  of  Serret's  Differential 
and  Integral  Calculus,  p.  282.  Other  proofs  of  this  kind  have  been  given  by  Arzela,  Mem.  dell'  1st. 
di  Bologna,  ser.  5,  vol.  n,  p.  123;  by  Jordan,  Liouville's  Journal,  ser.  4,  vol.  vni,  p.  84,  or  Court 
d' Analyse,  vol.  i,  p.  42;  also  by  Pringsheim,  Sitzungsberichte  d.  Munch.  Akad.,  vol.fxxvm,  p.  59, 
and  vol.  xxix,  p.  39.  See  also  Pierpont's  paper  "On  multiple  integrals,"  Trans.  Amer.  Math. 
Soc.,  vol.  vi,  1905,  where  a  proof  of  this  character  for  multiple  integrals  is  given. 
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it  follows  that 

j  dx  j/0,  y )  dy = j  dx  j  f(x,  y)  dy 

=  jdx  \f(x,  y)  dy  =  J  dx  f  f(as,  y)  dy, 
and  thus  that  the  repeated  integral 

J  dx  j  f(x,  y)  dy 

has  a  definite  value  equal  to  the  double  integral. 

To  prove  this,  let  the  rectangle  be  divided  into  a  number  of  parts  8,  by 
means  of  straight  lines  parallel  to  the  sides.  Since  the  double  integral  is 
assumed  to  exist,  this  may  be  done  in  such  a  manner  that,  e  denoting  an 
arbitrarily  chosen  positive  number,  the  conditions 


are  satisfied,  where  the  summation  2  is  taken  for  all  the  rectangles  8,  and 
U(8),  L  (8)  denote  the  upper  and  lower  boundaries  off(x,  y)  in  a  rectangle  8. 
Now,  if  we  take  the  upper  and  lower  -integrals  of  f(x,  y}  along  a  straight  line 
parallel  to  the  y-axis,  we  have 


where  the  summation  Sa  refers  to  all  those  rectangles  8  which  are  intersected 
by  the  straight  line  along  which  the  upper  and  lower  integrals  are  taken;  and 
in  the  case  when  that  straight  line  is  along  one  or  more  boundaries  of  the 
rectangles  8,  %  refers  to  all  the  rectangles  on  one  side  of  that  line  :  also  8' 
denotes  that  interval  along  the  line  of  integration  which  is  in  the  rectangle  8. 
It  follows  that 


f  da  If  (a, 

j        j 


{8V  (8)}  £    f(x,  y}  d  (a,  y}  +  e 

j 

and  (fo/(0,y)<fyS;2{8£(8)}g  \  f(x,y}d(x,y)-€- 

and  since  these  inequalities  hold  for  every  value  of  e,  we  have 
J  das  I  f(x,  y}  dy  £  j  f(x,  y)  d  (x,  y}, 

j  dx  j  f(x,  y)dy&l  f(x,  y}  d  (x,  y}  ; 


and  thus  the  theorem  is  established. 
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365.  The  converse  questions  now  arise  whether,  from  the  existence  of  one 
of  the  repeated  integrals,  or  from  the  existence  and  equality  of  both  repeated 
integrals,  that  of  the  double  integral  can  be  inferred.  The  answer  to  both 
questions  must  be  in  the  negative.  Continuity  of  a  function  f(x,  y)  with 
respect  to  x  and  y  separately  does  not  necessarily  imply  continuity  with  respect 
to  (x,  y) ;  moreover,  the  saltus  of  the  function  at  a  point  with  respect  to  x, 
when  y  has  a  constant  value,  or  with  respect  to  y,  when  x  has  a  constant  value, 
is  not  necessarily  equal  to  the  saltus  of  the  function  with  respect  to  (x,  y\  It 
may  happen  that  the  component  of  K  on  a  straight  line  parallel  to  one  of  the 
axes  may  consist  of  points,  some  or  all  of  which  are  points  of  continuity  of  the 
function  when  considered  as  a  function  of  one  variable  on  that  straight  line. 
Thus  K  may  have  a  plane  content  greater  than  zero;  and  yet  the  linear  content 
of  the  points  on  all  straight  lines  parallel  to  the  axes,  at  which  the  linear  saltus 
of  the  function  is  ^  k,  may  be  zero.  Hence  either,  or  both,  of  the  repeated 
integrals  may  exist,  whilst  for  values  of  k,  the  sets  K  are  not  of  content  zero* ; 
and  therefore  whilst /(#,  y}  does  not  admit  of  a  double  ,R-integral. 

EXAMPLES. 

It.    For  the  rectangle  bounded  by  ,v  =  0,  x=\,  y  =  0,   ?/=!,  let  f(x,y}  —  \,  for   all 
rational  values  of  #,  and  /(#,  y)=2y,  for  all  irrational  values  of  x.      We  have  then 


f(x,  y)dy  =  l,  whatever  value  x  may  have;  and  hence  the  repeated  integral 

1-1         r  i 

dx       f(x,y}dy 
J  o        Jo 

has  the  value  1  :  but  the  double  integral  does  not  exist,  since  I  (K)>  0,  for  any  value  of  k 
in  the  interval  (0,  1). 

2J.  Let  x  be  represented  by  a  finite  or  infinite  decimal,  excluding  those  decimals  in 
which  every  figure  from  and  after  some  fixed  place  is  9.  Let  px  denote  the  number  of 
decimal  places  in  the  representation  of  x  in  the  manner  described.  Let  y  be  represented 
in  a  similar  mariner,  with  a  corresponding  definition  of  pu.  Let  the  function  f(xty)  be 

defined  in  the  rectangle  bounded  by  x  =  Q,  #=1,  .y  =  0,  y  =  I,  by  f(x,y}  = - 

when  px  and  pv  are  both  finite ;  otherwise  let  f  (#,  ?/)  =  0. 

ri      i 

We  have  I        — -  dy  =  Q;  for  there  is  only  a  finite  number  of  values  of  y,  in  (0,  1),  for 
J  o  Pi/  + 1 

which  py  is  less  than  an  arbitrarily  chosen  fixed  integer,  or is  greater  than  an 


arbitrarily  chosen  fixed  proper  fraction.     The  function  f(x,y}  vanishes,  except  when  one 
at  least  of  x  and  y  is  representable  by  a  finite  decimal ;  and  thus  the  double  integral 

/  (x,  y}  d  (x,  //)  vanishes. 

*  An  incorrect  theorem,  relating  to  this  point,  has  been  given  by  Schoenflies,  see  his  Bericht, 
vol.  i,  p.  197.  In  this  theorem,  the  condition  that  A"  should  be  closed  is  stated  to  be  the  condition 
for  the  existence  of  the  double  integral.  If,  however,  K  were  not  closed,  it  could  not  represent 
the  set  of  points  at  which  any  function  had  a  saltus  >  k.  The  examples  given  by  Schoenflies  do 
not  in  reality  accord  with  his  theorem. 

f  Thomae,  Schlnmilch's  Zeitschrift,  vol.  xxin,  p.  67. 

J  Pringsheim,  Sitzungsber.  d.  Leipziger  Akad.,  vol.  xxvm,  p.  71. 
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T"1  i          f1 

Now  j    f(x,y}dy  =  -—l,        /    /(#,  y)  dy  =  0  ; 

and  thus  I    f(x^y}dy  has  no  definite  value  for  any  value  x  of  the  everywhere  dense 

Jo 
enumerable  set  of  points  for  which  px  is  finite. 

Nevertheless  I    dx  \    /(#,#)«?y»»0=  //(#,«)  d(#,y). 

J  o       2° 

3*.   With  the  same  notation  as  in  the  last  example,  let  /(#,  2/)=0,  when  px,  py  are 
both  finite,  or  both  infinite;  let  f(x,y}  =  ^-  —  ,  when  px  is  finite,  and  py  infinite;   and 

"" 


,  when  py  is  finite,  and  px  is  infinite.     In  this  case  f(x,y)  differs  from  0  at 


an  unenumerable  set  of  points;  and  yet  the  set  of  points  at  which  /(.r,  y)>e  has  the 
plane  content  zero,  since  all  such  points  are  on  a  finite  number  of  lines  parallel  to  the 
coordinate  axis,  although  they  are  everywhere  dense  on  those  lines.  The  double  integral, 
and  consequently  the  repeated  integrals,  exist  in  this  case. 

4*.  An  example  has  been  given  in  Ex.  1,  §  143,  of  a  set  of  points  K  which  is  every- 
where dense  and  unclosed,  whereas  the  sets  Kx,  Ky  are  all  finite.  Let  /(#,  #)=c',  at  every 
point  of  K,  and  =  c,  at  every  other  point.  In  this  case,  the  double  integral  does  not  exist  ;  but 

/    f(x,y)dx=c,       I    f(x,y}dy  =  c, 
Jo  Jo 

whatever  values  y  and  x  may  have  in  the  first,  and  in  the  second,  integrals  respectively. 

ri         ri  ri         r\ 

Consequently          I     dx  I    f(x,y}dy     and     I     dy  I    f(x,y)dx 
J  o        J  o  J  o        J  o 

both  exist,  and  have  the  same  value  c. 

5*.  Let  a  set  {(#',  y'}}  be  defined  as  follows:  —  Let  x'  have  any  value  for  which  px,  is 
finite;  and  with  such  a  fixed  value  of  x',  let  every  y'  be  taken  for  which  py>  £±px>.  On 
every  line  parallel  to  the  y-axis  there  is  only  a  finite  number  of  points  of  the  set  ;  but 
the  set  is  everywhere  dense  on  every  line  which  is  parallel  to  the  #-axis,  and  which  has  for 
its  ordinate  one  of  the  y'.  Let  f(x,  y)  =  c',  for  the  set  {(of,  y')},  and  let  /(#,  y)  =  c,  for  all 

remaining  points. 

/i  •          r\         ri 

f  (&•,  y}  dy  =  c,  and      /     dx  I    f  (x,  y}  dy  =  c. 

o  Jo        J  o 

But   I    f(x,  y'}dx  has  c  and  c'  for  its  upper  and  lower  values;  and  the  set  of  values 
J  o 

fi         ri 
of  y'  being  everywhere  dense,  I     dy  I     f  (x,  y}  dx  does  not  exist. 

Jo         Jo' 

6t.    Let  f(x,y)  be  defined  at  all  points  of  the  rectangle  bounded  bj  #  =  0,  x=\ 
y  =  0,  i/=l,  by  the  condition  that  f(x,  y)  =  0,  except  at  those  points  (x',  y')  at  which 

,     2m  +  l         ,     Zp  +  l  ,.  ,  1 

x  =  -  >     '      y  =     a*-  '    where  /  (*  '  y  )  =  &  ' 

wt,  n,  p  and  q  being  positive  integers. 

In  this  case  the  double  integral  exists,  and  therefore  the  repeated  integrals  both  exist  J. 

*  Pringsheirn,  Sitzungsber.  d.  Leipziger  Akad.,  vol.  xxvin,  p.  71. 

t  Du  Bois  Reymond,  Crelle's  Journal,  vol.  xciv,  p.  278;  also  Stolz,  Gruiulziige,  vol.  in,  p.  73. 

J  This  is  denied  by  Stolz,  Grundzilge,  vol.  in,  p.  88,  on  the  ground  that  /(.r,  y)  for  .r  = 

is  not  integrable  with  respect  to  y,  but  has  —  and  0  for  its  upper  and  lower  values.  We  have, 
however,  shewn  that  this  is  no  justification  for  denying  the  existence  of  the  repeated  integral. 
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IMPROPER  DOUBLE  INTEGRALS. 

366.  As  in  the  case  of  single  integrals,  the  definition  of  a  double  integral 
may  be  extended  to  the  case  in  which  the  function  has  a  set  of  points  of  infinite 
discontinuity.  This  set  is  necessarily  closed,  and  it  will  be  assumed  throughout 
that  its  plane  content  is  zero.  It  will  also  be  assumed  that  the  domain  for 
which  such  a  function  is  defined  is  bounded,  and  that  the  frontier  has  the 
content  zero,the  domain  being  therefore  measurable  in  accordance  with  Jordan's 
definition ;  and  consequently  the  function  may  be  replaced  by  another  function 
defined  for  all  the  points  in  a  fundamental  rectangle,  the  new  function  being 
taken  to  vanish  at  all  points  not  in  the  original  domain,  and  to  have  the  same 
value  as  the  original  function  at  all  points  of  that  domain.  These  assumptions 
being  made,  a  definition  of  the  improper  double  integral  which  is  substantially 
the  one  given  by  Jordan*,  and  adopted  by  Stolzf,  may  be  stated  as  follows: 

Let  D1}  D2, ...  Dn,  ...  denote  a  sequence  of  domains  contained  in  the  funda- 
mental rectangle,  each  one  of  which  consists  of  a  finite  number  of  connex  closed 
portions,  each  with  its  frontier  of  zero  content,  and  in  which  the  number  of  the 
portions  may  increase  indefinitely  with  n.  Further,  let  us  suppose  that  none  of 
these  domains  contain,  in  their  interiors  or  on  their  frontiers,  any  point  at 
which  f(x,  y)  has  an  infinite  discontinuity,  and  that  the  sequence  is  such  that 
the  measure  of  Dn  converges  to  that  of  the  fundamental  rectangle ;  then  if  the 
upper  integrals 

I     f(x,y}d(x,y),  \     f(x,y)d(x,y),...   \     f(x,y)d(x,y ),..., 

J  Dl  J  J52  J  I>n 

taken  over  the  domains  Dl,  D2, ... ,  converge  to  a  definite  limit,  independent  of 
the  particular  sequence  \Dn]  chosen,  this  limit  is  defined  to  be  the  improper 
upper  integral 

f(x,  y)  d  (ac,  y) 


of  f(x,  y)  in  the  given  domain.  A  similar  statement  applies  to  the  case  of  the 
improper  lower  integral.  When  the  improper  upper  and  lower  integrals  both 
exist,  and  have  the  same  value,  then  the  improper  integral 


f(x,y}d(x,y} 

over  the  given  domain  is  said  to  exist,  and  to  have  this  common  value. 

It  will  be  observed  that  the  domains  Dn  are  all  measurable  in  accordance 
with  Jordan's  definition  of  a  measurable  set,  and  therefore  also  in  accordance 
with  the  definition  of  Borel  and  Lebesgue. 

In  case  the  function  f(x,  y}  be  integrable  (R)  in  all  the  domains  Dl,D.2, ..., 

*  Cours  d'Analyse,  vol.  n,  p.  76. 
f  Grundzilge,  vol.  in,  p.  124. 
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however  this  sequence  may  be  chosen,  subject  to  the  conditions  stated  above, 
then,  if  the  sequence 

I    f(x,y)d(x,y),     \    f(x,y)d(x,y\... 

J  DI  .JDt 

converge  to  a  definite  limit,  independent  of  the  particular  sequence  {Dn},  that 
limit  defines  the  improper  double  integral 


J 


as,  y). 


If  a  function  f(x,  y)  have  an  improper  integral  in  the  fundamental  rect- 
angle, then  f(x,  y)  has  a  proper  .R-mtegral  in  any  connex  closed  domain  of 
which  the  frontier  has  measure  zero,  and  which  is  contained  in  the  fundamental 
rectangle,  but  itself  contains  no  points  in  its  interior,  or  on  its  frontier,  at 
which  f(x,  y}  is  infinitely  discontinuous.  For,  by  the  definition,  the  integrals 


f(x,y)d(x,y),      \     f(x,y}d(x,y) 

J  D  J  D 


converge  to  one  and  the  same  definite  limit,  as  D  converges  to  the  fundamental 
rectangle ;  therefore  D  can  be  so  chosen  that,  if  e  be  an  arbitrarily  chosen 
positive  number, 

f(x>  V)  d  (x,  y}-]     f(x,  y)  d  (x,  y)  <  e. 

Now,  if  D'  be  any  domain  of  the  type  defined  above,  in  the  interior  of  D,  it  is 
clear  that  the  difference  between  the  upper  and  lower  integrals  of  f(x,  y) 
throughout  D'  cannot  exceed  the  difference  of  the  upper  and  lower  integrals 
throughout  D,  and  is  therefore  <  e.  Since  e  is  arbitrarily  small,  it  follows  that 
the  upper  and  lower  integrals  throughout  D'  must  be  identical,  and  therefore 
that  f(x,  y}  is  integrable  in  D'.  It  has  thus  been  shewn  that  if  f(x,  y}  have 
an  improper  double  integral  in  the  fundamental  rectangle,  it  must  possess  a 
proper  R-integral  in  any  connex  closed  domain  interior  to  that  rectangle,  such 
that  the  domain  has  its  frontier  of  zero  measure,  and  contains  no  points  of  infinite 
discontinuity  of  the  function,  either  in  its  interior  or  on  its  frontier. 

367.    The  necessary  and  sufficient  condition  for  the  existence  of  the  improper 
upper  double  integral 

I  /O>  2/)  d  (x,  y} 

is  that,  corresponding  to  any  arbitrarily  chosen  positive  number  e,  another 
positive  number  &  can  be  determined,  such  that,  if  A  be  any  connex  closed 
domain  whatever,  of  which  the  frontier  has  measure  zero, -and  which  is  contained 
in  the  fundamental  rectangle,  but  itself  contains  no  points  of  infinite  discontinuity 
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off(oc,  y),  either  in  its  interior  or  on  its  frontier,  then,  provided  the  measure  of 
A  is  <  8,  the  condition 

T 

jf(x,y)d(x,y)    <  e, 

taken  over  A,  is  satisfied. 

A  similar  theorem  applies  to  the  improper  lower  double  integral. 

To  shew  that  the  condition  stated  in  the  theorem  is  sufficient,  let  D,  D' 
be  two  domains  of  the  kind  specified  in  §  366,  such  that  m  (D),  m  (D')  both 
differ  from  the  area  of  the  fundamental  rectangle  by  less  than  8 ',  they  are 
both  interior  to  the  fundamental  rectangle,  and  contain  none  of  the  points 
of  infinite  discontinuity  of  the  function.  Let  d  be  the  set  of  points  of  D  which 
do  not  belong  to  D',  and  d'  the  set  of  points  of  D'  which  do  not  belong  to  D  ; 
then 

and  m  (d')<  A  —  m  (D)  <  8, 

where  A  is  the  area  of  the  fundamental  rectangle.    Also,  since  the  domains 

D  +  d',  D'  +  d  are  identical,  we  have 

T     -  T     -  T       T 

J-Jj      -1- 1)'  —  -f-d      -Ld' > 
where  /  denotes  the  upper  double  integral 


f(x,  y}  d  (x,  y} 
taken  over  the  domain  indicated  by  a  suffix.    It  follows  that 

hence  it  is  easily  seen  that  any  two  sequences 

both  converge  to  one  and  the  same  definite  limit. 

To  shew  that  the  condition  stated  in  the  theorem  is  necessary,  let  us  suppose 
that  it  is  not  satisfied.  We  thus  assume  that  a  domain  d,  of  arbitrarily  small 
measure,  can  be  found,  such  that  Id  >  e. 

Let  D  be  interior  to  the  rectangle,  and  such  that 

A-m(D)<S. 
Taking  D  to  contain  d,  we  then  have 

m  (D-d)>A-  28, 

provided  m  (d)  <  8. 

The  two  domains  D,  D  —  d  both  converge  to  A,  if  8  be  decreased  indefinitely; 

thus  j  ID  —  I  D-d    >  e, 

however  small  8  may  be ;  hence  the  limit  does  not  exist  in  this  case. 
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The  necessary  and  sufficient  condition  that  the  improper  upper  and  lower 
integrals  of  f(x,  y)  in  the  fundamental  rectangle  may  both  exist  is  that  the 
improper  upper  integral  of  \  f(x,  y}  may  exist 

To  shew  that  the  condition  stated  is  sufficient,  we  observe  that,  on  the 
assumption  of  the  existence  of 


d(x,y\ 
it  follows  from  the  theorem  established  above  that  the  upper  integral  of 

l/(«,y)l 

through  a  connex  domain  D,  interior  to  A,  and  containing  none  of  the  points 
of  infinite  discontinuity,  tends  to  the  limit  zero,  as  m  (D)  does  so.    Also 


\L 


f(x,  y)  d  (x,  y) 


i> 


f(x,y}d(x,y} 


are  both  less  than,  or  equal  to 

[    \f(x,y)\d(x,y\ 

J  D 

as  is  easily  seen.    It  follows  that  both  the  integrals 

I     f(®,y)d(x,y),    j    f(x,y)d(x,y] 

converge  to  zero,  as  m(D)  does  so,  and  uniformly  for  all  such  domains  D;  and 
that  these  are  the  sufficient  conditions  for  the  existence  of 


f(x,  y)  d  (x,  y},     J  f(x,  y}  d  (x,  y}. 

To  prove  that  the  condition  stated  is  a  necessary  one,  let  us  assume  that, 
for  every  connex  domain  Z),  satisfying  the  specified  conditions,  and  such  that 
m  (D)  <  8,  we  have 


f(x,  y}  d  (x,  y)1  <e. 
JD 

Now  let  f(x,  y}  =f+  (x,  y)  -f~  (x,  y\ 

where  /+  (x,  y}  =f(x,  y)  , 

at  all  points  where  f(x,  y)  is  positive,  and  everywhere  else  let 

also  /-  (x,  y)  =  -f(x,  y), 

at  every  point  where  f(x,  y}  is  negative,  and  everywhere  else  /'~  (x,  y}  is  zero. 
The  domain  D  may  be  divided  into  a  finite  number  of  rectangles  8,  some  of 
which  may  lie  partly  outside  D ;  the  functions  being  taken  to  be  zero  in  all 
such  outlying  portions.  Denoting  by  U*  the  upper  boundary  of  a  function  in 
the  rectangle  8,  we  have 

in  all  elements  8  in  which  f(x,  y}  has  positive  values;  and  in  all  other  elements 
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The  elements  may  be  taken  such  that,  if  rj  be  an  arbitrarily  chosen  number, 
the  inequalities 


25  Us  {  /+  (a,  y)}  -        /+  (a,  y)  d  (a,  y)  <  , 

are  both  satisfied.    These  conditions  are  also  satisfied  for  any  domain  contained 
in  D. 

We  have  now 
f   f+ 


D  A 

T 

=       /(#>  y)  d  (x, 

J  D, 

=  6  +  77, 

where  Dl  consists  of  the  domain  which  is  composed  of  those  elements  8,  of  D, 
in  which  f(x,  y}  has  positive  values.    Since  77  is  arbitrarily  small,  we  have 


Again,  since 

I  /O>  y}  d  (x>  y}  =  -     -{  f(x>  y)}  d  (®>  y\ 

J_D  J  D 

we  see  that  —  {  /(#,  y)}  d  (x,  y) 

JD 

has  the  limit  zero,  when  m(D)  has  the  limit  zero;  and  from  this  it  follows,  as 
before,  that 


J  D 

Since  f(x,  y}  =/+  (x,  y}  -f-  (x,  y\ 

we  have  I     j  f(x,  y}  \  d  (x,  y}  £  2e  ; 

J  D 

and  therefore  I     \f(x,y}    d(x,y) 

J  D 

has  the  limit  zero,  when  m  (D)  converges  to  zero.   It  now  appears,  by  employ 
ing  the  first  theorem  of  this  section,  that 


\f(oe,y)   d(x,y}, 

taken  throughout  the  fundamental  rectangle,  has  a  definite  value. 
It  should  be  observed  that  since 

\f(x,y}\d(x,y\ 

D 
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the  lower  integral  I     |  f(x,  y)  \  d  (a;,  y) 

— 

has  the  limit  zero,  whenever  zero  is  the  limit  of 

|    \f(x,y)\d(x,y}\ 

J  D 

and  that  therefore  |  |  f(x,  y}   d  (x,  y} 

7" 
always  exists  when  I    f(x,y}\d  (x,  y} 

does  so,  the  integration  being  over  the  fundamental  rectangle. 

368.  It  has  been  seen  in  §  366  that,  in  case  f(x,  y)  have  an  improper 
integral  in  the  fundamental  rectangle,  it  has  a  proper  integral  in  any  closed 
connex  domain  D  contained  in  that  rectangle,  which  has  a  frontier  of  zero 
measure,  and  contains  no  points  of  infinite  discontinuity  of  the  function.  It 
follows  by  the  theorem  (2)  of  §340,  that  \f(x,  y}  is  also  integrable  in  the 
domain  D;  and  we  have  already  seen  that  the  existence  of  the  improper  upper 
integral  of  f(x,  y)  is  a  necessary  consequence  of  the  existence  of  the  improper 
integral  of  f(x,  y).  It  thus  appears  that  the  improper  upper,  and  lower, 
integrals  of  \/(x,y)  must  be  identical,  and  therefore  that,  if  f  (as,  y)  be  a 
function  which  has  an  improper  double  integral,  in  accordance  with  Jordan's 
definition,  then  \  f(x,  y}  \  has  also  an  improper  integral,  so  that  every  such  im- 
proper integral  is  absolutely  convergent. 

We  have  seen  that  Cauchy's  definition  of  an  improper  single  integral  is 
applicable  not  only  to  the  cases  in  which  the  convergence  is  absolute,  but  also 
to  cases  in  which  the  convergence  is  not  absolute.  The  same  remark  applies 
to  Harnack's  extension  of  Cauchy's  definition,  which  will  be  considered  in 
Chapter  vii.  Jordan's  definition  of  an  improper  double  integral  is  however 
much  more  stringent  than  Harnack's  definition  of  an  improper  single  integral. 
In  the  latter  case  the  integral  is  defined  as  the  limit  of  the  proper  integral 
taken  through  a  finite  number  of  intervals,  not  chosen  arbitrarily  in  any  manner 
consistent  with  the  condition  that  the  sum  of  these  intervals  is  to  converge  to 
the  length  of  the  interval  of  integration,  but  chosen  so  as  to  satisfy  the  special 
condition  that  they  are  complementary  to  a  finite  set  of  intervals  which  contain 
within  them  all  the  points  of  infinite  discontinuity  of  the  function,  each 
interval  of  the  finite  set  containing  at  least  one  such  point. 

If  the  proper  integrals,  of  which  the  improper  integral,  in  Harnack's  de- 
finition, is  the  limit,  were  not  subjected  to  the  above  mentioned  restriction, 
reasoning  precisely  similar  to  that  applied  above  would  shew  that  every  improper 
single  integral  must  be  absolutely  convergent.  In  order  that  a  definition  of 
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the  improper  double  integral  should  admit  of  the  existence  of  double  integrals 
which  do  not  converge  absolutely,  it  would  be  necessary  to  subject  the  domains 
D!,  D2,  ...  Dn,  ...,  (the  proper  integrals  through  which  define,  as  their  limit, 
the  double  integral),  to  some  restriction  which  would  allow  of  the  existence 
of  a  limit  in  cases  in  which  such  a  limit  does  not  otherwise  exist,  independently 
of  the  particular  set  [Dn]  chosen.  Such  a  restriction  as  to  the  nature  of  the 
domains  Dn  would  correspond  to  the  restriction  to  a  special  class  of  sets  of 
intervals,  of  the  intervals  through  which  the  proper  integrals  in  Cauchy's  or 
Harnack's  definition  of  an  improper  single  integral  are  taken.  The  true  ex- 
tension of  Harnack's  definition  to  the  case  of  double  integrals  would  be  the 
following : 

Let  the  points  of  infinite  discontinuity  of  f(x,  y}  (the  set  of  such  points  being 
of  zero  content}  be  enclosed  in  a  finite  set  of  rectangles  with  sides  parallel  to 
those  of  the  fundamental  rectangle,  each  rectangle  of  the  finite  set  containing  at 
least  one  point  of  infinite  discontinuity,  and  no  such  point  being  on  the  frontier 
of  the  set  of -rectangles ;  and  let  Dn  denote  the  remaining  part  of  the  fundamental 
rectangle  when  the  finite  set  of  rectangles  is  removed.  Then,  if  f(x,  y}  have  a 
proper  integral  in  every  such  domain  Dn ,  and  if  this  proper  integral  converge 
to  a  definite  limit  when  any  sequence  whatever  of  such  domains  Dn  is  taken,  such 
that  the  measure  of  Un  converges  to  that  of  the  fundamental  rectangle,  this  limit 
shall  define  the  improper  double  integral  of  f(x,  y}. 

This  extension  of  Cauchy's  definition  would  admit  of  the  existence  of  non- 
absolutely  convergent  improper  double  integrals,  as  in  the  case  of  improper 
single  integrals.  With  this  definition,  the  theorems  of  §  367  would  no  longer 
be  valid. 

When  it  is  asserted  that  non-absolutely  convergent  double  integrals  do  not 
exist,  the  assertion  must  be  taken  to  mean  that  such  integrals  do  not  exist  in 
accordance  with  the  definition  of  Jordan,  and  not  that  it  is  impossible  to  give 
definitions,  such  as  the  above  extension  of  Harnack,  in  accordance  with  which 
double  integrals  would  exist  that  do  not  converge  absolutely. 

The  properties  of  improper  double  integrals  which  are  not  necessarily 
absolutely  convergent  are  more  restricted  than  those  which  exist  in  accordance 
with  Jordan's  definition,  and  it  is  consequently  a  matter  of  opinion  whether, 
though  the  former  certainly  exist  as  limits,  the  name  integral  may  be  ap- 
propriately applied  to  such  limits. 

EXAMPLE. 

If  we  take,  as  the  domain  of  integration,  the  rectangle  bounded  by  x=0,  x  =  a,  #  =  0, 

y  =  b,  then  the  double  integral  of  -  sin  -  is  not  convergent,  and  therefore  in  accordance  with 
y  xx 

fa  1          1 

Jordan'sdefmition  does  not  exist ;  although  the  single  integral  I    -  sin  -  dx  is  non-absolutely 

J  o  x       x 

convergent,  and  exists  in  accordance  with  Cauchy's  definition. 


368, 369]  Improper  double  integrals  495 

/» 1          1  /"<* 1          1 

-  sin  -  dx  depends  upon  the  fact  that  |    -  sin  -  dx  converges  to  a 
0 X  X  j,X  X 

definite  limit,  as  e  converges  to  zero,  and  this  is  sufficient  to  ensure  the  existence  of  the 
single  integral.  Although  I  -  sin  -  d  (#,  y\  taken  over  the  domain  bounded  by  x=f,  x=a, 

J    30  3C 

y=0,  y  =  b,  converges  to  a  definite  limit,  as  e  converges  to  zero,  this  is  not  sufficient  to 
ensure  the  existence  of  the  Jordan  double  integral.  Taking  Jordan's  definition,  let  the 
domain  Dn  consist  of  the  rectangular  spaces  bounded  by  the  lines  y=0,  y  =  6,  and  the  lines 
parallel  to  the  y-axis  at  the  extremities  of  the  intervals  on  the  #-axis 

\        /    _J  1        \ 

T/ '       \(2n  +  5)n'    (2tt  +  4)  IT/  ' 


w 

The  double  integral  taken  through  these  spaces  is 


or 


which  is  greater  than 

fa  1.1  n         ft 

J    _JL_.  &    1D#  (47l+l)n-Joi 

,    fa  1,1,  2nb 

or  than  b  I  -  sin  -  dx+  ,-—- 

J      i        x        x          (4n  +  1) 


/"«  fb  1    .     1   ,  P=2w    ,-2^r         /7>1    .     I 

/     -srn-axdy+      2      /  -sin-a-ray, 

./_J Jo*       a?  !>=»+! ./ i_J«x       x 

(2>l+l)  7T  (2p+l)  7T 

/"a  /"6  1  1  fa  p  =  2n  \ 

I  I    -  sin  —  d.vdy-{-b  I      sinz       2  — dz ; 


,    /"«  1    .     1   7         ft 
and  this  converges  to  6  i     -  sin  -  00;+  -     : 

y  o  x        x          2ir 

whereas   I  -sin  -d(x.  y},  taken  over  the  domain  bounded  by  x=e.  x=a,  ?/  =  0.  v  =  b. 
J  x        x  •'         a 

verges  to  b  I     -  sin  -  dx. 

J     « 

Therefore  the  mode  of  choice  of  the  intervals  Dn  affects  the  limit  to  which 


converges,  as  Dn  converges  to  the  complete  domain.    Thus  it  is  clear  that  the  double 
integral,  in  accordance  with  Jordan's  definition,  does  not  exist. 

THE  DOUBLE  INTEGRAL  OVER  AN  INFINITE  DOMAIN. 

369.  The  definition  of  a  double  integral  has  been  extended  by  Jordan*  to 
the  case  in  which  the  field  is  unbounded.  Let  the  function  f(x,  y}  be  defined 
at  every  point  of  an  unbounded  set  of  points  G,  and  let  it  be  assumed  that  the 
upper  and  lower  integrals  of/  exist  over  each  set  of  points  A,  that  is  measur- 
able (J),  contained  in  G,  in  accordance  with  the  definition  in  §  338,  or  in 
accordance  with  that  of  §  366.  This  does  not  make  it  necessary  that  the 
function  /  should  be  bounded  in  G,  even  if  it  be  bounded  in  every  such  set. 
We  shall  denote  the  upper  and  the  lower  integrals  of/  in  A,  by  /A  (/),  7A  (/). 
*  See  Cours  d'Analyse,  vol.  n,  p.  81. 
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Consider  a  sequence  {AM|,  of  sets  A,  such  that  An  contains  as  a  part,  all 
those  points  of  G,  whose  distance  from  the  origin  is  <  pn  ;  where  {pn}  is  a  mono- 
tone increasing  sequence  of  positive  numbers  which  increase  indefinitely  with  n. 

If  I&n  (/)  converges  to  a  finite  number  as  n  ~  oo  ,  independent  of  the  par- 
ticular sequence  {An},  subject  to  the  above  condition,  then  the  upper  integral 
IG(f),  of  the  function  over  the  set  G,  is  said  to  exist,  and  is  defined  by  that  number. 

A  similar  definition  may  be  applied  to  7e  (f),  as  the  limit  of  /an  (  /). 

When  Io(f)>  io(f)  both  exist,  and  are  equal,  their  value  is  said  to  define 
IG  (f),  the  integral  off  over  the  unbounded  domain  G.  The  necessary  and  suffi- 
cient condition  for  the  existence  of  IG(f}  **  that,  if  e  be  an  arbitrarily  chosen 
positive  number,  the  inequality  \  I&  (f)  \  <  e  holds  for  every  boundecLset  A,  measur- 
able (J),  of  points  contained  in  G,  and  such  that  all  the  points  of  A  are  at  a  distance 
from  the  origin  3£  p  (e),  where  p  (e)  is  some  positive  number  dependent  on  e. 

To  prove  that  the  condition  is  sufficient,  let  A(1),  A(2)  be  any  two  such  sets 
of  points,  measurable  (J),  each  of  them  containing  every  point  of  G  whose 
distance  from  the  origin  is  <  p  (e).  Let  £(1)  be  the  set  of  points  of  A(1)  which 
do  not  belong  to  A(2),  and  let  8(2)  be  the  set  of  points  of  A(2)  which  do  not  belong 
to  A(1);  thus  the  sets  A(1)  +  8(2),  A(2)  +  8W  are  identical.  Therefore 


and  since  8,  8'  each  contains  no  points  of  G  at  a  distance  <  p  (e)  from  the  origin, 
we  have    TAd)(/)  —  I&w(f)  <  2e. 

If  {  An}  be  a  sequence  such  as  has  been  defined  above,  then  if  n  be  sufficiently 
large,  pn>p(e);  if  then  A(1)=An,  A(2)=An+w,  we  have  I  *»(/}-!  *n+m(f)  <2e, 
for  a  sufficiently  large  value  of  n,  and  for  all  positive  values  of  m.  Therefore 
{[&n(f)}  is  a  convergent  sequence.  Again,  if  |An'}  be  any  other  such  sequence 
as  |An],  and  n,  n  are  sufficiently  large,  we  may  take  A(1)  =  An,  A(2)  =  A'n-  ;  and 
thus  Zin  (/)  —  /A'B,  (/)  <  2e.  As  this  holds  for  every  value  of  e,  when  n,  n'  are 
sufficiently  large,  corresponding  to  each  value  of  e,  the  two  sequences  {I&n(f}}, 
{!&'  '(/)}  converge  to  the  same  limit  ;  and  thus  Tg  (/)  exists  as  a  finite  number. 

To  shew  that  the  condition  is  necessary,  let  us  assume  that  it  is  not  satisfied. 
Then  e  can  be  so  determined  that,  for  every  value  of  p,  however  large,  there 
are  sets  A  which  contain  no  points  of  G  at  a  distance  <  p  from  the  origin,  and 
for  which  \I±(f)  j  =  e.  If  {An}  be  any  sequence  of  the  kind  employed  above, 
then  for  any  fixed  value  of  n,  the  set  A  can  be  so  determined  that  every  point 
of  it  is  at  a  distance  from  the  origin  greater  than  the  upper  boundary  of  the 
distances  of  all  points  of  An;  we  have  then  !/AB+A  (/)  —  ^A»(/)  =  e.  If  this 
be  done  for  each  value  of  n,  the  sequence  {f&n+&  (f)}  cannot  converge  to  the 
same  limit  as  {/*„(/)]  ;  the  set  A  depends  on  n.  It  should  be  observed  that 
the  choice  of  A,  for  each  value  of  n,  is  a  case  where  the  multiplicative  axiom 
is  employed. 

The  theorem  for  the  lower  integral  may  be  proved  in  the  same  manner,  or 
may  be  deduced  by  changing  the  sign  of  /. 
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370.    The  following  theorem  will  be  established  : 

The  necessary  and  sufficient  condition  for  the  existence  of  the  upper  and  the 
lower  integrals  of  f  over  the  set  0  is  that  the  upper  integral  of\f\  over  G  should 
exist  as  a  finite  number. 

It  will  first  be  shewn  that  the  condition  is  sufficient. 

The  upper  boundary  of  \f\  in  any  cell  is  the  greater  of  the  numbers 
|  U  ,  L  ;  where  U  and  L  are  the  upper  and  lower  boundaries  of/  in  the  celL 
and  the  lower  boundary  is  the  lesser  of  these  numbers.  Either  U  or  L  may 
be  infinite.  If  D  be  any  closed  set  of  points,  measurable  («/),  and  contained 
in  A,  we  have  accordingly 


If  m  (D)  converge  to  the  inner  extent  of  A,  we  see  that  these  relations  still 
hold,  when  A  is  written  for  D. 

If  then  IA  (|/|)  <  e,  for  every  set  of  points  A,  all  the  points  of  which  are  at 
a  distance  =  p  (e)  from  the  origin,  we  see  that  |  /A  (/)  |  <  e,  and  |  7A  (/)  |  <  e. 
Hence  the  conditions  for  the  existence  of  IG  (/),  1G  (/)  are  satisfied. 

To  prove  the  necessity  of  the  condition,  let  /=/+—/-,  where  /+=/  at 
every  point  at  which  /is  =0,  and  elsewhere  let  /+  =  0.  By  hypothesis,  if  A 
be  any  set  of  points  such  that  all  the  points  of  G  contained  in  it  are  at  a 
distance  ^  <r  from  the  origin,  we  have  1  1\  (/)  |  <  \e,  |  7A  (/)  |  <  £e. 

Let  A1(  A2  be  the  two  parts  of  A  in  which  /^  0,/<  0,  respectively;  and 
let  A1}  A2  contain  closed  sets  D1}  D2,  measurable  (J).  JA  (/+)  is  the  limit  of 
IDl  (/+),  or  of  /D1+i>2  (/+),  as  m  (A  +  D2)  converges  to  the  lower  extent  of  A. 
Hence  D1}  Dz  can  be  so  determined  that  I*(f+)  —  lDl+D2(f+)<'n,  and 
/A  (/~)  —  ID^+DZ  (f~)  <  *7  5  where  r)  is  an  arbitrarily  chosen  positive  number. 

Since  the  conditions  j  IDl  (/)  <  £e,  IDt(f)  <  ie,  hold  by  hypothesis,  we 
have  !DI+DZ  (/t)  <  ie,  IDl+D^  (f~)  <  £e,  and  therefore  JA  (/+)  <  ie  +  77, 
Now 


/-)  ^  |e  +  277  ; 

and  since  r)  is  arbitrary,  JA  (|/  )  ^  |-e<  e;  therefore  |/|  has  an  upper  integral 
over  G.    Thus  the  sufficiency  of  the  condition  has  been  established. 

371.    It  will  now  be  shewn  that  : 

In  order  that  IG(f}  may  exist,  it  is  necessary  and  sufficient  that  IG(\f  \) 
should  exist  as  a  definite  number. 

To  shew  that  the  condition  is  sufficient,  we  observe  that,  D  being  any 

closed  and  measurable  set  of  points,    Jj>(/)    and    //>(/)!  both  lie  between 

^7)  (I/I)  and  ID  (I/I)-    Hence  we  infer  that,  in  the  interval,  /e(|/|),  io(\f\\ 

both  IG  (/),  IG  (/)  are  contained.    If  then  JG  (|/j)  and  70  (|/j)  are  equal,  which 

H.  32 
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is  the  case  when  the  integral  I0(  /j)  exists,  it  follows  that  IG(f}  exists.  To 
shew  that  the  condition  is  necessary ;  we  observe  that  if  ID  (/)  exists,  in  any 
closed  set  D  that  is  measurable  (/),  so  also  does  ID(\f\)  (see  §368),  hence 
the  same  holds  good  for  any  set  A,  to  which  D  converges ;  and  proceeding  to 
the  limit,  as  A  converges  to  G,  we  see  that,  if  I0  (f)  exists,  so  also  does  /<?(!/!). 
It  thus  appears  that,  in  accordance  with  Jordan's  definition  of  the  integral  of 
f(x,  y)  over  an  unbounded  domain  G,  given  in  §369,  the  integral  is  necessarily 
absolutely  convergent,  in  the  sense  that  f(x,  y}  \  has  a  finite  integral  in  the 
domain  G.  It  has  been  seen,  in  §  358,  that  this  is  not  the  case  for  a  single 
integral  of  a  function  f(x)  over  an  indefinitely  great  interval  of  x,  as  this 
integral  may  exist,  and  yet  the  integral  of  \f(a)  over  the  same  interval  may 

sin  IT 
not  exist  as  a  finite  number.     For  example,  the  integral  of  -     -  over  the  un- 

(K 

bounded  interval  (0,  oo  )  is  not  absolutely  convergent.  The  reason  for  this  differ- 
ence between  the  two  cases  is  that  Jordan's  definition  for  the  double  integral  is 
much  more  stringent  than  the  definition  in  §  354  for  the  single  integral.  The 
single  integral  is  defined  as  the  limit  of  a  sequence  of  integrals  taken  each  over 
a  single  finite  interval.  If  the  integrals,  of  which  the  integral  over  the  un- 
bounded interval  is  the  limit,  were  not  restricted  in  this  manner,  but  might  be 
integrals  over  other  sets  of  points  measurable  (-/),  it  would  be  possible  to  give  a 
definition  in  accordance  with  which  only  absolutely  convergent  single  integrals 
over  an  unbounded  interval  would  exist.  It  would  be  possible  to  restrict  the 
domains  A  in  Jordan's  definition  in  such  a  manner  as  to  admit  of  the  existence 
of  non-absolutely  convergent  double  integrals  over  an  unbounded  domain  G. 

When  it  is  asserted  that  non-absolutely  convergent  double  integrals  over 
unbounded  domains  do  not  exist,  the  assertion  must  be  taken  to  mean  that 
such  integrals  do  not  exist  in  accordance  with  Jordan's  definition,  and  not  that 
it  is  impossible  to  give  a  definition  in  accordance  with  which  double  integrals 
exist  that  do  not  converge  absolutely. 

EXAMPLES. 

I*.    The  integral 

sin  (ax  +  by}  xr~lys~1  d(x,  y\ 


where  0  <  r  <  1,  0<s<],  taken  over  the  positive  quadrant,  has  no  existence  as  an 
absolutely  convergent  improper  integral.  We  find  that  the  integral  taken  over  the  rect- 
angle bounded  by  #  =  0,  x=h,  y  =  0,  y  =  k  tends  to  the  limit  a~rb~8T(r)  r(s)sin|(r+s)  T> 
as  h  and  k  are  increased  indefinitely.  If  the  integral  be  taken  over  the  domain  x  >  0, 
y  >  0,  ax  +  by  <  h,  then  when  h  is  indefinitely  increased,  the  integral  has  no  limit  if 
1  <  r  +  s  <  2  ;  but  it  tends  to  the  same  limit  as  before,  when  r  +  s  <  1.  The  integral  may 
be  regarded  as  conditionally  convergent,  if  we  adopt  a  definition  in  accordance  with  which 
it  is  sufficient  that  the  integral  taken  through  the  rectangle  #  =  0,  x  =  h,  #  =  0,  y  =  k  should 
have  a  definite  double  limit,  as  h  and  k  are  indefinitely  increased. 

*  Hardy,  Messenger  of  Math.,  vol.  xxxn,  p.  96. 
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2*.    The  integrals 

\  cos  (ax*  +  2hxy  +  %2)  d  (x,  y ),         j  sin  (ax* + Zhxy  +  fy»)  d  (a?,  y), 

where  a,  aft  -  A2  are  positive,  taken  over  the  positive  quadrant,  do  not  exist  as  absolutely 
convergent  integrals.  It  may  be  seen  that,  if  the  integrals  are  taken  over  the  quadrant  of 
a  circle  bounded  by  r  =  R,  the  value  of  the  integral  has  no  definite  limit,  as  R  is  increased 
indefinitely.  If  the  integral  be  taken  over  the  rectangle  bounded  by  x = 0,  x = h',  y  =  0,  y  =  H , 
then,  when  h'  and  kf  are  increased  indefinitely,  the  integrals  have 


0,  and 


for  limits  respectively,  the  inverse  cosine  having  its  least  positive  value.  These  may  be 
regarded  as  the  values  of  the  integrals,  subject  to  a  suitable  restriction  on  the  domains  of 
which  the  positive  quadrant  is  the  limit. 

If  a=0,  6  =  0,  A  =  £, 

sin  xy  d  (x,  y\ 


over  the  positive  quadrant,  has  no  existence,  even  considered  as  the  limit  of  an  integral 
over  the  rectangle.    But 

cos-ry  d(x, 


exists,  and  is  equal  to  £TT,  when  the  integral  is  defined  as  the  limit  of  the  integral  over  the 
finite  rectangle.    It  may  be  remarked  that  the  single  integrals 

Ch'  Ckf 

I     cosxydx,  smxydy 

Jo  Jo 

are  both  non-convergent, 


THE  TRANSFORMATION  OF  DOUBLE  INTEGRALS. 

372.  Let  (x,  y}  be  a  point  of  a  bounded  perfect  and  connex  domain  H,  and 
let  x  and  y  be  expressed  by  means  of  two  functions  fi,f2  in  terms  of  two  new 
variables  £,  77,  which  may  be  represented  by  points  (£,  77)  in  another  plane. 
Let  us  suppose  that  the  functions 

*=/l(f»^),  2/=/2  (£*?)> 

and  the  reciprocal  functions 

£  =  &  to  y)»    ^  =  02  toyX 

are  such  that  the  following  conditions  are  satisfied : 

(1)  To  each  point  (x,  y}  there  corresponds  one  point  (£,  t]} ;  and,  conversely, 
to  each  point  (f,  77)  there  corresponds  one  point  (x,  y) ;  and  to  the  bounded 
domain  H  there  corresponds  a  bounded  domain  H. 

(2)  The  functions  /j  (£,  ?;),  /2  (£,»;)  are  continuous  functions  of  (£,»?) 
throughout  the  domain  H. 

*  Hardy,  Messenger  of  Math.,  vol.  xxxn,  p.  159. 
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(3)  The  functions  /i  (£,  97),  /2  (£,  77)  have,  at  every  point  (f,  77),  of  5", 
definite  partial  differential  coefficients  with  respect  to  £  and  77,  and  each  one 
of  these  is  everywhere  continuous  with  respect  to  (£,  77),  and  nowhere  vanishes. 

(4)  The  Jacobian  of  /j  (£,  77),  /2  (f ,  77)  with  respect  to  £  and  77  does  not 
vanish  in  the  domain  H.     In  virtue  of  (3)  the  Jacobian  is  everywhere  con- 
tinuous, and  of  fixed  sign. 

From  (2)  and  (3)  it  follows  that,  if  (f  +  Af,  77  +  Aiy),  (£  77)  be  two  points 
of .//,  and  («  +  A#,  ?/  +  AT/),  (a;,  T/)  the  corresponding  points  of  H,  then 


where  0i,  #2>  #s>  #4  converge  to  zero,  as  A£,  A?7  do  so,  and  (see  §310) 
uniformly  for  all  points  (£,  77)  in  any  closed  domain  contained  within  U.  On 
solving  these  equations,  we  find 


with  a  similar  expression  for  A77,  where  J  denotes  the  Jacobian 

a  (/I./.) 
9(6  17)  ' 

and  «j  is  a  function  of  01}  ^2,  63,  6±  which  converges  with  them  to  zero. 
Since,  by  (4),  /  never  vanishes,  it  follows  from  these  equations  that  A£,  A?7 
converge  to  zero  with  A#,  AT/,  and  thus  that  the  functions  fa  (x,  y},  fa  (as,  y) 
are  both  continuous  functions. 

The  partial  differential  coefficients 

dfa  _  df*  I  j       dfa        dfa        dfa 
dx      d?]/  dy  '       dx  '       dy 

are  also  continuous  in  H;  and  therefore 


where  %1;  ^,  ^3,  ^4  converge  to  zero  with  A«,  Ay,  and  uniformly  so  for  all 
points  (x,  y)  in  a  closed  domain  contained  within  H. 

Corresponding  to  any  closed  set  h,  of  zero  content,  contained  within  H,  there 
is  a  closed  set  h,  of  zero  content,  contained  within  H.  It  is  clear,  from  the 
continuity  of  the  functions  which  define  the  transformation,  that  a  limiting 
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point  of  a  sequence  of  points  in  H  corresponds  to  the  limiting  point  of  the 
corresponding  sequence  in  H  ;  and  thus  h  is  closed,  since  h  is  so. 

Writing  A£  =  L&x  +  M&y,     A?;  =  L'&x  -f  M'&y, 

it  follows,  since 


where  L,  L',  ...  converge  uniformly  to 

d<fti       9$2 
das'      dx' 

that,  if  A#  |  ,  |  Ay  |  be  both  restricted  to  be  less  than  a  fixed  positive  number 
e,  the  ratio 


has  a  finite  upper  limit  A2,  for  the  whole  domain  H.  Now  let  the  points  of 
h  be  enclosed  in  a  finite  number  of  circles,  the  radii  of  which  are  all  <  e  ; 
it  then  follows  that  the  points  of  h  can  be  enclosed  in  a  finite  number  of 
circles  of  which  the  radii  are  all  less  than  eA.  The  sum  of  the  areas 
of  these  circles  on  the  (£,  77)  plane,  which  contain  within  them  all  the 
points  of  h,  has  to  the  sum  of  the  areas  of  the  circles  on  the  (x,  y)  plane, 
which  enclose  all  the  points  of  h,  a  ratio  less  than  A2.  Since  the  sum  of  the 
latter  circles  can  be  taken  to  be  arbitrarily  small,  it  follows  that  the  points  of 
h  can  all  be  enclosed  in  a  finite  number  of  circles  the  sum  of  whose  areas  is 
arbitrarily  small.  Therefore  h  has  the  content  zero. 

373.  Let/(#,  y)  be  a  bounded  function,  defined  for  all  points  of  a  closed 
connex  domain  G,  contained  in  H,  the  frontier  of  G  having  content  zero; 
and  let  f(x,  y}  be  integrable  (R)  in  G.  If  x,  y  be  expressed  in  terms  of  £,  77 
by  the  relations  * 


which  satisfy  the  conditions  of  §  372,  then,  corresponding  to  f{x,  y)  in  G,  we 
have  a  function  F(%,  77)  in  the  domain  G,  contained  in  H,  which  corresponds 
to  G.  The  frontier  of  G,  corresponding  to  the  frontier  of  G,  has  also  the 
content  zero.  A  point  of  discontinuity  of  f(x,  y},  in  the  (x,  y}  plane,  corre- 
sponds to  a  point  of  discontinuity  of  F  (£,  77)  in  the  (£,  77)  plane,  the  measures 
of  discontinuity  at  the  corresponding  points  being  the  same.  Since  those  points 
of  (x,  y}  at  which  the  saltus  of  /(a?,  y)  is  ^  k  form  a  closed  set  of  zero  content, 
it  follows  that  the  points  of  (£,  77)  at  which  the  saltus  of  F  (£,  77)  is  £  k  form 
also  a  closed  set  of  zero  content;  and  therefore  F(^,  77)  is  integrable  in  G. 

In  order  to  transform  If  (an,  y)d(x,  y},  taken  throughout  (r,  into  an  integral 
taken  throughout  G,  it  is  convenient  to  make  use  of  an  intermediate  transfer- 
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mation*  x  =  -\Jr  (ul}  u^),  y=u2,  followed  by  the  transformation  u1=^,  u2  =/%2(£,  rn}', 
the  function  -«/r(wi,  w2)  being  such  that 


It  is  easy  to  see  that  each  of  these  transformations  satisfies  the  conditions  of 
§372. 

Since  f(x,  y)  is  integrable  in  G,  we  may,  in  accordance  with  the  result  of 
§  363,  replace  the  double  integral 

J  /0»,  y)  d  (x,  y} 

[    7 

by  the  repeated  integral  I  dy  I  f(x,  y)  dx, 

J  J 

f     r 

or  by  \  dy  I  f  (x,  y)  dx. 

J  J 

Applying  the  transformation  x  =  ty  (Wj,  w.2)  to  the  upper  and  lower  integrals 
f(x,y)dx,      \f(x,y)dx; 


these  may,  in  virtue  of  the  theorem  of  §  360,  be  transformed  into  the  single 
upper  and  lower  integrals 

/  </>  (X,  Ma)  ^du^     I  <£  (it!,  u.2)  ^  dult 

where    $(uly  u2}   represents   the   function   of  u1}  u2  which   corresponds  to 
f(x,  y).    We  thus  have 

r  /'  $x 

J  f(x,  y)  d  (x,  y)  =     duzj  4>  (u,  ,  «2)  —  du, 


dx 


the  double  integral  being  taken  through  the  domain  in  the  plane  (w3,  uz), 
which  corresponds  to  G  in  (x,  y). 

(/IT 

Since  ^—  is  the  Jacobian  of  (x,  y)  with  respect  to  (ul9  u2),  we  have, 

011*1 

by  a  known  theorem, 

-_dx   9  (MI,  %) 


*  This  method  is  employed  in  the  general  case  of  multiple  integrals  by  Pierpont;  see  his 
paper  "  On  multiple  integrals,"  Trans.  Amer.  Math.  Soc.,  vol.  vi,  p.  432.  It  is,  however,  there 
assumed  that  f(x,  y)  is  integrable  with  respect  to  x  for  each  value  of  y  :  but  this  is  unnecessary. 
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hence,  since  J  never  vanishes, 

^£   and 
9wi 
also  never  vanish. 

Applying  the  same  method  of  transformation  to 


where  Wl  «*  f,     w2  =/2  (£  77), 

we  have  ]>(*,  y)  d  (*,  y)  =  ]>(£,  r,)  ^  |^2  d  (fc  77), 

where  ^  =  9_(Wl^2). 

8*7       8  (£,7)' 
hence  finally  we  obtain  the  formula 


which  is  the  formula  of  transformation  of  the  integral  of  f(x,  y}  throughout 
G  into  an  integral  throughout  G. 

It  has  been  assumed  that  /  has  a  fixed  sign  throughout  the  domain  of 
integration.  If  now  this  sign  be  negative,  the  product  A£A?7,  in  JAfA?/, 
which  corresponds  to  A#Ay  in  the  plane  of  (x,  y),  must  be  accounted  negative, 
when  AtfAy  is  positive.  It  is  however  more  convenient  to  consider  A£A?;  as 
essentially  positive,  otherwise  the  measure  of  a  set  of  points  in  the  (£,  77)  plane 
would  have  to  be  reckoned  as  negative.  Adopting  this  convention,  we  write 
\J\d(i;,r))  instead  of  Jd  (£,  77);  and  therefore  the  formula  of  transformation  will 
be  written  in  the  form 


,  y}  d(x,y}  =     F  (f  ,  ,7)  |  J  d  (£  T,). 


374.  Let  us  now  assume  that,  at  certain  points  of  G,  which  form  a  set  L 
of  zero  content,  either  (1),  f(x,  y)  has  an  infinite  discontinuity,  or  (2),  one  or 
more  of  the  partial  differential  coefficients 

9A      §A      ?/2     5/2 
8£'    fy'    3f'    877 

is  discontinuous,  or  (3),  the  Jacobian  J  vanishes.  In  case  J  be  positive  over  a 
part  of  G,  and  negative  over  another  part,  it  is  convenient  to  divide  the  double 
integral  into  two  portions,  taken  over  these  two  parts  of  G  respectively,  and 
to  transform  these  two  portions  separately.  It  will  accordingly  be  assumed 
that  J  never  actually  changes  its  sign  in  the  domain  G,  although  it  may  vanish 
at  the  points  of  the  part  L,  of  G.  We  may  denote  by  L  the  set  of  points  on 


The  Riemann  integral 


[OH.  vi 


the  (f  ,  77)  plane  which  corresponds  to  L  :  it  will  be  assumed  that  Z"  has  zero 
content.    It  will  be  shewn  that,  if  one  of  the  two  integrals 


J 


taken  over  0,  and 


,  y}, 
I  F(%,  rj)  J\  d(%,  77), 

taken  over  G,  exists  as  an  absolutely  convergent  improper  integral,  or  as  a 
proper  integral,  then  the  other  one  exists,  and  the  two  have  the  same  value. 

Let  us  assume  that  I    f(x,y)d(x,y}  exists:  it  will  then  be  sufficient,  in 
Jo 

order  to  establish  the  existence  of  the  other  integral,  and  its  equality  with  the 
first,  to  shew  that,  for  any  domain  6r1}  contained  in  G,  and  itself  containing  no 
points  of  L,  either  in  its  interior  or  on  its  frontier  (which  frontier  is  to  be 
taken  to  be  of  zero  measure),  the  condition 


/, 


is  satisfied,  provided  that  m  (G)  —  m  (GJ  be  less  than  some  fixed  finite  number 
dependent  on  rj. 

A  domain  g,  interior  to  G,  and  containing,  in  its  interior  and  on  its  frontier, 
no  points  of  L,  can  be  found  such  that 


r  r 

f(x,  y}  d  O,  y)  -  I    f(x,  y}  d  (x,  y) 
J  G  J  g 


<  e. 


If  h  be  any  domain  contained  in  g,  such  that  m(g)  —  m(h)  is  sufficiently 
small,  we  have 


and  therefore 


f(x,  y}  d(x,y}-\    f  (x,  y}  d  (x,  y) 

g  •  h 


f(x,  y}  d  (x,  y)  -      f(x,  y)  d  (x,  y}  <  2e. 


Now  let  k  be  a  domain  interior  to  G,  containing  in  its  interior,  and  on  its 
frontier,  no  points  of  L,  and  containing  h,  then 


f(x,  y}  d  (x,  y}  -  I  f(x,  y}  d  (x,  y} 

J  G  Jk 


<2e. 


For,  let  p  denote  the  domain  obtained  by  taking  the  two  domains  g  and  k 
together,  then 

r  r 

f(x,  y)  d  (x,  y}  -      f(x,  y)  d  (x,  y)   <  e, 

G  Jp  | 

f(x,  y}  d  (x,  y)  -       /(>,  y}  d  (x,  y)  <  e, 

p  J  k 

and  by  combining  these  inequalities  the  result  follows. 
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We  have  now 


505 


V  J  v 

where  U  is  the  domain  formed  by  taking  all  points  which  belong  to  one  or 
both  of  the  domains  Gl  and  h ;  and  V  consists  of  those  points  which  belong 
to  h  but  not  to  (•?!. 

Now  U  corresponds  to  a  domain  in  the  (x,  y)  plane  which  contains  h,  and 
which  domain  may  be  taken  to  be  identical  with  k ;  therefore 

r  r 

I    F(^,r))Jd  (f,  r))  differs  from    I    f(x,  y)  d  (x,  y} 
Jff  Jo 

by  a  number  numerically  less  than  2e.    Again 
F(^,rj)\J 


f 

J  "17 


where  p  is  the  upper  boundary  of    F  (%,  77)  J  \  in  the  domain  F,  obtained  by 
removing  from  h  those  points  which  belong  to  GI. 
We  thus  have 


\ 

J 


f(x,y)d(x,y)- 

G  &i 

Now  let  e  be  so  fixed  that  it  is  <  \T),  then  h  is  fixed,  so  that  //,  cannot 
exceed  a  fixed  finite  number  fj.l.    If  then  Gl  be  so  chosen  that 


the  inequality 


will  be  satisfied.    Therefore  it  follows  that 


exists,  and  is  equal  to 


/. 


I   f(x>y}d(x,y). 
Jo 


375.  This  method  of  transformation  may  be  extended  to  the  case  in  which 
one  of  the  domains  G,  G  is  infinite,  or  to  the  case  in  which  both  are  infinite, 
It  can  be  shewn  that,  if  either  of  the  integrals 


I  f(x,y}d(x,y\  t    F(£,rj)   J 

Jo  J- 


o  -& 

exists,  the  definition  of  §  366  being  applied  when  G  or  G  is  infinite,  then  the 


506  The  Riemann  integral  [CH.  vi 

other  integral  also  exists,  and  the  two  integrals  are  equal.    The  proof  can  be 
given  by  slightly  modifying  the  procedure  of  §  372. 

There  may  be  a  set  of  points  of  zero  content,  in  the  domain  of  G,  such 
that  the  corresponding  values  of  £,  77  are  infinite,  or  such  that  one  of  them  is 
infinite.  This  set  now  takes  the  place  of  the  set  L.  Whether  G  be  finite  or 
infinite,  the  finite,  or  infinite,  domain  h,  contained  in  G,  may  be  so  fixed  as  to 
exclude  all  points  which  correspond  to  infinite  values  of  £  or  77.  The  domain 
k  including  h,  and  containing  no  points  which  correspond  to  infinite  values  of 
£  and  77,  may  then  be  fixed  as  before,  and  will  satisfy  the  condition 

I    f(as,  y)  d  (x,  y)  -  I  f(x,  y)  d  (x,  y) 

JO  J  k 

it  being  assumed  that  the  integral  of  f(x,  y)  over  G  exists. 

The  finite  domain  h  contains  all  points  of  G  of  which  the  distance  from 
the  origin  is  less  than  some  number  R  depending  on  the  domain  G  —  h,  which 
contains  in  its  interior  all  points  (x,  y}  that  correspond  to  infinite  values  of  £ 
or  77,  or  of  both.  The  same  statement  holds  for  k,  which  contains  h.  When 
the  finite  domain  Gl  is  such  that  the  condition 


<2e, 


is  satisfied  (and,  in  order  that  this  may  be  the  case,  Gl  must  certainly  contain 
all  points  of  G  whose  distance  from  the  origin  is  less  than  some  fixed  number 
R!  ^  R),  we  have  as  before 


f(x,y)d(x,y)- 
and  as  77  is  an  arbitrarily  fixed  number,  we  thus  see  that 


G 

exists,  and  is  equal  to 


f(x,y)d(x,y). 

THE   RIEMANN-STIELTJES   INTEGRAL. 

376.  The  notion  of  the  integral  of  a  bounded  function  f(x),  defined  in  the 
linear  interval  (a,  6),  with  respect  to  another  function  <£  (x),  defined  in  the 
same  interval,  is  a  generalization  of  the  integral  of  a  function  f  (x),  with 
respect  to  the  variable  x,  that  was  first  introduced*  into  Analysis  by  Stieltjes 
in  connection  with  the  theory  of  continued  fractions.  This  integral,  in  a 
generalized  form,  has  recently  become  of  considerable  importance ;  and  con- 
sequently, an  account  of  it,  so  far  as  it  can  be  regarded  as  a  generalization  of 
the  JK-integral,  will  be  given  here. 

*  See  Annales  de  la  Faculte  den  Sciences  de  Toulouse,  vol.  vm,  p.  71. 
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If  (a,  #!,  #2,  ...  a?m_i,  b)  be  a  net,  fitted  on  to  (a,  6),  and  of  which  the 
breadth  of  the  greatest  mesh  is  d,  let  us  consider  the  sum 


where  /(a?),  <£  (a?)  are  the  bounded  functions,  defined  in  the  interval  (a,  b),  and 
£i>  &>  •••  £m  are  points,  assigned  in  any  manner,  which  are  in  the  meshes 

(a,  a?!),  (a?,,  a;2)  ...  (xm^,  6)  respectively. 

If  the  functions  /(a?),  <£  (a?)  be  such  that  Sd  converges  to  a  definite  number, 
as  the  number  m  of  the  meshes  is  increased  indefinitely,  subject  to  the  condition 
that  d  converges  to  zero,  and  if  this  limit  is  independent  of  the  mode  of 
successive  sub-division  of  the  interval  by  the  nets,  and  of  the  mode  in  which 
the  sets  of  points  £,  £2,  •••  %m  are  assigned,  f(x)  is  said  to  have  a  Stieltjes 
Integral  with  respect  to  </>  (a;).  Such  integral  is  defined  to  be  the  limit  of  84, 
as  d^O,  and  it  is  denoted  by 

*/(*)«**(«). 


It  will  here  be  spoken  of  as  an  .R$-integral,  or  Riemann-Stieltjes  integral. 

In  the  case  in  which  f(x)  is  continuous  in  (a,  b),  and  $  (x)  is  monotone 
and  bounded  in  the  same  interval,  the  existence  of  the  integral  was  established 
by  Stieltjes. 

If  <f>  (x)  be  a  function  of  bounded  variation  in  (a,  b),  f(x)  being  continuous, 
the  integral  also  exists.  For  <£  (x)  is  then  the  difference  of  two  bounded 
monotone  functions  fa  (x),  <£2  (x),  and  it  is  clear  from  the  definition  that 

(*/(*)<*#<•) 

J  a 

exists,  and  is  the  difference 


the  existence  of  these  latter  integrals  being  assumed. 
Further,  if  f(x),  <f>(x)  be  any  two  functions  for  which 


rb 
exists,  then  I    <f>  (x)  df(x) 

J  a 

exists,  and  the  two  integrals  satisfy  the  relation 

I    f(x)d(j)(x)  + 1    (J3(x)df(x~)=f(b)^)(b)—f(a)<f)(a). 

•J  a  J  a 

This  is  a  generalization  of  the  formula  for  integration  by  parts. 
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may  be  written  in  the  form 

-  [/(<*)  {<£  (ft)  -  <#>  (a)}  +/(«,)  {<£  (ft)  -*  (ft)}  +  •  •  •  +/K-0  {</>  (ft»)- 

-  *  (ft») 


The  points  a,  ^x,  f2>  •••  £m>  b  form  a  net  with  m  +  1  meshes,  the  greatest 
breadth  of  which  is  2c£TO. 

The  expression  on  the  right-hand  side  converges,  as  dm  ~  0,  to 


by  hypothesis. 

Therefore  the  expression 


converges,  and  consequently 


exists,  and  the  two  integrals  satisfy  the  relation  given  above. 

rb 
In  particular,  I    ^>  (ar)  d/(a;) 

J  a 

exists,  if /(a?)  is  continuous,  and  0  (#)  is  of  bounded  variation  in  (a,  6). 

THE   UPPER   AND   LOWER   RIEMANN-STIELTJES   INTEGRALS. 

377.    Let  f(x)  be  any  function  bounded  in  (a,  6),  and  let  <£(#)  be  a 
bounded  monotone  non-decreasing  function,  defined  for  the  same  integral. 

Let  a  net  a,  xl,  x2,  ...  a;w_1;  b  be  fitted  on  to  (a,  b),  and  consider  the  two 
sums 


r=m 


=  2   ff^  (/(*)){*  («V-0)-*(«w  +  0)) 

r  =  l 

+r2  VW  {<^>  (^  +  0)  -  e£  (arr  -  0)}. 

r  =  l 
r=w 

=  2  i;"°+o  (/(«))  {*(«V-0)-0  (aw*  +  0)) 


r=l 


where  x0  =  a,    xm  =  b,     <f>  (xm  +  0)  =  <f>  (b), 

^d  v?-i+0(f(*»t  ^:;+0(/(^)) 

denote  the  upper  and  lower  boundaries  of  f(x)  in  the  open  interval  («r_i,  xr). 
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U  and  L  denoting  respectively  the  upper  and  lower  boundaries  of  f(x)  in 
(a,  b),  we  have 


Therefore,  for  all  such  nets,  S  has  a  lower  "boundary,  which  can  be  denoted  by 


This  will  be  called  the  upper  Riemann-Stieltjes  Integral  of  f(x)  with  respect  to 
<j>(x),  in  the  interval  (a,  b).  For  shortness  it  will  be  spoken  of  as  the  upper 
RS-integral  of  f(x)  with  respect  to  </>(#). 

Similarly,  S  has,  for  all  possible  nets,  an  upper  boundary  which  may  be 
denoted  by 

•b 

/(a?)  d(f>  (x) ; 

and  this  may  be  called  the  lower  .R$-integral  of  f(x)  with  respect  to  </>(#) 
in  (a,  b). 

Since  IS  =  8,  it  follows  that 

76 


In  case  of  equality  of  upper  and  lower  JK$-integrals,  their  common 
value  is  denoted  by 

•6 

f(x)  d<j>  (x), 

and  this  defines    the    Riemann-Stieltjes    integral,    or    the    jR^-integral    of 
/(#),  with  respect  to  <£  (x),  in  the  interval  (a,  b). 

The  following  theorem  will  now  be  established  : 

Iff  (x)  be  bounded  in  (a,  b),  and  <£  (x)  be  of  bounded  variation  in  the  same 
interval,  the  necessary  and  sufficient  condition*  that  f(x)  should  have  an  RS- 
integral  with  respect  to  </>  (x)  is  that  the  variation  of  <f)  (x)  over  the  set  of  points 
of  discontinuity  of  f(x)  should  be  zero.\ 

It  will  be  observed  that  the  condition  cannot  be  satisfied  if  f(x),  </>  (x)  have 
any  common  point  of  discontinuity. 

The  upper  and  lower  variations  of  <£  (x),  in  case  it  be  monotone,  over  a  set 
of  points  in  (a,  b),  have  been  defined  in  §  252;  when  the  upper  variation  is  zero, 
the  variation  of  </>  (x)  over  the  set  of  points  exists,  and  is  equal  to  zero. 

We  may,  without  loss  of  generality,  assume  </>  (x)  to  be  monotone  and  non- 
*  See  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xiu,  p.  133. 
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diminishing  ;  for  in  the  general  case  <f>  (a?)  is  the  difference  of  two  such 
functions  P(x),  N(x),  and  the  variation  of  <£(#)  over  a  set  of  points  is  the 
sum  of  the  variations  of  P  (x},  N  (so)  over  the  set. 

It  is  easily  seen  that,  if  a  system  of  nets  be  fitted  on  to  (a,  6),  the  sum 


does  not  increase  as  n  is  increased. 

We  have  only  to  shew  that  Sn  does  not  increase  when  a  mesh  is  divided 
into  two.    Let  x'  be  a  point  interior  to  (#r_i,  xr}  ;  we  have  then 

if.1      Kr 


and         {<£  (av  -  0)  -  <£  (av_,  +  0)}  =  {<f>  (a?r  -  0)  -  <f>  (x  +  0)} 


hence     tP-  °+o  (  /(*))  {</>  (a,  -  0)  -  </,  (av_,  +  0)} 


from  which  the  result  follows.  We  now  see  that  8n  ^  Sn+l ;  and  thus  Sn  has 
a  lower  boundary  S.  Similarly  it  may  be  shewn  that  Sn  has  an  upper 
boundary  S. 

It  has  been  shewn  in  §  334  that,  if  £  be  a  prescribed  positive  number,  the 
interval  (a,  6)  can  be  covered  by  a  finite  set  of  overlapping  intervals,  each  ^  £, 
and  such  that,  in  each  interval,  the  fluctuation  of  /(#)'is  <  &>  (£)  +  e,  where  £ 
is  a  definite  point  interior  to  the  interval,  and  <w  (£)  is  the  saltus  at  £.  The 
end-points  of  this  finite  set  of  intervals  divide  (a,  6)  into  a  finite  number  of 
parts ;  let  77  be  the  length  of  the  least  of  these  parts.  Any  interval  whatever, 
S,  in  (a,  6),  of  length  <  77,  is  interior  to  one  of  the  intervals  of  the  finite  set. 
Hence,  in  S,  the  fluctuation  of  f(x)  is  <  e  +  w  (f),  where  £  is  some  point  in 
an  interval  of  length  <  £,  which  contains  8  in  its  interior. 

Let  Cr€  be  the  closed  set  of  points  at  which  &>  (x)  =  e ;  with  centre  x,  any 
point  of  Cre,  an  interval  of  length  2p  can  be  taken;  all  such  intervals,  or  the 
parts  of  them  in  (a,  6),  coalesce  into  a  non-overlapping  finite  set  A6)P,  of  which 
6re  is  the  inner  limiting  set,  as  p~0.  Assuming  that  the  variation  of  <f>  (x)  over 
Gre  is  zero,  p  can  be  so  chosen  that  the  upper  variation  of  <£  (x)  over  Ae>p  is 
<  5"',  an  arbitrarily  chosen  positive  number.  In  any  interval  of  the  set 
A€)P,  there  is  no  point  of  6re  of  which  the  distance  from  either  end-point 
is  <  . 
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We  can  suppose  £  and  consequently  rj,  to  be  so  small  that,  if  8  be  not 
interior  to  an  interval  of  A,)P,  there  is  no  point  of  Gf  in  the  interval  8. 

It  will  now  be  proved  that  $"=  S,  if  the  condition  that  the  variation  of 
<£(#)  over  the  set  of  discontinuities  of/(#)  is  satisfied.    We  have 


7-1  "VI 

The  number  n  being  taken  so  large  that  every  mesh  Dn  is  <  77,  we  then 
find  that 

Sn  -£n  <  2e  [</>  (6)  -  *  (a)]  +  f  (CT  -  £), 

where  the  first  term  on  the  right-hand  side  arises  from  those  meshes  that  are 
not  interior  to  an  interval  of  A€ip,  and  in  each  such  mesh  the  fluctuation 
is  therefore  <  2e;  the  second  term  arises  from  the  remaining  meshes.  Since 
e  and  £"  can  be  made  to  converge  to  zero,  as  n  ~  oo  ,  we  see  that  S  =  S. 

To  shew  that  the  condition  that  the  variation  of  <f>  (#)  over  the  set  of 
points  of  discontinuity  is  necessary,  in  order  that  iS  =  S,  we  observe  that 
tin  -  Sn  is 


where  the  summation  is  taken  for  those  meshes  of  Dn  which  contain  points 
of  Qf  within  them,  or  at  an  end-point.  For  if  a  point  of  Ge  be  a  common 
end-point  of  two  meshes,  in  one  at  least  of  these  meshes  the  fluctuation  is 
^  £e.  Unless  the  variation  <£(&•)  over  Ge  is  zero,  the  sum 

remains  greater  than  some  positive  number,  however  large  n  may  be,  and  in 
that  case  Sn  —  Sn  cannot  converge  to  zero. 

It  will  now  be  shewn  that  8,  the  common  value  of  S  and  S,  is  independent 
of  the  particular  system  of  nets.  If  possible,  let  the  values  Sw,  S®,  of  S  for 
two  systems  of  nets,  be  such  that  Sw  >  S®.  There  must  then  exist  a  net  Dt' 
of  the  second  system,  for  which  8®  >  £f  <2>.  Let  s  be  the  number  of  meshes  in 
Dt',  and  let  us  consider  the  nets  Dn  of  the  first  system.  Suppose  Dn  and  Dt' 
superimposed.  Of  the  meshes  of  Dn,  at  most  s-  I  will  be  altered  by  super- 
imposing Dt'. 

The  meshes  of  Dn  consist  (1),  of  those  that  are  interior  to  an  interval  of 
A<iP,  and  (2),  of  those  that  are  not  interior  to  any  such  interval.  Also  some  of 
the  end-points  of  the  meshes  of  Dn  may  be  points  of  Ge.  We  take  n,  as  before, 
so  large  that  all  the  meshes  of  Dn  are  of  length  <  77.  The  diminution  in  the 
value  of  Sn®  produced  by  superimposing  on  Dn  the  net  Dt'  is  less  than 

The  numbers  e,  £"  can  be  chosen  so  small  that  the  sum  isj  for  the  new  net 
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DnDt',  is  >  SX  But,  in  DnDt',  the  sum  £  is  ^  &X  and  therefore  the 
assumption  8(1}  >  S{2)  has  led  to  a  contradiction.  Thus  it  follows  that  S  is 
independent  of  the  system  of  nets. 

It  should  be  observed  that  the  condition  that  <£  (x)  has  variation  zero  over 
the  set  of  points  of  discontinuity  of  f(x)  may  be  satisfied  even  when  the 
measure  of  the  set  is  >  0. 

For  example,  if  <£  (x)  =  x,  at  the  points  of  a  closed  set  E,  of  positive 
measure,  and  if  <f>  (x}  is  constant  over  each  contiguous  interval  of  E,  whilst 
all  the  discontinuities  of  f  (x)  are  in  these  contiguous  intervals, 


rb 
exists,  although  I    f  (x}  dx  may  not  exist. 

J  a 

In  case  <j>  (x)  is  continuous,  the  expressions  for  Sn,  Sn  become 

__       r  =  mn 

Sn  =  to  (/(*))  {<£  (Xr)  -  0  (*„)}, 


-n  = 

It  can  be  shewn  that  these  sums  may  be  replaced  by 
T     '"  )  {0  (xr)  -  4,  (*,._,)}, 


without  affecting  the  values  of  the  upper  and  lower  integrals.  The  proof  is 
as  in  the  case  </>  (x)  =  x  ;  the  only  difference  is  that,  instead  of  S,  we  have  the 
difference  of  the  values  of  <£  (x)  over  the  end-points  of  a  mesh.  But,  on 
account  of  the  uniformity  of  convergence  of  <£  (x),  this  difference  is  less  than 
an  arbitrarily  small  number,  when  m  is  so  great  that  the  meshes  of  Dn+m  are 
of  sufficiently  small  breadth  ;  thus  8  is  replaced  by  a  number  that  converges 
to  zero,  as  m~oo  . 

378.  It  was  pointed  out  in  §  252  that  the  variation  of  a  monotone  function 
<£  (x\  over  any  set  of  points  in  (a,  6),  is  the  measure  of  the  corresponding  set  of 
points  £(=</>  (^0)  i°  tne  linear  integral  of  £,  provided  the  closed  interval 


be  taken  to  correspond,  on  the  %  -segment,  to  a  point  xly  on  the  ^-segment,  at 
which  <f>  (x)  is  discontinuous. 

If  F  (£)  be  denned  on  the  ^-segment  as  equal  to  the  value  of  f(x)  at  the 
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point  x  which  corresponds  to  £,  the  function  F  (£)  is  discontinuous,  at  the 
point  f,  corresponding  to  a  point  x  at  which  f(x)  is  discontinuous,  and  <f>  (x) 
continuous;  but  if  <£(#)  is  discontinuous,  F(i;)  is  constant  in  the  interval 
corresponding  to  x. 

The  condition  in  the  theorem  of  §377,  that  (f>(x)  should  have  variation 
zero  over  the  set  of  points  of  discontinuity  of  f(x\  is  equivalent  to  the 
condition  that  the  points  on  the  ^-segment  at  which  F(£)  is  discontinuous 
should  have  measure  zero.  When  the  .R$-integral  exists,  f(x)  and  <f>(x) 

cannot  be  discontinuous  at  the  same  point. 

rb 
It  has  thus  been  shewn  that  it  is  a  necessary  condition  that  I  f(x)  d<j>  (x) 

J  a 

rft 

should  exist,  where  <f>  (x}  is  monotone,  that   I    F  (£)  d%  should  exist  as  an  R- 

J  a 

integral ;  where  a  =  <j)(a),  /3  =  <f>  (b). 

The  .R-integral  in  the  ^-segment  may  however  exist  when  the  .R$-integral 
does  not  exist,  as  is  seen  by  considering  the  case  in  which  f(x)  and  </>(#)  are 
both  discontinuous  only  at  a  single  point  xl. 

379.    A  fundamental  property  of  the  .KS-integral  is  that 
[*/(*)  d<j>  (x)  =  \°  f(x]  d$  (x)  +  [*/(*)  d$  (x). 

J  a  J  a  J  c 

First,  let  c  be  a  point  of  discontinuity  of  f(x),  then  </>  (x)  is  continuous 
at  c.  We  consider  a  system  of  nets  such  that  c  is  an  end-point  of  meshes  of 
each  net  Dn. 

The  part  of  the  sum  S  which  involves  the  meshes  of  Dn  of  which  c  is  an 
end-point  is 

*Co  </(*)>{*(')-*(*••  +  °)}+  %l"°  (/(«))  tt  (^  -  0)  -  <£  (c)}, 
where  (xr,  c),  (c,  xr+l)  are  the  two  meshes. 

In  the  corresponding  sums  for  the  two  integrals  over  (a,  c),  (c,  b)  the 
corresponding  terms  are 

UC~l,  (/(*))  {<£  (c)  -  *  <«V  +  0)},   U^~°  (/(«))  {^>  (av+1  -  0)  -  <£  (c)}. 
Hence  we  have  Sn  =  Sn(1>  +  ^(2), 

and  similarly  we  have  Sn  =  Snw  +  Sn(2). 

Thus  tin  -Sn  =  (Sn®  -  ^n(1))  +  (Sn®  ~  Sn®)  J 

and  if  Sn~Sn<  e,  both  l8nM  -  £nw  and  13n®  -  £,t(2)  are  <  e. 

It  follows  that  both  the  integrals  in  (a,  c),  (c,  b)  exist,  if  that  in  (a,  6) 
exists,  and  their  sum  is  equal  to  the  latter  integral. 

Next,  let  c  be  a  point  of  discontinuity  of  </>  (x),  and  therefore  a  point  of 
continuity  of  f(x). 

H.  33 
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The  parts  of  the  three  sums  to  be  considered  are 

{*  (c  -  0)  -  £  (air  +  0)}  +  lf*^-°  (/<*))  {*  (av+,  -  0)  -  0  (c  +  0)} 


{</>('-  0)  -  4>  (av  +  0)]  +/(c)  {0  (c)  -  <£(c  -  0)} 
/(c)  {</>  (c  +  0)  -  t£  (c)}  +  Z7^»-«  (/(*))  {<£  (av+1  -  0)  -  <£  (c  +  0)}  ; 

hence,  as  before,  the  first  sum  is  equal  to  the  second  and  third  taken  together  ; 
whence  the  result  follows. 


rx+h  rx  rx+h 

f(x)d4>(x)-      /(«)<ty(«)-/        f(x)d<f>(x); 

J  a  Jo  J  if 


the  integral  on  the  right-hand  side  is 

rx+h 

M 

J  X 

where  M  is  between  U  and  L.    It  follows  that 


is  continuous  at  a  point  of  continuity  off(x},  since 

•x+h 

d(f>  (x)  =  $  (x  +  h)  —  (j)  (x}. 


In  case  f(x)  is  continuous  at  x,  we  have 

rx+h 
lim  f(x)  d4>  (ar)/{0  (a;  +  h)  -  <j>  (x)}  = 


h~Q      x 

which  is  a  generalization  of  the  theorem  to  which  it  reduces  when  <f>  (x)  =  x, 

that  rx 

f(x)dx 

J  a 

has  a  differential  coefficient  equal  to  f(x),  at  any  point  xy  at  which  f(x)  is 
continuous. 

380.    The  following  property  of  the  J?$-integral,  which  is  an  extension  of 
the  property  of  the  /^-integral,  will  be  established  : 

Iffl(x),f2(x)  both  have  RS-integrals  with  respect  to  the  monotone  function 
(j)  (x),  then  /j  (x)  +  f2  (x)  is  integrable  (RS),  with  respect  to  $  (x),  and 


Since  the  variation  of  (j)  (x)  is  zero  over  the  sets  of  discontinuities  of  f^  (x) 
and  fz  (x),  the  variation  is  also  zero  over  the  set  of  discontinuities  of 

/i- (*)+/•(*)• 

Therefore  the  latter  RS  -integral  exists. 
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In  any  interval,  if  77,  Z  be  the  upper  and  lower  boundaries  of 


U1}  L^  of  fi(x),  and  Uz,  Lz  of  fz(x),  we  have 


Hence,  if  Snw,  oV2),  Sn  be  the  three  sums,  for  a  net  Dn,  which  converge  to 
the  three  upper  integrals;  and  Sn(l},  Sn®,  Sn  the  corresponding  sums  which 
converge  to  the  lower  integrals,  we  have 

7T  a;  _i_  "w"  (2)  >  "#    >  .Q    >  a  a)  4.  .Of  (2) 

On      T  *->«,       =  AJji  =  On  =  >Jn      f  Onv  '. 

If  *SM(1»  —  $n(1),  $/2>  —  Sn®  both  converge  to  zero,  as  n*  oo  ,  it  follows  that 
Sn  -  Sn  converges  to  zero,  and  that 

lim  £>  +  lim  S>  =  lira  Sn, 

M-~QO  n~00  M~00 

whence  the  result  follows. 

It  is  easily  seen  that,  if  /i  (#),  /2  (x),  ...fr  (x)  are  functions  all  of  which  have 
an  .R$-integral  with  respect  to  <f>(x),  then  F  '(/1,  /2,  ...  fr)  also  has  such  an 
integral,  where  F  is  continuous  with  respect  to  (/i,/2,  •••/»•)  in  the  whole 
interval  (a,  b). 

381.  Let  a  function  f(x(l\  «(2)),  defined  in  a  cell  (a(l),  a(2>;  6(1),  6(2)),  be 
bounded  in  that  cell,  and  let  (f>  (XM,  «(2))  be  a  function  defined  in  the  same  cell, 
and  quasi-monotone  in  accordance  with  the  definition  in  §  255;  thus,  employ- 

ing the  notation  there  introduced,  A|^(1)'  ^(2)|  (f>(x(1\  x(2))  >0,  provided  x(l)^x(l\ 

x®  ^  x®. 

Let  a  net  be  fitted  on  to  the  cell  in  which  the  functions  are  defined,  and 
let  $i,$a>  •••$»!  denote  the  meshes  of  the  net,  and  Asr  <jb  (x(l)  ,  x®)  denote 

A  (£"<«'  f'w)  ^  ^(I>'  ^(2>)>  where  (ar<1>'  °r(2>;  ^(1V-/8r(2))  is  the  mesh  Sr;  moreover,  let 
U(Sr),  L(8r)  denote  the  upper,  and  the  lower,  boundaries  of/(#(1),  x('2})  in  the 
cell  8r. 

r=m 

The  sum   2    Z7(8f)  ASr^>  («(1),  «(2))  has  a  lower  boundary,  when  all  possible 


r=\ 


nets  are  taken  into  account;  and  this  lower  boundary  is  said  to  define 


(a'D,  aW) 

the  upper  integral  off(x(l\  #(2))  with  respect  to  (f)(x(l\  x{'2)),  in  the  cell  (a(l),  a(2); 
Similarly,  the  lower  integral  off(xM,x^)  with,  respect  to  <£  (x(l\  #(2))  is 

r=m 

defined  as  the  upper  boundary  of  S  L  (&r)  A5f  (/>  (x(l),  x(-}),  for  all  possible  nets 
fitted  on  to  the  fundamental  cell,  and  this  lower  integral  is  denoted  by 

/  \  JO      .  SC      )  (Z(D  \X     j  Ju      I, 

",  a  I2') 

33—2 
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In  case  the  upper  and  lower  integrals  off(x  (i),  #(2))  with  respect  to  (f>  (x(l\  #(2)) 

f(6(1),  6<2») 
are  equal,  their  common  value  f(%{1\  #<2))  dd>  (x(l\  aiW)  is  taken  to  define 

J(a<D,  a®)17 

£/*e  Riemann-Stieltjes,  or  RS-integral,  off(x(l\  ac(2))  with  respect  to  <$>(x(l\  #(2)). 

In  case  <£  (#(1),,  #(2))  is  simply  the  product  x(l)  .  x(z\  we  see  that  ASr  <£  (a:(l),  #(:!)) 
reduces  to  (/3r(l)  -  cxr(1))  (&.(2)  -  «r(2)),  the  measure  of  the  cell  Sr;  the  ES-integral 
is  then  the  .R-integral,  as  defined  in  §  338. 

If  (f>  (x(l},  «(2))  be  a  function  of  bounded  variation,  in  accordance  with  the 
definition  of  Hardy  and  Krause,  given  in  §  254,  it  is  expressible  as  the  differ- 
ence of  two  quasi-monotone  functions  P  (x(l),  #(2)),  N(xw,  #(2)),  (see  §§  254,  255), 
which  are  also  monotone.  The  .fi$-integral  of  f(x(l\  #(2)),  with  respect  to 
(f)  (#(l),  #(2)),  is  defined  by 
«,  6<2>) 


(b<D  ,6(2))  r(b  a)    6(2)) 

f(x^\  ^(2))  dP  (x^\  ^(2))  -  f(xU, 

(a(1),  a(2))  J  (a^1,  a<2') 

whenever  the  two  latter  integrals  exist. 

It  can  be  shewn  that  a  continuous  function  f(x(l\  x(<i])  certainly  has  an 
.R$-integral  with  respect  to  a  function  <f>  (#(l),  #(2)),  of  bounded  variation. 
We  need  only  consider  the  case  in  which  <f)  (x(l\  #(2))  is  quasi-monotone.  In 
case  the  spans  of  all  the  meshes  of  a  net,  fitted  on  to  the  fundamental  cell,  are 
sufficiently  small, 

r=m  r=m 

2  U  (Sr)  As,  </>  O'1',  «<2>)  exceeds  2  L  (8r)  Afir  0  (^'1>,  #<2>) 

r=l  r=l 

by  less  than  eAl  (1'  a  J  <£  (#(l),  #<2>)>  which  is  an  arbitrarily  small  number. 
Consequently  the  upper  and  lower  integrals  of  f(x(l\  #(2))  with  respect  to 
<f)  (x(l\  a-(2))  have  the  same  value,  and  the  US-  integral  of  f(xM,  #(2))  there- 
fore exists. 

If  f(xM,  «(2))  have  discontinuities,  the  criterion  for  the  existence  of  the 
integrabwith  respect  to  the  function  <f>  (cc(l},  #<2))  is  contained  in  the  following 
theorem*,  which  may  be  established  by  a  modification  of  the  method  of  §  377. 

The  necessary  and  sufficient  condition  that  the  bounded  function  f(x(l\  #(2)) 
should  possess  an  RS-integral  with  respect  to  the  function  <f)  (x{}),  #(2)),  of 
bounded  variation,  in  accordance  with  the  definition  in  §  254,  is  that  the  variation 
of  (/>  (#(l),  #('2))  over  the  set  of  points  of  discontinuity  of  f(x(l\  x(z  )  should  be  zero. 

The  variation  of  a  quasi-monotone  increasing  function  <j>  (#(l),  #(2)),  over  a  set 
of  points  G,  is  here  taken  to  denote  the  lower  boundary  of  2  A  (6)  0  (#(l),  #(2)), 
where  the  summation  is  taken  for  all  cells  of  a  set  which  contain  in  them  all 
the  points  of  G,  the  lower  boundary  being  taken  for  all  such  sets.  The  varia- 
tion of  a  function  $  (#(l),  #(2)),  of  bounded  variation,  over  the  set  G,  is  taken  to 
be  the  sum  of  the  variations  over  G  of  the  two  monotone  increasing  functions, 
P((jc(1\(ic(2}),  ~N(x(l\x(z)\  the  difference  of  which  is  <j>(xw,x(~2]),  (see  §254). 

*  See  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  281;  also  Proc.  Roy.  Soc.  vol.  xcni, 
p.  28. 


CHAPTER  VII 

THE  LEBESGUE  INTEGRAL 

382.  THE  definition  of  a  definite  integral  which  has  been  introduced  into 
Analysis  by  Lebesgue  is  of  much  wider  scope  than  the  definition  of  Riemann. 
The  theory  of  Lebesgue  integration  has  as  its  foundation  the  conception  of 
the  measure  of  a  set  of  points,  in  the  sense  in  which  the  term  is  employed  by 
Lebesgue.    An  account  of  this  theory  of  measure  has  been  given  in  Chapter  in. 
In  Riemann  integration,  the  domain  over  which  the  integral  is  taken  is  divided 
into  a  finite  number  of  intervals,  or  of  cells,  and  the  integral  is  defined  as  the 
limit  of  the  Riemann  sum  for  this  set  of  intervals,  or  cells.    In  Lebesgue 
integration,  on  the  other  hand,  the  domain  over  which  the  integral  is  taken  is 
divided  into  a  number  of  measurable  sets  of  points,  having  a  certain  property 
relative  to  the  function  to  be  integrated,  and  the  integral  is  defined  as  the 
limit  of  a  certain  sum  taken  for  all  these  measurable  sets  of  points,  as  the 
number  of  sets  is  indefinitely  increased.    The  distinction  between  the  Lebesgue 
integral  and  the  Riemann  integral  rests  essentially  upon  the  difference  between 
the  two  modes  of  dividing  the  domain  of  integration  into  sets  of  points. 

MEASURABLE  FUNCTIONS. 

383.  The  definition  of  Lebesgue  is  applicable  to  functions  belonging  to 
the  family  of  measurable  functions,  in  one,  or  more,  dimensions.    A  measurable 
function  f(x)  has  already  been  defined,  in  §  295,  as  such  that  the  set  of  points  x, 
for  which  f(x)  >  A,  is  measurable,  whatever  real  number  A  may  be.    This 
definition  is  applicable,  whether  x  denote  a  point  of  a  linear  set,  or  denote  a 
point  (x(l\  x(2}, ...  x(v}),  in  any  number  p,  of  dimensions. 

If  f(x)  be  a  measurable  function,  defined  at  each  point  of  a  given  domain, 
the  sets  of  points  for  which 

A  <f(x}<B-  A  3i/C»)<  B-  A  ^f(x)±B;  f(x)<A;  f(x)£A 
are  all  measurable,  whatever  real  numbers  A  and  B  may  denote,  provided  A  <  B. 

In  the  first  place,  the  domain  for  which  f(x)  is  defined,  and  for  which  it 
has  a  definite  value  at  each  point,  is  measurable.  For  let  A  have  the  values 
—  Nlt  —  N2,  ...  —  Nn, ...  successively,  of  a  sequence  such  that  Nn  increases  in- 
definitely as  n~ao.  The  set  En,for  which  f(x)  >  —  Nn,  is  measurable,  by 
hypothesis,  for  every  value  of  n.  The  domain  for  which  f(x)  is  defined  is  the 
outer  limiting  set  of  the  sequence  [En],  of  measurable  sets,  and  is  therefore 
itself  measurable.  The  set  of  points  for  which  f(x)  ^  A,  being  complementary, 
relatively  to  the  domain  of  the  function,  to  the  measurable  set  for  which 
f(x)  >  A,  is  measurable.  If  {An}  be  a  monotone  increasing  sequence  of  numbers 
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converging  to  A,  all  the  sets  for  which  f(x)  £  An  are  measurable,  and  their 
outer  limiting  set,  for  which  /(#)<  A,  is  consequently  measurable.  As 
the  sets  for  which/ (ac)  <  A  and  f(x)  ^  A  are  measurable,  it  follows  that  the 
set  for  which  f(x)  =  A  is  measurable. 

The  sets  for  which  f(x)  <  B,  and  f{x)  <  A,  being  measurable,  their  differ- 
ence, the  set  for  which  A  £f(x)<B,  is  also  measurable.  The  other  results 
in  the  theorem  follow  at  once. 

A  function  f(x)  is  measurable  if  the  set  of  points  x  is  measurable,  for  which 
a.<f(x)<j3,  for  every  pair  a,  /3,  of  real  numbers  which  belong  to  a  given 
set,  everywhere  dense  in  the  indefinite  interval  (—00,00).  The  given  set  may 
be  taken  to  be  enumerable. 

Let  A  and  B  be  any  pair  of  real  numbers  such  that  A  <  B.  The  number 
A  can  be  expressed  as  the  upper  limit  of  a  sequence  {an}  of  increasing  numbers, 
all  of  which  belong  to  the  given  everywhere  dense  set;  and  the  number  B 
can  be  expressed  as  the  lower  limit  of  a  similar  sequence  {{3n}  of  diminishing 
numbers.  The  set  en  for  which  an  <f(x)  < /3n  is  measurable,  for  each  value 
of  n ;  the  inner  limiting  set  {en},  of  the  sequence,  is  the  set  for  which 
A  ^f(x}^B]  and  this  set  is  consequently  measurable.  Since  this  is  the 
case  whatever  values  A  and  B  may  have,  it  is  seen,  as  before,  that  f(x)  is 
measurable. 

A  function  f(x)  is  said  to  be  measurable  (B),  if  the  set  of  points  for  which 
f(x)  >  A  is  measurable  (B),  whatever  value  A  may  have. 

The  proofs  given  above  shew  that  the  sets  for  which 

A<f(x}<B-  A£f(x)<B;  A  £f(x)  £  B;  f(x}<  A,f(x}  ^  A 
are  all  measurable  (B). 

384.  If  fa,  $2,  •••  <j>n  be  a  finite  set  of  functions  that  are  measurable  in  a 
'measurable  domain  G,  linear,  or  of  higher  dimensions,  and  if  F(<f)l,  <£2, ...  <£?l) 
be  a  function  that  is  continuous  relatively  to  (<£i,  </>2,  ...  <j)n),  for  all  values  of 
</>!,  </>2, ...  (f>n,  then  F(^)1,  <£2,  •••  <f>n)  is  measurable  in  the  given  domain. 

First,  let  us  assume  that  all  the  functions  $,,  02,  ...  (f)n  are  bounded  in 
the  given  domain  for  which  they  are  denned;  suppose  their  values  all  to  be 
in  the  interval  (—  N,  N).  Let  a  net  (c0,  Cj, ...  cm)  be  fitted  on  to  the  linear 
interval  (—  N,  N),  where  c0=  —  N,  cm  =  N,  and  suppose  the  breadth  cr  —  cr-i 
of  each  mesh  to  be  less  than  the  positive  number  77.  Let  the  function  -v^s  be 
denned,  corresponding  to  each  function  <f>s(s  =  1,  2,  3,  ...  n),  by  the  conditions 
tys  =  cr_!  at  every  point  at  which  cr_j  £</>«<  cr+l>  for  r  =  1,  2,  3,  ...  m  -  1,  and 
tys  =  cm  where  4>s  =  cm.  We  have  then  0  &  <f>s  —  yfrg  <  77,  and  the  function  ^s 
taking  only  the  values  in  the  finite  set  c0,  Cj, . ..  cm,  this  function  is  measurable 
in  the  given  domain. 
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Since  F(fa,  fa, ...  <f>n)  is  continuous  in  the  closed  domain, 

C_7V_Ar      •  N  N      } 
\    *» i     •*••>•••>  •*••»•*••  i  •••  ./i 

we  have  j  F(fa,  fa,...  <£„)  -  -Fty,,  fa, . . .  -^n)  |  <  e, 

provided  77  be  taken  sufficiently  small;  the  number  e  being  arbitrarily  chosen. 
The  function F  (fa,  fa, . . . fa^  has  only  a  finite  set  of  values,  and  is  measurable. 
If  U  and  L  are  its  upper  and  lower  boundaries,  we  have 
L-e<F(fa,fa,...<f>n)<  U+e 

in  the  whole  domain.  Let  A  and  B  be  any  two  numbers  in  the  interval  (L,  U), 
then  the  set  of  points  for  which  A  <  F(fa,  ^2>  •••  ^M)  <  -B  is  measurable. 

Now  let  e  have  successively  the  values  in  a  sequence  {et}  which  converges 
to  zero,  then  there  exists  a  corresponding  sequence  {rjt},  of  values  of  77,  which 
converges  to  zero. 

The  set  of  points  Et,  for  which  A  <F(fa,  fa,  ...  fa)  <  B,  is  measurable, 
for  each  value  of  ijt,in  {rjt}.  Each  point  of  the  set  for  which 

A  <F(fa,fa,...<f)n)<B 

belongs  to  all  the  measurable  sets  Et,  from  and  after  some  particular  value  of 
t,  and  therefore,  by  a  theorem  established  in  §  131,  the  set  is  measurable. 
It  has  thus  been  shewn  that  F(fa,  fa,  ...  <f)n)  is  measurable  in  the  domain  for 
which  the  functions  are  defined. 

Next,  let  the  functions  fa,  fa, ...  <j>n  be  unbounded.  Let  faW  be  defined 
by  the  conditions  fa(N)  =  fa,  when  N^fa^-N;  fa(N)=N,  when  fa.>N; 
and  fa(N)  =  —  N,  when  fa<  —  N.  From  what  has  been  proved  above,  we  see 
that  the  function  F(fa(-*r>}  fa^^,, . .  <f>nW>)  is  measurable.  Let  JVhave  successively 
the  values  in  a  divergent  sequence  {^j  of  increasing  numbers.  Each  point 
of  the  set  for  which  A  <  F(fa,  </>.2)  ...  $w)  <  B  belongs  to  all  the  measurable 
sets  for  which  A  <  F(^Nt\  <£2W>,  ...  <j>n(Nt))<  B,  from  and  after  some  par- 
ticular value  of  t.  It  thus  follows  that  the  set  is  measurable.  Therefore  the 
theorem  holds  when  fa,  fa, ...  fa  are  unbounded. 

In  particular  we  have  the  theorem  that : 

The  sum,  or  the  product,  of  any  finite  number  of  measurable  functions,  defined 
in  a  measurable  domain  of  any  number  of  dimensions,  is  a  measurable  function. 

If  all  the  functions  fa,  fa,  ...  fa  are  measurable  (B),  the  function 
F(fa,  fa,  ...  </>n)  is  measurable  (B).  For  the  sets  employed  in  the  above 
proof  are  all  measurable  (B). 

THE   LEBESGUE   INTEGRAL   OF   A  MEASURABLE   FUNCTION. 

385.  Let  us  consider  a  measurable  set  e,  and  let  the  function  f(x)  =  1 
at  all  points  of  e.  The  measure  m  (e),  of  the  set  e,  is  said  to  define  the  integral 

f(x)dx,  of  the  function  f(x),  over  the  set  e. 
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If  c  be  any  positive,  or  negative,  number,  and  f(x)  =  c  at  all  points  of  e, 
f(x)dx  is  defined  to  be  the  number  cm(e).    The  values  of  f(x)  at  points 

that  do  not  belong  to  e  are  irrelevant.    Next,  let  e1}e2,...en  be  a  finite 
number  of  measurable  sets,  no  two  of  which  have  a  point  in  common. 

Let  f(x)  =  cr  at  the  points  of  er>  for  r  =  1,  2,  3,  ...  n;  where  clt  c2,  ...  cn 
are  assigned  numbers.  Then,  if  E  denote  the  set  e-i  +  ez  +  . . .  +  en,  the  integral 

r—n 

f(x)  dx,  of /(#),  over  the  set  E,  is  defined  as  the  sum  2  crm  (er). 

(E)  r=\ 

Next,  let  f(x)  be  a  measurable  function,  defined  for  the  points  of  a 
measurable  set  E,  and  bounded  in  that  set.  Let  U,  L  denote  respectively 
the  upper,  and  the  lower,  boundary  of  f(x)  in  E. 

Let  the  interval  (L,  U)  be  divided  into  parts  (a0,  c^),  (a1(  a.2),  ...  (an_i,  an); 
where  a0  =  L,  an  =  U,  and  such  that  the  greatest  of  these  parts  ar  —  ar^ ,  for 
r=  1,  2,  3,  ...  n,  is  <  77. 

Let  er  be  that  measurable  part  of  E,  for  all  points  of  which 

where  r  —  1,  2,  3,  . . .  n  —  1 ;  and  let  en  be  that  part  of  f(x)  for  the  points  of 
which  aw_!  £f(x}  £  U. 

Let  $n(x)  be  the  function  which  has  the  value  ar_lt  at  all  points  of  er, 
for  r  =  1,  2,  3,  ...  n;  and  let  ^  (#)  be  the  function  which  has  the  value  ar,  at 
all  points  of  er,  for  r  =  1,  2,  3,  . . .  n. 

We  have  then,  in  accordance  with  the  above  definition, 

r=n 

<£,  (x)  dx  =  2  ar_!  TO  (er), 

(E)  r=l 


r  r=n 

I     ^  (x)  dx  =  2  ar  m  (er)> 

f  f  ,„. 

.        YT,  (x)  dx  —         9,  (x)  dx  <  77  m  (E). 

J(E)  J  (E) 


and  therefore 

J  (E)  J  (E) 

If  there  be  assigned  to  r),  successively,  the  values  in  a  sequence  {r)m} ,  of 
diminishing  numbers,  for  which  rjm  ~*  0,  as  m  ~  co ,  it  is  seen  that 

4>>im  (A')  ^  d°es  no^  diminish,  and  that  I      ^m  (x}  dx  does  not  increase,  as 

J  (E)  J  (E) 

m  ~  oo .    These  two  sets  of  numbers  consequently  converge  to  a  common 

r  r 

limit.    This  limit  lim          <f)nm  (x)  dx  =  lim         ^m  (x)  dx  is  defined  to  be  the 

m  ~  QO  J  (E)  m  ~  oc  J  (E) 

value  of  the  Lebesgue  integral  I      f(x)dx,  of  f(x),  taken  over  the  measur- 

J(E) 

able  set  E. 

In  order  to  justify  this  definition,  it  is  necessary  to  prove  that  the  number 
so  defined  is  independent  of  the  particular  mode  in  which  the  interval 
(L,  U)  has  been  successively  sub-divided. 
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Let  ^,  (x),  ^r,  (x)  be  the  functions  which  correspond,  in  a  second  mode  of 
sub-division,  to  <f>n  (x),  ^  (x)  defined  above.  It  is  easily  seen  that  there  is  no 
loss  of  generality  in  taking  the  sequence  {r)n}  to  be  the  same  in  the  two  cases. 

Suppose  the  two  sub-divisions  of  (L,  U),  corresponding  to  rjm,  to  be  super- 
imposed, and  let  <£,m  (x)  be  the  function  corresponding  to  <£,OT  (x)  and  <j>qm  (x). 
We  have  then 


f   =  r 

0  £         <£„„,  0)  dx  -        <£,TO  (x)  dx<v)mm  (E), 

J  (E)  .'  (E) 

0  i  f     fam  (x)  dx-t     &m(x)  dx  <  rjm  m  (E), 

J  E  J  (E) 


and  therefore 


(E)  J  (E) 

r 

0)  dx  ~ 

J  ( 


J  (E)  J(E) 

As  m  '»*  oo ,  2?;m  m  (E)  ~  0 ;  and  therefore 


<  2rjm  m  (E). 


r  r      - 

lim  </>nm  0*0  dx  =    lim  £->,»  (*)  dx' 

m~<x>  J  (E)  m~<x>  J  (E) 

Thus  the  number  by  which  the  Lebesgue  integral  is  defined  is  indepen- 
dent of  the  mode  of  sub-division  of  the  set  E. 

Accordingly,  the  definition  may  be  stated  as  follows : 


The  Lebesgue  integral  \      f(x)  dx  of  the  bounded  measurable  function  f  (x), 

J  (E) 

taken  over  the   measurable   set  E,  is   defined   as  the   limit  to  which  either 

r—n  r—n 

2  ar_i  m  (er),  or   2  ar  m  (er),  tends,  as  the  greatest  of  the  numbers  ar  —  ar-i 

r-l  r-l 

converges  to  zero  ;  where  (L,  U)  is  divided  into  n  parts  (ar_i  ,  ar),  for 
r  =  1,  2,  3,  ...  n  ;  and  a0  =  L,  an—  U  ;  and  where  er  is  that  set  of  points  of  E, 
for  all  of  which  ar_j  ^  f  (x)  <  ar  ;  f=I,  2,  3,  ...  n  —  1;  and  en  is  that  set 
for  which  am_j  =/(#)  =  an.  U  and  L  denote  the  upper  and  lower  boundaries 
off(x)inE. 

In  case  the  set  E  is  a  set  in  ^-dimensional  space,  f(x)  is  used  to  denote 


If  the  set  E  consists  of  the  points  of  a  finite  interval  (a,  b),  or  of  a  cell 

rb 
(a,  b),  the  integral  over  E  is  denoted  by  I    /  (x)  dx, 

J  a 
/•(&('>,  6<21,..  .&<">) 

or  f(x(l},  x®t. 

J  (a(»,aW,...aW)J 


It  will  be  observed  that  the  function  </>,,  (x)  is  a  function  which  takes  only 
a  finite  number  of  values,  for  all  values  of  x  in  the  set  E  ;  and  that  it  is  such 
that  OS  /(a?)  -£,(*)<  17. 
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The   integral          f(x)  dx   is   then   defined   as   the   limit   of    the   non- 
./(j) 

diminishing  sequence  of  numbers 


where  771,  773,  ...  ijm>...  is  a  sequence  of  diminishing  numbers  that  converges 
to  zero,  as  m  •%*  oo  . 

386.    Next,  let  the  measurable  function  f(x)  be  unbounded  in  the  measur- 
able set  E,  and  such  that/(#)  ^  0,  in  E. 

Let  fN  (x)  be  defined  as  a  bounded  function  in  E,  by  the  specifications, 


fN  (x)  =  JV,  where  f(x) 
The  number  N  is  any  assigned  positive  number. 

If  there  be  assigned  to  N  the  values  in  an  increasing  sequence  Nlt  N2>  ... 
without  upper  limit,  the  integral  I      f^(x)dx  is  a  number  which  does  not 

J(E) 

diminish  as  N  has  successively  the  values  in  the  sequence.    It  follows  that 
fN(x)dx  either  converges  to  a  definite  upper  limit,  or  that  it  increases 


(E) 

indefinitely  as  N  does  so. 

r 
When  lim         fN  (x)  dx  exists  as  a  definite  number  the  integt  al  I      ./(#)  dx 

A'~  ao  .    (E)  J  (E) 

is  defined  by  that  number. 

The  value  of  the  integral  is  easily  seen  to  be  independent  of  the  par- 
ticular sequence  of  values  of  N.    Thus 


r  •    r 

/  (x)  dx  =  lim          fN  (x)  dx. 

J  (E)  N~«>J(E) 

r  r 

If  f(x)  be^O,  in  E,  I      f(x)dx  is  similarly  defined  as  lim  I      f_N(x)dx, 

J  (E)  N~<*>J(E) 

where  f-N(x)  =f(x),  for  points  x  at  which  f(x)  ^  —  N,  and  f-N(x)=—N, 
when  f(x)  <  —  N. 

Lastly,  let  the  unbounded  function  f(x\  defined  in  E,  have  both  positive 
and  negative  values. 

Let  f(x)  =f+  (x)  -  f~  (x),  where  /+  (x)  =f(x),  when  f(x)  ^  0,  and 
y+  (x)  =  0,  when  f(x)  <  0 ;  with  a  similar  definition  for  /~  (x).  Both  the 
functions  /+  (x),  f~  (x)  are  measurable. 


The  integral  I      f(x}dx  is  defined  as  the  difference, 

J  (E) 

I     f+(x)dx-j     f~(x)dx, 

J  (Ei     -  J  (E) 

provided  both  of  the  latter  integrals  exist  as  finite  numbers. 
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When  a  measurable  function  f(x),  of  one  or  more  variables,  is  such  that 
f(x}  dx  exists  as  a  finite  number,  f(x)  is  said  to  be  summable  in  the  set  E. 


(E) 

A  measurable  function  is  always  summable  if  it  be  bounded,  but  not 
necessarily  so  if  it  be  unbounded. 

A  function  f(x)  that  is  summable  in  E  is  also  said  to  be  integrable  (L) 
in  E,  and  the  integral  f(x)dx  is  termed  the  Lebesgue  integral,  or  shortly 

J  (E) 

the  L-integral  of  f(x)  in  E. 

387.  It  will  be  observed  that,  in  accordance  with  the  above  definition,  in 
order  that  a  measurable  function  f(x),  defined  for  the  measurable  set  of 
points  E,  may  be  summable  in  E,  it  is  necessary  that  each  of  the  two  functions 
/+  (#)>  /"  (#)  should  be  summable,  that  is,  each  one  of  them  must  be  integrable 

(L)  in  E.    Thus  the  two  limits  lim  I      f+(x)dx,  lim  I      f~(x)dx  must  be 

N~<*>  J  (E)     '  JV~»  J  (E) 

both  finite. 

The  definition  may  accordingly  be  stated  in  the  following  form,  which  is 
a  generalization  of  a  definition*  due  to  de  la  Valle'e  Poussin,  originally 
applicable  to  the  case  in  which  /*(#),  f^(x)  are  restricted  to  be  integrable 
(R)  in  a  cell  or  interval  (a,  6). 

If  the  unbounded  measurable  function  f(x)  be  defined  in  a  measurable  set 
E,  and  N  and  N'  denote  two  positive  numbers,  let  fN,N>(x)  be  such  that 
fNt  N,  (x)  =  f(x),  at  points  such  that  —  N '  £  f  (x)  ^  N ;  and  fNt  N,  (x}  =  N  at  points 

such  thatf(x)  >  N  ;  and  fN  N>(x)  =  —  N',  whenf(x)  <  —  N' ;  then         f(x)  dx 

J  (E) 

is  defined  as  the  double  limit  of  I      fN  N'(x)dx,  as  N  and  N'  diverge,  inde- 

J  (E) 

pendently  of  one  another,  to  <x> ,  whenever  this  double  limit  exists  as  a  definite 
number. 

In  accordance  with  this  definition,  when  f(x)  is  summable,  so  also  is  f(x)  j; 

for  \f(*) !  =ft(x)  +f~(x),  and  therefore        |/(#)   dx  is  the  sum  of 

JCE) 

r  r 

lim   I     f^r(x)  dx  and  lim         f  ~  (x)  dx, 

AT~*>  J  (E)  JV~«>  J  (E) 

each  of  which  is  finite. 


Since  the  existence  of  /     f(x}dx  involves  that  of  |      \f(x)\dx,  the  L- 

J(E)  J(E) 

integral   I     f(x)  dx  is  said  to  be  an  absolutely  convergent  integral,  thus  : 

J(E) 

*  Liouville's  Journal  (4),  vol.  vm,  p.  427. 


524  The  Lebesgue  integral  [OH.  vn 

The  L-integral  of  an  unbounded  summable  function  is  an  absolutely  con- 
vergent integral. 


It  is  easily  seen  that 


I     f(x)dx  Si      \f(oc)  dx;  for  the  absolute  value 

J(E)  J  (N) 

[  f 

of  the  integral  on  the  left-hand  side  is  S        /+  (x)  dx  +  I     /-  (x)  dx. 

J(E)  J(E) 

If  f (1)  (a;)  ^/(2)  (a?)  =  0,  the  function /(1)  (x)  being  summable  in  E,  then  the 
function  /(2)  (x),  assumed  to  be  measurable  in  E,  is  also  summable.    For  we 

f  r 

see  that  /$  (#)  c&»  =/(^  (x)  dx,  hence        /'^  (a;)  dx  S       /^l  (a;)  rfa;,  and  it  then 

J(E)     '  .' (E) 

f  f 

follows  that  I     /(2)  (x)  Sexists,  and  is  ^        fw(x)dx. 

J  (E)  J  (E) 


388.  The  above  definition  of  the  Z-integral,  when/(#)  =  0,  in  E,  and  the 
function  f(x}  is  unbounded,  may  be  replaced  by  another  definition  which  we 
proceed  to  obtain. 

Let  a0)  al}  a2,  ...  an,  ...  be  an  increasing  sequence  of  numbers,  such  that 
an  has  no  upper  boundary,  as  n  ~  oo  ,  and  where  a0  =  0.  Also  let  ar  —  ar^  £  rj, 
for  every  value  of  r. 

00  00 

Consider  the  limiting  sums  a=  S  ar^1m(er),  a'  =  S  arm(er);  where,  as 

r  =  l  ;•-! 

before,  er  is  the  measurable  set  of  points  at  which  a,._x  ^=  f(x)<  ar.     The 

30 

difference  of  the  two  sums  a,  a   is  2  (ar  —  ar_i)  m  (er),  which  is  not  greater 

r  =  l 

than  ijm(E).  It  follows  that  <r,  <r'  are  both  finite,  or  both  infinite.  Now  let 
«0,  aly  a2,  ...  be  the  end-points  of  the  meshes  of  a  net,  fitted  on  to  the  infinite 
linear  interval  (0,  oo  ) ;  and  consider  a  system  of  such  nets,  for  which  rj  has  the 
values  in  a  sequence  {?;„} ,  of  diminishing  numbers,  that  converges  to  zero.  If 
it  be  assumed  that  or,  a-'  are  finite,  it  is  clear  that  a  has  the  values  in  a  non- 
diminishing  sequence,  and  that  a-'  has  the  values  in  a  non-increasing  sequence, 
as  the  successive  nets  of  the  system  are  taken.  It  follows  that  a  and  a-'  have 
definite  limits,  as  n  ~  oo  ,  and  these  limits  must  be  identical,  since  cr'  —  cr  ~  0, 
as  n  ~  oo  . 

It  can  be  shewn  that  this  limit  is  independent  of  the  particular  system  of 
nets  employed.  For,  if  crl,  cr/  refer  to  one  system  [D],  of  nets,  and  <r2,  cr2'  to  a 
second  system  {D1},  we  may  take  nets  D,  D'  of  the  two  systems  such  that,  for 
D,  cr/  —  ffl^rjm  (E),  and  for  D',  cr2'  —  cr2  ^  rjm  (E).  Now  consider  the  net 
(D,  D')  obtained  by  superimposing  the  two  nets,  and  let  a'  be  the  corresponding 
value  of  cr',  then  a'  is  ^  o-/,  and  ^  o-2';  therefore  a'  —  c^  g  rjm  (E},  and  a'  —  cr2 
is  £r)m(E),  and  moreover  a'  —  al,  ~a  —  a2  are  both  =0.  It  follows  that 
j  —  cr2  ^rjm  (E) ;  and  thus  the  limits  of  c^  and  o-2,  as  t]  ~  0,  are  the  same. 
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oo 

The  limit  as  n~oo,  of  2  ar^lm(er),  which  is  identical  with  the  limit  of 

r=l 

oo  r 

2  arm  (er),for  the  nets  of  a  system  {Dn},  defines  the  value  of       f(x)  dx,for  an 

r=l  J(E) 

unbounded  non-negative  function  /(#),  measurable  in  the  measurable  set  E, 
whenever  this  limit  exists. 

It  will  be  shewn  that  this  definition  is  completely  equivalent  to  that  given 
in  §  386. 

Let  us  suppose  that  as  =  N,  a  fixed  positive  number ;  then  if  o-  be  convergent, 

r=s  r=s 

we  have  a  =  S  ar^m  (Er)  +  Rs,  <r'  =  S  arm  (Er)  +  Rs',  where  N  can  be  chosen 

r=l  r=l 

so  large  that  Rg  and  Rs'  are  both  less  than  e. 

r  r=s 

Assuming  that         f(x)  dx  is  the  limit  of  2  arm  (Er)  +  Rg',  for  the  system 

.'  (E)  r=l 

of  nets,  the  integer  s  so  varying  that  as  =  N,  we  see  that  I      fN  (x)  dx  is  the 

'  (E) 
r=s 

limit  of  2  arm  (Er)  +  Rs",  where  Rs"  <  Rs'  <  e. 

r=l 

Since  Rs',  Rs"  differ  from  one  another  by  less  than  e,  it  follows  that 
f(x)dx,         fN(x)dx  differ  from  one  another  by  not  more  than  e;  the 

J  (E)  J  (E) 

number  N  having  been  chosen  sufficiently  large.    It  follows  that 
lim  I     fN  (x)  dx  =  I     f(x)  dx. 

N~tx>  J  (E)  J  (E) 

r  f  f 

Conversely,  let        f(x)  dx  be  defined  to  be  lim  I     fN  (x)  dx,  then         f(x)  dx 

J  (E)  N~x,J(E)  J  (E) 

is  defined  as  the  repeated  limit 

{r=s  oc  1 

S  ar_^n  (Er)  +  N    2    m  (Er)  \ , 
r=\  r=s+\  } 

n  denoting  the  order  of  the  net  (a0,  a1}  a2,  ...),  and  s  denoting  the  integer, 
dependent  on  n,  for  which  as  =  N. 

r=s  <x> 

Both  S  ar_lm(Er)  and  N    S    m(Er)  are  non-diminishing  as  ti  increases, 

r=l  r=s+l 

r  =  s 

and  as  s  increases,  hence  the  repeated  limit  lim  lim   S  ar,lm(Er)  must  exist, 

JV~x  «~x  r  =  l 
r=s 

as  a  finite  number.   Denoting  £  ar_^m(Er}  by  cnN,  we  see  that  cnN  is  such  that 

r-l 

cn'x'  =  cnN,  if  n'  ~  n,  N'  =  N.  If  we  write  £  =  l/n,  rj  =  l/N,  we  may  regard  c)UV  as 
a  function  F(%,  rj)  of  the  two  variables  ^,  rj ;  and  the  function  is  monotone  (see 
§307).  In  accordance  with  the  theorem  proved  in  §307,  the  repeated  limits 
of  F(%,  77),  as  £~0,  17  ~0,  have  one  and  the  same  finite,  or  infinite,  value. 
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That  F(%,  77)  is  defined  only  when  f,  77  are  the  reciprocals  of  positive  integers 
makes  no  difference  as  regards  the  validity  of  the  theorem.  We  now  see  that, 
if  either  of  the  repeated  limits  lim  lim  cnN,  lim  Km  cnN  exists,  as  a  finite 

»~co  JV~oo  N~<x>  n~x> 

number,  the  other  exists,  and  has  the  same  value  as  the  former. 

r=s 

We  thus  see  that  lim  lim  2  ar_^m  (Er)  has  a  finite  value.    It  follows  that 

w~»  N~x  r=l 
co  oo  r=s 

2  ar-\m,  (Er)  is  convergent,  and  that  lim  2  ar^m  (Er)  =  lim  lim   2  ar_^m  (Er). 

r—\  n~<x>  r=\  A7~oo  n~<x>  /•=! 

00  CO 

Now    N    2    m  (Er)  <    2    ar^  m  (Er)  ;     hence,  if  N   be    sufficiently    large, 

r=s+l  r=s+l 

00  OO 

N   2    m(Er)  becomes  arbitrarily  small,  and  lim  JV    2    m(Er)  =  0.     We  thus 

r=s+l  N~<x>       r=s+l 

have,  since  this  limit  is  independent  of  n, 

r  rr-s  ~|  co 

f(x)  dx  =  lim  lim     2  ar_1m(Er)    =  lim  2  ar_1m(Er), 

J(E)  n~<x>  N~<x>lr=l  J       n~x>  r=I 

r  oo 

which  proves  that  the  definition  of  I     f(x}dx,  as  lim  2  ar-.im(I8r),  is  equiva- 

J  (E)  n~<x>  r=l 

lent  to  the  definition  as  lim        fN  (x)  dx. 

N~x>J  (E) 

The  following  theorem  has  been  proved  in  the  course  of  the  above  proof: 
If  f(x)  be  a  summable  function  which  is  =  0,  in  the  set  E,  then 


TV 

ivhere  E(N)  is  that  part  of  E  in  which  f(x}  ^  N. 

When  f(x}  is  not  restricted  to  be  =0  in  E,  we  see  from  the  definition  of 

r  r  r 

f(x)  dx  as  I     /+  (x)  dx  —        f~  (x)  dx,  that  the  integral  may  be  defined* 

.'  (E)  J(E)  •  (E) 

00  00 

as  the  limit  of  either  of  the  expressions  2   ar^1m(Er),  2  arm(Er),for  a  system 

-  00  —CTj 

of  nets  (.  .  .  —  a_2,  —  a_l5  a0.  «i,  «2>  •  •  •)  fitted  on  to  the  indefinite  interval  (—00,00). 

We  have  also  the  following  theorem  : 

If  f(x)  be  summable  in  the  measurable  set  E,  and  E(N}  denote  that  part  of 
Efor  which  \f(x)  \  ^  N,  then  lim  Nm  (E(N))  =  0. 

JVr~oo 

OTHER  DEFINITIONS  OF  AN  INTEGRAL. 

389.  A  general  theory  of  integration  has  been  developed  by  W.  H.  Young, 
independently  f  of  the  work  of  Lebesgue,  in  two  memoirs.  In  the  second  of 
these  memoirs,  the  theory  there  developed  is  brought  into  relation  with  the 
work  of  Lebesgue.  The  definition  of  W.  H.  Young  is  a  generalization  of  that 

*  This  is  the  definition  given  by  Lebesgue;  see  his  memoir  "Intt3grale,  Longeur,  Aire,"  Annali 
di  Mat.  (ina),  vol.  vn,  p.  258. 

j-  See  the  paper  "  On  upper  and  lower  integration,"  Proc.  Loud.  Math.  Soc.  (2),  vol.  n,  and 
also  "  On  the  general  theory  of  integration,"  Phil.  Trans.,  vol.  cciv,  p.  221. 
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of  the  upper,  and  the  lower,  72-integrals,  in  which  the  domain  is  divided  into 
a  number  of  parts,  of  a  special  kind.  W.  H.  Young  defines  the  generalized 
upper  and  lower  integrals  of  a  function  by  means  of  a  division  of  the  measurable 
domain  of  the  independent  variable  into  sets  of  measurable  parts.  This  de- 
finition may  be  formulated  as  follows  : 

Let  the  measurable  set  E,  which  is  the  domain  of  integration,  be  divided  into 
a  finite,  or  enumerably  infinite,  set  of  measurable  components,  and  let  the  measure 
of  each  component  be  multiplied  by  the  upper  boundary  of  the  function  in  that 
component,  and  the  sums  of  all  such  products  be  formed.  The  generalized  upper 
integral  is  defined  to  be  the  lower  limit  of  that  sum,  for  all  possible  modes  of 
division  of  E  into  components,  as  above  described.  The  generalized  lower  integral 
is  defined  in  a  similar  manner,  by  employing  the  lower  boundary  of  the  function 
in  each  component.  If  the  generalized  upper,  and  lower,  integrals  both  exist, 
and  have  the  same  finite  value,  that  value  is  said  to  be  the  value  of  the  integral 
of  the  function  over  E. 

This  definition  can  be  applied,  not  only  to  bounded  functions,  but  also  to 
unbounded  functions.  In  the  latter  case,  such*  sets  only  are  employed  as 
have  finite  upper  and  lower  boundaries  of  the  function.  It  can  be  shewn  that 
this  definition  is  equivalent  to  that  of  Lebesgue. 

The  definition  of  an  integral  has  been  further  generalized  by  Pierpont,  so 
as  to  apply  to  the  case  in  which  the  set  E  is  non-rneasurable. 

A  set  H  is  said  to  be  measurable  relatively  to  E  (which  may  be  non- 
measurable)  if  there  exists  a  measurable  set  0  such  that  H=  D  (G,  E).  Thus 
H  is  that  part  of  E  which  E  has  in  common  with  a  measurable  set. 

Pierpont's  definition  f  is  obtained  from  that  of  W.  H.  Young  by  considering 
those  systems  of  sub-division  of  E  into  components  which  are  such  that  each 
component  is  measurable  relatively  to  E.  In  forming  the  expression,  of  which 
the  lower  boundary  is  the  upper  integral,  we  employ  the  exterior  measure  of 
each  component  of  E,  and  the  same  in  forming  the  expression  of  which  the 
upper  boundary  is  the  lower  integral.  With  these  changes  in  the  statement 
of  the  definition  of  W.  H.  Young,  we  obtain  the  definition  of  Pierpont. 

Other  modes  of  defining  an  integral  have  been  developed  by  W.  H.  Young. 
As  these  depend  upon  the  theory  of  sequences  of  functions,  they  will  be  referred 
to  in  Vol.  II,  in  that  connection. 

Another  definition  of  an  integral  has  been  given  by  Borel|. 

*  See  Hidebrandt,  Bulletin  Amer.  Math.  Soc.,  vol.  xxiv,  pp.  120—123. 

f  Theory  of  Functions  of  Real  Variables,  vol.  n,  pp.  343  et  seq. 

+  Journal  de  Math.  (6),  vol.  vm,  pp.  199—205;  also  Lemons  sur  la  Theorie  des  Fonctions, 
2nd  ed.,  pp.  217 — 256  ;  see  also  Hahn,  Monatshefte  fur  Math.  u.  Physik,  vol.  xxvi,  p.  3.  For  a 
criticism  of  this  definition  see  Lebesgue,  Annales  dc  Vecole  normale  (3),  vol.  xxxv.  See  also  Borel, 
ibid.  (3),  vol.  xxxvi. 
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THE  i-INTEGRAL  AS  THE  MEASURE  OF  A  SET  OF  POINTS. 

390.  It  has  been  shewn,  in  §336,  that  the  72-integral  of  a  function 
expresses  the  measure  of  a  set  of  points  that  is  measurable  (J).  We  proceed 
to  shew  that  the  i-integral  has  a  similar  relation  to  a  measurable  set  of 
points.  This  relation  may  be  expressed  in  the  following  theorem  : 

If  the  summable  function  f(x),  defined  for  the  measurable  set  of  points  E, 
be  such  that  f  (as)  =  0,  in  E,  then  the  set  of  points  defined  byQ^y  ^f(x),  x  in  E, 
has  for  its  measure,  plane,  or  (p  +  1)  dimensional,  according  as  E  is  a  linear, 

or  a  p-dimensional,  set,  the  value  of  \     f(x}  dx. 

J(E) 

The  theorem  will  first  be  established  for  the  case  in  which  f(x)=  c,  over 
the  set  E. 

In  that  case  the  two,  or  (p  +  1)  dimensional,  measure  of  the  set  of  points  for 
which  x  is  in  E,  and  0  ^  y  ^  c,  is  cm  (E).  For  the  sets  E,  C(E)  can  be  enclosed 
in  sets  of  intervals,  or  cells,  such  that  the  measure  of  the  set  common  to  the 
two  sets  of  intervals  is  <  77.  The  set,  x  in  E,  0  ^  y  £  c,  and  the  complementary 
set,  x  in  C  (E),  0^  y  ^  c,  can  consequently  be  enclosed  in  sets  of  rectangles, 
or  (p  +  l)-dimensional  cells,  such  that  the  measure  of  the  set  common  to  the 
two  sets  is  <  077.  As  77  converges  to  zero,  the  measure  of  the  set  of  intervals, 
or  cells,  enclosing  E,  converges  to  m(E),  and  the  measure  of  the  set  of  rect- 
angles, or  cells,  enclosing  the  set  (x  in  E,  0  <  y  ^  c),  converges  to  cm  (E),  which 

is  thus  the  measure  of  the  set.    Therefore,  in  this  case  I     f(x)  dx  is  equal  to 

J(E) 

the  measure  of  the  set,  x  in  E,  0  ^  y  =/(#).' 

r=m 

Next,  if  E=  2  er,  where  er  denotes  a  measurable  set,  and  f(x)  =  cr,  for 

r=l 

r  =  1,  2,  3, ...  m,  the  measure  of  the  set  (x  in  E,  0  ^  y  ^f(x})  is  the'  sum  of 

r=m 

the  measures  of  the  r  sets  (x  in  er,  0  ^  y  ^  cr),  which  is  £  crm(er),  and  is 

r=l 

therefore  equal  to  I     f(x)dx. 

If  f(x]  is  any  bounded  measurable  function  (^  0),  the  measure  of  the  set 
(x'u\E,Q^y  ^f(x))  is  between  the  measures  of  the  two  sets 

(x  in  E,  0  ^  y  ^  <£,,(#)),  (x  in  E,  0  =g  y  ^  ^(x)), 

r=n  r=n 

employed  in  §385;  that  is  between  2  ar^lm(er)&nd  2  arm  (er)',  consequently  it 

r=l  r=l 


differs  from  I     f(%)  dx  by  less  than  77  ( U  —  L),  which  converges  to  0,  as  77  does 

J(E) 

r 
so.    Therefore  the  measure  of  the  set  is  equal  to  I     f(x)  dx. 

J  (E) 
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If  f(x)  be  a  bounded  measurable  function  which  has,  in  E,  values  of  both 
signs,  we  see  that 

f    f(x)dx=\     f(x)dx-\      {-f(x)}dx, 

J(E)  J(E,)  <  (EJ 

where  Elt  Ez  are  the  parts  of  E  in  which  f(x)  is  positive  and  negative  re- 
spectively. 

It  follows  that  I      f(x)dx  is  the  excess  of  the  measure  of  the  set 

(xinE,,Q^y^f(x)\ 

over  that  of  the  set  (x  in  E2,  0  ^  y  ^  -/(«)). 

rb 
The  value  of  the  L-integral  I  f(x)  dx,  where  f  \x)  is  a  measurable  function 

J  a 

=  0,  representing  the  measure  of  the  set  of  points  (a~x^b,  0  ^ y  ^ /(#)), may 
be  regarded  as  representing  what  is,  in  geometrical  language,  called  the  area 
bounded  by  the  curve  y—f(x),  the  ordinates  at  a  and  6,  and  the  #-axis. 

f(bW,  6(2)) 

Similarly  /(#(1>,  x(<1})d(xw,  #(2>)  may  be  regarded  as  representing  the 

J  (a'D.al2)) 

volume  bounded  by  the  surface  y=f(x(l},  #(2)),  the  plane  y  =  Q  and  the  planes 
a?*1'  =  aw,  x(l)  =  b(l),  #(2)  =  a®,  x®  =  6(2).  Accordingly,  such  a  measure  of  an  area, 
or  of  a  volume,  may  exist,  in  accordance  with  the  definition  of  Lebesgue  in 
cases  in  which  it  does  not  exist  in  accordance  with  the  definition  of  Jordan, 
because  a  set  of  points  which  is  measurable  is  not  necessarily  measurable  (J). 

rb 
The  Z-integral  I    f(x)  dx,  when  f(x)  has  both  signs,  may  be  regarded  as 

J  a 

expressing  the  excess  of  the  area  above  the  #-axis  over  the  area  below  it.  If 
/(#)  ~  0,  in  (a,  b),  and  is  summable,  but  unbounded  in  the  interval,  the  i-integral 
is  the  limit,  as  N~<x>,  of  the  area  defined  by  a~x~b,  ^^y^fN{x\  the 
function  fN(x}  being  defined  as  in  §386. 

THE  jR-INTEGRAL  AS  AN  i-INTEGRAL. 

391.    It  will  be  shewn  that  an  .K-integral  is  also  an  .L-integral,  although 
the  converse  does  not  in  general  hold. 

Let  f(x)  be  integrable  (R),  in  an  interval,  or  cell,  (a,  b).  If  E  denote  the 
set  of  points  at  which  f  (x)  >  A,  any  point  of  E  which  is  a  point  of  continuity 
of  f(x)  must  be  an  interior  point  of  E ';  for  such  a  point  has  a  neighbourhood 
in  which,  at  every  point,  f(x)  >  A.  A  point  of  E,  at  which/(a;)  is  discontinuous, 
need  not  be  an  interior  point  of  E,  but  all  such  points  belong  to  a  set  of  zero 
measure,  since  /(#)  is  integrable  (R).  Therefore  E  consists  in  general  of  an 
open  set  together  with  a  set  of  measure  zero ;  and,  since  both  these  parts  of 
E  are  measurable,  it  follows  that  E  is  measurable.  Since  A  is  arbitrary,  it 
has  thus  been  shewn  that/(V)  is  a  measurable  function.  Since  it  is  bounded 
H.  34 
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in  (a,  b),  it  is  therefore  summable  ;  and  thus  has  an  Z-integral.  The  measure 
(/)  of  a  set  is  identical  with  its  measure,  when  both  exist  ;  and  therefore  the 
^-integral  of/(#;)  has  the  same  value  as  the  Z-integral. 

If  f(x)  is  measurable,  and  bounded,  but  not  integrable  (R),  it  is  easily 

Jb  rb  rb 

seen  that  I    f(x)  dx  =      f(x)  dx  =      f(x}  dx.    For,  in  any  mesh  8  of  a  net, 

•la  J  a  _  a 

the  Z-integral  of/(#)  lies  between  5?7(S)  and 


THE  LEBESGUE  INTEGRAL  AS  A  FUNCTION  OF  A  SET  OF  POINTS. 

392.  It  has  been  pointed  out  in  §  208  that,  if  a  class  of  sets  of  points  E 
be  assigned,  and  if  by  any  rule,  or  set  of  rules,  a  definite  number  is  given, 
corresponding  to  each  set  E  of  the  given  class,  the  set  of  all  such  numbers 
may  be  regarded  as  denning  a  function  of  E  which  has  these  numbers  for  its 
values. 

Let  the  sets  of  points  E  be  all  measurable  sets,  and  let  us  assume,  for  the 
present,  that  the  class  E  consists  of  all  measurable  sets  contained  in  a  measurable 
domain  K.  Let  a  function  f(x)  be  defined  for  all  points  of  that  domain,  and 
let  it  be  taken  to  be  summable  in  X,  whether  the  function  be  bounded,  or  not. 
It  is  easily  seen  that  f(x)  is  summable  in  any  one  of  the  sets  E.  For  \f(x)  \ 
is  summable  in  K,  and  the  function  $>E(x}  which  is  equal  to  \f(x)  in  E,  and 
to  0,  in  C  (E),  the  complement  of  E  relatively  to  K,  is  such  that  its  integral 
over  K  is  less  than  that  of  \f(x)\,  and  is  therefore  finite.  The  integral  of 
$E(x)  over  K  is  the  same  as  over  E  ;  therefore  \f(x}  \  is  summable  in  E,  and 
consequently  f(x)  is  also  summable  in  E. 

The  value  of  I      f(x)  dx  may  be  regarded  as  defining  a  function  (f>(E)  of 

J  (E) 

the  measurable  set  E.  In  the  particular  case  in  which  f(x)  =  1,  at  all  points 
of  K,  the  .^-integral  is  m  (E),  which,  as  has  been  shewn  in  §  130,  is  a  completely 
additive  function. 


The  function  (f>  (E)  =  I     f(x)  dx  converges  to  zero,  as  in  (E)  ~  0,  uniformly 

J(E) 

for  all  sets  E  in  the  fundamental  domain  K. 

Let  N  be  an  arbitrarily  chosen  positive  number,  and  let  KN  be  that  part 
of  the  fundamental  domain  K  in  which  f(x}  >  N.    Let 

o0,  a1}  aa,  ...  N,  am,  ag+2,  ..., 

where  a0  =  0,  as  =  N,  be  a  sequence  of  numbers  increasing  indefinitely,  such 
that  an+l  —  an  <  e,  for  all  values  of  n.    The  numbers  e,  N  can  be  so  chosen  that 

00  CO 

%  arm(er},    2  ar+1?n(e,.)  both  converge  to  values  less  than  t),  where  er  is  the 

r=s  r=s 

set  of  points  for  which  ar  ^  '/(#)  |  <  ar+1.    The  integral   I        (/(#)  |  dx,  which 

J  (JSTy) 

lies  between  the  two  sums,  is  therefore  also  less  than  77. 
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Now    f      |  /(a?)  \dx=l         \f(x)  \dx+\  \f(x)\dx\    where  KNE 

denotes  the  set  of  points  D  (E,  KN\  common  to  E  and  KN\  that  this  is  the 
case  follows  from  the  definition  in  §  385. 

f  r 

We  see  that  I          |/(X>  1^=1        [/(*)  \dx<r);  hence 

| /(a?)  |  da;  <  77  +  JV?»  (£"  -  ^i?)  <  T;  +  ^m  (E)  <  fy ; 


(X) 
provided  m(E)<  ij/N ;  whatever  set  E  be  taken  that  satisfies  this  condition. 

Since  77  is  arbitrary,  it  follows  that   lira     I      \f(x)  \  dx  =  0 ;  and  that  this  con- 

m(E)~OJ  (E) 

vergence  is  uniform  with  respect  to  E. 


Since 


f(x)  dx   ^  I      \f(x)   dx,  we  see  that          f(x)  dx  converges 

'  (JS)  J  (E)  .'  (E) 


•'  (E) 


to  0,  as  m  (E)  ~  0,  uniformly  for  all  sets  E. 

393.  Since,  in  accordance  with  §  390,  $>(E)  is,  if/(#)  =  0,  the  measure  of  a 
two-dimensional  set,  or  a  (p  +  1  )-dimensional  set,  according  as  E  is  a  linear  set, 
or  a  jp-dimensional  set,  it  follows  from  the  theorem  (§  130)  that  the  measure 
of  a  set  is  completely  additive,  that,  in  the  case  in  which  f(x)  =  0,  in  K,  the 
function  (}>(E)  is  completely  additive.  In  the  general  case  in  which  f(x)  is 
the  difference  f+(x)—f~(x),  of  two  summable  functions,  each  of  which  is 
=  0,  by  applying  the  result  to  each  of  these  functions,  we  obtain  the  following 
theorem  : 

r 

The  function  (f>(E)  defined  as  the  value  of         f(x)dx,  where  E  is  a 

J  (E) 

measurable  set  contained  in  the  fundamental  interval,  or  cell,  in  which  f(x)  is 
summable,  is  a  completely  additive  function  of  E. 

In  particular  we  have 

I  f(x)  dx  =  I       f(x)  dx+\      f  O)  dx. 

J 


(E,)  (E.2) 

A  special  case  of  this  theorem  is  that,  if  8lf  S2,  ...  Sn,  ...  be  a  finite,  or  an 
enumerable,  sequence  of  non-overlapping  intervals,  or  cells,  contained  in  the 
fundamental  interval,  or  cell,  and  A  denote  their  sum,  or  limiting  sum,  then 


oo        r 

f(x)  dx  —  2    I      f(x)  dx. 

n=lJ  (5,,) 


34—2 
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EQUIVALENT  i-INTEGRALS. 

394.  If  the  values  of  a  function  f(x),  that  is  summable  over  a  set  E,  be 
altered  at  points  of  E  belonging  to  a  part  E0,  for  which  m  (E0)  =  0,  the  value 

of         f(%}  dx  is  unaltered.    For,  in  the  definition  of  the  integral,  the  measures 

J  (~E) 

of  all  the  sets  employed  will  be  unaltered.  Moreover,  the  integral  of  /(#)  over 
the  set  E  will  be  the  same  as  over  the  set  E  —  E0. 

Two  functions  defined  for  a  measurable  set  E,  which  have  almost  every- 
where in  E,  equal  values,  are  said  to  be  equivalent  functions.  Thus  all  summable 
and  equivalent  functions  have  the  same  //-integral. 

A  function  that  is  unbounded  in  E  may  be  equivalent  to  a  bounded  function; 
for  all  the  points  at  which  the  unbounded  function  is  numerically  greater  than 
a  certain  fixed  number,  may  form  a  set  of  which  the  measure  is  zero. 

In  some  cases,  especially  in  the  theory  of  functions  defined  by  series,  it  is 
convenient  to  admit  infinite  values  of  a  function  at  particular  points  of  the 
set  E.  If  these  points  form  a  set  E0  such  that  m  (E0)  =  0,  we  can  regard 

f  f 

f  (x)  dx  as  existing,  and  equal  to  f(x}dx,  whenever  this  latter 

.'  (E)  J  (E-E«) 

integral  exists. 

Iff(x)  be  ^  0,  in  the  set  E,  of  measure  >  0,  and  such  that  I      /(#)  dx  =  0, 

J  (E) 

thenf(x)  is  zero  almost  everywhere  in  E. 

Let  £*€  be  the  set  of  points  of  E  at  which  f(x)  =  e,  then  from  the  definition 
of  the  Z-integral,  we  should  have  I  f(x)dx^em(Ee);  it  follows  that  the 

J  (E) 

integral  of  f(x)  cannot  vanish  unless  m  (E^)  =  0  ;  and  this  must  be  the  case 
for  every  positive  value  of  e.  The  set  of  points  at  which  f(x]  >  0  is  the  outer 
limiting  set  of  the  sets  Eej,  E^,  ...,  corresponding  to  a  sequence  of  diminishing 
values  of  e,  that  converges  to  zero.  It  follows  (§  131)  that  the  set  of  points  at 
which  /(a?)  >  0,  has  measure  zero. 

If  the  summable  function  f  (x),  defined  for  an  interval,  or  cell,  (a,  b\  be  such 
that  its  integral  over  any  interval,  or  cell,  whatever,  contained  in  (a,  b),  is  zero, 
the  function  f  (x}  must  be  zero  almost  everywhere  in  (a,  b). 

Let  A  denote  a  set  of  non-overlapping  intervals,  or  cells,  containing  the 
points  of  the  set  E1}  that  part  of  (a,  6)  for  which  f(x)  ^  0.  We  have  then 

[     f(x}dx+l  f(x}dx=\     f(x)dx  =  0. 

J  (E,)  J  (b-Et)  J(A) 

The  set  A  can  be  so  chosen  that  m  (A  —  E^  is  arbitrarily  small,  and  therefore 
so  that  the  integral  of  /(.*)  over  A  —  E^  is  arbitrarily  small.  It  follows  that 

f(x)  dx  =  0,  and  therefore/(o;)  is  zero  at  almost  all  points  of  E^    Similarly, 


it  can  be  shewn  that  f(x)  is  zero  at  almost  all  points  of  the  set  of  points  at 
which  f  (si)  ^  0.    It  follows  that  f(x)  is  zero  at  almost  all  points  of  (a,  b}. 
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PROPERTIES  OF  THE  LEBESGUE  INTEGRAL. 

395.  It  has  been  shewn  in  § 384  that,  if  /, (x),fz(x)  be  functions  that  are 
measurable  in  the  measurable  set  E,  then  their  sum  fi(x)+/2(x)  is  also 
measurable  in  E. 

It  will  now  be  proved  that 

r  f  f 

{ yi  (x)  +f*  (*)}  dx  =  I      fi  (x)  dx  +         f9  (x)  dx. 

J  (E)    "  J  (E)  J  (E) 

First,  let  it  be  assumed  that  yi  (a?),  /2  (x)  are  bounded.    If  e  be  an  arbitrarily 
chosen  positive  number,  let  er(l),  er(2)  denote  the  two  sets  in  which 

re  ^yi  (x)  <  (r  +  1)  e ;  re  ^/2  (a?)  <  (r  +  1)  e. 
Consider  the  two  sums  2  rem(er(li),    2  rew(er(2)),  where  a, /3  are  integers 

r=a  r=a 

(positive  or  negative),  such  that  all  the  values  of  yi  (x),  f2  (x),  in  E,  are  ^  a, 
and  </3  +  1.    If  ers  denote  the  set  which  is  common  to  erw,  es(z],  we  have 

r=ft  s=P  n=2/3 

2  rem(er(1))+  2 


where  En  denotes  that  set  of  points  in  which  ne  =yi  (x)+f2(x)  <  (n+  2)e. 

s=ft  r=P 

This  follows  from  the  fact  that  er{l)  =  2  erg;  and  es(2)  =  2  ers.    Making  e  con- 

S=a  r=a 

verge  to  zero,  we  now  see  that    2   n.  2em  (En)  converges  to 

n=2a 


It  thus  follows  that  this  integral  is  the  sum  of  I      yi  (,-r)  cfo?  and  /      /2  (x)  dx. 

J  (E)  J  (E 

Next,  let  one,  or  both,  of  the  functions  yi  (x),  fz  (x)  be  unbounded  in  E.  Let 
EN  be  that  set  of  jpoints  in  which  2N  >fi  (x)  +/2  (x)  >  —  2N  ;  N  >/j  (x)  >  —  N; 
N  >/8  (a:)>  —  N  ;  then  m  (^v)  ~  771  (£'),  as  N  ~  oo  .  We  have  then 


r  r  r 

l/i  O)  +/2  (^)l  cte  «         /  (as)  dx+  \       f2  (x)  dx  ; 
•'  (Erf  J  (Ey)  J  (EX) 

and  since  by  the  theorem  of  §  392,  the  integrals  of  yi  (x)  +  f2  (x),  fv  (x},  /2(«)' 
taken  over  the  set  E—  EN,  all  converge  to  zero,  as  ^V^  x  ,  and  m  (E  —  EN}  ~  0, 
we  see  that 

f      {  /i  (*0  +/.  (*)}  d*  =  f      /,  (ar)  dar  +  f      /2  (a;)  dar. 

J  (E)  J  (E)  J  (E) 

396.  If  [f(x)Y  be  summable  in  E,  so  also  is  /(#),  but  the  converse  is  not 
necessarily  true.  For  {/(#)}*  >[/(«)  I,  if  j/(a?)  I  >1  ;  and  therefore  \f(x)\  is 
summable  over  that  part  of  E  in  each  point  of  which  it  is  >  1,  since  {/(#')t2  is 
summable  over  that  part.  It  follows  that  f(x)  ,  and  consequently  /(a;),  is 
summable  over  E. 
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If  [/i(X)]2>  [./siC^)]2  are  both  summable  over  E,  so  also  is  yi  (#)/2  (#)  5  for 
I  /i  (#)/2  (#)  |  =  i  [{/]  (a)}2  +  {/2  (#)}2L  and  therefore  /i  (#)/2  (a;)  |  is  summable 
over  E, 


It  can  be  the  case  that  f1(x}f2(x)  is  summable  over  E,  but  not  {/i(#0}2 


or 


Assuming  that  {/i(#)}2,  {/2(X)}2  are  both  summable  over  E,  we  have 

fl(x)f2(x)\dx 

^  (as)}*  doe  +  $  f      [f.2(x)Ydx     (1). 


(E) 


Also  {X  ]/!  (#)  |  +  fj,  f2(x)  |]2  is  summable  over  E,  as  it  is  the  sum  of  three 
summable  functions  ;  \,  p  denoting  constants. 

Since 


is  essentially  positive,  we  see  that 


f  f 

<  if  /W,2  r7r  I        f  r   ('r\\'2r]'r  C-)\ 

~-     I  |   / 1  \'^/  j      l*w    I  1 ./  2  \     /  J     t^***       •  ••••*••  »l*iiyi 

J  (E)  J  (E) 

The  inequalities  (1)  and  (2)  are  of  considerable  use  in  questions  connected 
with  the  convergence  of  series.  The  inequality  (2),  being  a  generalization  of 
a  theorem  due  to  Schwarz,  is  known  as  Schwarz's  inequality. 

397.  If  /!  (x)  be  summable,  but  not  equivalent  to  a  bounded  function,  it 
is  always  possible  to  determine  a  summable  function  fz  (x)  so  that  /i  (x)f2  (x)  is 
not  summable*. 

If  a\,  a,2,  ...  an,  ...  be  a  set  of  positive  increasing  numbers  without  an 
upper  limit,  and  en  denote  the  set  of  points  at  which  an  ^  \f^  (x)   <  an+1 ;  then 
m(en)  >  0,  for  an  infinite  number  of  values  of  n.    The  series  2  anm  (en)  is  con- 
vergent, being  ^  I      \f1(x)\dx;  let  j/2  (x) !  =  —     — — -,  for  all  points  of  en,  for 
J  (E)  nanm  \en) 

r  i 

each  value  of  n.    We  have  then  I      \fz  (.*•)  |  dx  =  S  —  —  ;  and  if  the  numbers  an 

j  (E)  nan 

be  properly  chosen  (for  example  an  =  n),  this  series  is  convergent,  and  /2(#)  is 

summable.     But  the  series  ]£  —     — -. — -anm(en)  being  divergent, 

nanm  \en) 

MX)  MX)  dx 

(E) 

has  not  a  finite  value,  and  therefore  f2  (x)  is  a  function  such  as  is  required. 
*  See  Lebesgue,  Annales  de  Toulouse  (3),  vol.  i,  p.  38. 
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398.  If  fn  (x)  denotes  a  sequence  of  measurable  functions,  defined  in  the 
interval,  or  cell,  (a,  b),  and  such  that  \fn(x)  |  **,  for  every  value  of  n,  and  for 
every  point  x,  in  the  interval,  or  cell,  (a,  b),  less  than  a  fixed  positive  number  A, 
then  if  [fn(x)}  converges  for  each  value  of  x,  to  the  value  of  the  Junction  f(x), 

fb  rb 

that  function  is  summable,  and  I    f(x)dx=  Km  /   fn(x)dx. 

J  a  n~oo  J  a 

Since,  for  any  fixed  value  of  x,  \f(x)  —fn  (x)  \  <  e,  provided  n  be  sufficiently 
great,  we  see  that  \f(x)\<  fn  (x)  \  +  e  <  A  +  e;  and  therefore  since  e  is 
arbitrary,  \f(x)\^A.  Therefore  the  function  f(x)  is  bounded.  Let  en 
denote  that  measurable  set  of  points  of  (a,  b)  at  which  fn  (x)  >  c  ;  then  the 
set  e,  of  points  at  which  f(x)  >  c,  is  such  that  each  point  of  e  belongs  to  all 
the  sets  {en},  from  and  after  some  value  of  n  dependent  on  x,  and  therefore, 
employing  a  theorem  given  in  §  131,  the  set  e  is  measurable.  It  follows  that 
f(x)  is  a  measurable  function,  as  c  can  be  arbitrarily  chosen  ;  and  since  f(x) 
is  bounded,  it  is  summable  in  (a,  b). 

Let  gn  denote  the  set  of  points  x,  at  which  \f(x}  —fn  (x)  >  e  ;  the  set  gn 
is  measurable,  and  lim  m  (gn)  =  0.  For  if  this  is  not  the  case,  there  are  an 

»~00 

indefinitely  great  number  of  values  of  n  for  which  m  (gn)  is  greater  than 
some  fixed  positive  number  a.  In  accordance  with  the  theorem  of  §  136, 
there  exists  a  set  of  points,  of  measure  =  a,  such  that  each  point  belongs  to  an 
infinite  number  of  the  sets  gn.  This  is  inconsistent  with  the  condition  that 
fn(x}  converges  to  f(x\  for  each  value  of  x.  It  has  thus  been  shewn  that 
m  (dn)  ~  0,  as  n  ~  oc  . 
Now 


I     I/O)  -fn  (x)}  dx  =  I     {/(«)  -/„  (as)}  dx  +  \f(x)  -fn  (x)} 

J  a  J  »„  J  C!(n  \ 


On  J  C(f/n) 


The  first  integral  on  the  right-hand  side  is  numerically  less  than  2Am  (gn), 
and  the  second  integral  is  numerically  less  than  d,  where  /  is  the  measure  of 
the  fundamental  cell.  The  integral  on  the  left-hand  side  is  arbitrarily  small, 
for  all  sufficiently  large  values  of  n  ;  and  thus  the  theorem  is  established. 

It  may  be  observed  that,  in  case  there  is  an  exceptional  set  of  points,  of 
measure  zero,  at  which  the  sequence  [fn  (x)}  does  not  converge,  the  above 
theorem  still  holds.  For,  in  the  above  proof,  we  can  disregard  this  set  ;  the 
integrations  being  all  taken  over  the  complements  of  this  set,  the  measures  of 
the  sets  over  which  they  are  taken  are  unaffected. 

399.  The  following  theorem  is  concerned  with  the  Z-integral  of  a  function 
which  is,  at  almost  every  point  of  its  domain,  the  limit  of  a  monotone 
sequence  of  measurable  functions. 

Let  {fn(x}}  be  a  sequence  of  non-negative  bounded  functions,  summable  in 
the  measurable  set  of  points  E,  in  any  number  of  dimensions,  and  let  it  be 
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assumedthat,  for  eachvalue  of 'x, in E,  the sequenceis monotone and  non-diminishing. 


Let  it  be  further  assumed  that  lim         /„  (x)  dx  has  a  definite  value.    Then, 

»~o>   '  (E) 

(1)  the  points  of  E  at  which  the  sequence  {fn(%)}  does  not  converge  to  a  definite 
number  form  a  set  of  points  of  measure  zero ;  and  (2),  if  f(x}  denote  the 

limit  of  the  sequence  {fn  (#)},  the  integral  I     f(%)  dx  exists,  and  has  the  value 

J  (E) 

lim  I      fn(x)dx;  those  points  of  E  at  which  f  (SB)  is  undefined  being  disre- 

n~x>  J  (E) 

f 
garded.    Conversely   (3),   if  \      f(x)dx   exists   as   a   definite   number,   then 

J  (E) 
f 

lim         fn(x)dx  exists,  and  has  the  same  value  as  the  integral. 

n~<x>  J  (E) 

The  first  two  parts  of  this  theorem  were  given*  substantially  by  Vitali, 
and  byf  B.  Levi;  the  proof  here  given  was  published*  by  Hobson.  Let  k 
denote  a  positive  number,  and  let  the  functions  fn(k}  (x)  be  defined  by 
fn(k}  (#)  —fn  (#).  for  values  of  x  for  which  /„  (x)  £  k,  and  by  /„<*'  (x)  =  k,  for 
values  of  x  for  which  fn(x)>k.  Let  f(k)(x)  be  a  function  such  that 
/(*><V)=/(V),  when  f(x)^k,  and  f^(x)  =  k,  when  f(x)>k.  For  each 
value  of  x,  fn(k]  (x}  defines  a  monotone  double  sequence,  that  is  a  monotone 
function  of  n  and  k,  if  n  have  the  values  1,  2,  3, ...,  and  k  have  the  values  in 
any  monotone  increasing  sequence  of  numbers  without  an  upper  limit.  It  is 

clear  that  I      fn(k]  (x)  dx,  regarded  as  dependent  on  n  and  k,  defines  a  monotone 

J(E) 

double  sequence  of  non-diminishing  numbers.    From  a  property  of  such  double 
sequences  (see  §  388),  we  infer  that  the  two  repeated  limits 

lim  lim   I      fn(k](x}dx,     lim  lim   I      fn(k]  (x}dx 

k~ao  n~x  J  (j!?)  n~x  k~<x>  J  (E) 

are  such  as  to  have  the  same  finite  value,  if  either  of  them  is  finite. 

Since  fn{k}  (x)  is  bounded,  for  all  values  of  &  and  x,  we  have  (see  §398), 

f  f 

lim  I      fn(k](x)dx=\      fn(x)dx, 

k~<*>  J  (E)  J  (E) 

f  f 

and  therefore  lim  lim         fn(k)  (x)  dx  =  lim  I      fn  (x)  dx,  provided  that  one  of 

»i~oo  fc~oo  J  (E)  n~<x>  J  (E) 

these  two  limits  is  assumed  to  exist. 

f  f 

Also,  we  have  lim         fn(k)  (x}  dx  =         /(fc>  (x)  dx,  since  fn(k)  (x}  is  bounded 

w~oo  .'  (E)  J  (E) 

for  all  values  of  n  and  x.    Therefore 

lim  lim  f     fn^(x}dx=  lim  f     f(k^(x}dx, 

k~x>n~x>J(E)  Jc~x  J  (E) 

provided  one  of  these  two  limits  exists. 

*  Rend,  di  Giro.  Mat.  di  Palermo,  vol.  xxni,  p.  137. 
t  Rend,  dell'  Istit.  Lombardo,  (2),  vol.  xxxix,  p.  775. 
t  Proc.  Land.  Math.  Soc.  (2),  vol.  vm,  p.  28. 
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Since /„<*'  (x)  is  summable,  so  also  is/(t)  (x),  and  therefore  also  lim  /'*'  (x), 

k~eo 

or/(#),  is  measurable.    In  case  lim  I      f(k)  (x)dx  is  finite,  its  value  defines 

f(x)dx. 


(E) 

First,  let  it  be  assumed  that  lim         fn  (x)  dx  exists.    If  et  be  the  set  of 

n~a=  J  (E) 

points  for  which  f(k)  (x)  =  k,  we  have 

f  f 

km  (etc)  <         f(k]  (x)  dx  <  lim         fn  (x)  dx. 

J  (E)  n~<*>  J  (E) 

The  set  of  points  at  which  f(x)  is  indefinitely  great  is  contained  in  all 
the  sets  e/c,  hence  its  measure  is 

1          r 
£  m  (ek)  <  j  lim  I      /„  (x)  dx. 

K  n~oo  J  (E) 

Therefore,  since  k  is  arbitrarily  great,  the  measure  of  the  set  of  points  at 
which  f(x)  =  oo  is  zero. 

Also        I     f(x)dx=\im!     f^(x)dx=\im!      fn(x)dx. 

J  (E)  k~x>J(E)  n~<x>  J  (E) 


Next,  if         f(x)dx,  or  lim  lim        fn(k](x)dx,  is  assumed  to  be  finite,  it  will 

J  (E)  k~x  n~<x>  J  (E) 

fn  (x)  dx  exists,  and 

[E) 

r 

has  the  value  of         f(x)dx. 
I  IF.\ 


•'  (E) 


THE    LIMITS    OF    A   SEQUENCE    OF    MEASURABLE    FUNCTIONS. 

400.  Let  /!  (x},  f2  (x),  ...fn  (x),  ...  be  a  sequence  of  measurable  functions 
defined  in  an  interval  (a,  6),  or  in  a  cell  of  any  number  of  dimensions.  Let 
vn  (x)  denote  that  function  which,  for  each  value  of  x,  has  for  its  value  the 
greatest  of  the  numbers /i  (x),  fz  (x),  ...  fn (x).  The  function  vn (x)  is  measur- 
able ;  for  the  set  of  points  at  which  A  <  vn  (x)  is  that  set  each  point  of  which 
belongs  to  one  or  more  of  the  measurable  sets  for  which 

A  </,  O),  A<f9  (x),  ...A<fn  (x). 

The  functions  v1(x),  v2(x],  ...  vn(x),  ...  form  a  sequence  which,  for  any 
value  of  x,  gives  a  monotone  non-decreasing  set  of  numbers  ;  let  w1  (x)  denote 
its  limit.  The  value  of  iol  (x)  is  finite,  or  infinite,  for  each  value  of  x.  Let 
the  function  wn  (x)  be  formed  in  the  same  manner  from  the  sequence 
vn  (x),  vn+l  (x),  ...,  obtained  by  leaving  out  the  first  n  —  1  functions  in  the 
sequence  v1  (x\  v2  (x), ....  The  sets  u\  (x),  w.2  (x), . . .  are  all  measurable ;  for 
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the  set  of  points  x  at  which  wl  (x)  >  A  is  the  set  of  points  which  belong  to  an 
infinite  number  of  the  measurable  sets  for  which 

v-i  (x)  >  A,  vz  (x)  >  A,  . . .  vn  (x)  >  A , . . . ; 

hence  w-±  (x)  is  a  measurable  set,  and  similarly  wn  (x)  is  measurable  for  each 
value  of  n.  The  functions  w1(x),  w.2(x),  ...  define  a  monotone  non-increasing 
sequence  of  numbers  for  each  value  of  x.  The  limit  of  this  sequence  isf(x), 
where  f(x)  denotes  lim  /„  (x).  As  before,  this  function  f(x),  being  the  limit 

7l~QO 

of  a  monotone  sequence  of  measurable  functions,  is  itself  measurable.  In  a 
similar  manner  it  can  be  shewn  that  f(x)  =  lim  /„  (x)  is  measurable.  It  has 

«~00 

thus  been  shewn  that : 

If  fi  (x),  f2  (x},  ...  be  a  sequence  of  measurable  functions  of  one  or  more 
variables,  defined  in  an  interval,  or  a  cell,  (a,  b),  the  functions  f  (x),  f(x)  are 
measurable,  where  f(x),  f(x)  denote,  for  each  value  of  x,  the  upper  and  lower 
limits  (finite  or  infinite]  of  the  sequence  of  numbers  [fn  (x)}. 

If,  instead  of  the  upper  and  lower  limits  of  {/(#)),  their  upper  and  lower 
boundaries  are  taken,  we  have  the  corresponding  theorem  : 

If  [fn  (x)}  is  a  sequence  of  measurable  functions  of  one  or  more  variables, 
the  functions  U (x),  L(x)  are  measurable;  where  U (x),  L(x)  denote,  for  each 
value  of  x,  the  upper  and  the  lower  boundaries  of  the  numbers  [fn  (x)}. 

The  set  of  points  for  which  U  (x)  >  A,  is  that  set  which  consists  of  the 
points  that  belong  to  one  or  more  of  the  sets  for  which  /i  (x)  >  A,fz  (x)  >  A,  — 
It  follows  that  the  set  is  measurable,  and  consequently  that  U  (x)  is  a 
measurable  function.  Similarly  it  can  be  shewn  that  L  (x)  is  a  measurable 
function. 

THE   DERIVATIVES   OF   A    FUNCTION. 

401.  If  (f)  (x}  be  continuous  in  the  linear  interval  (a,  b),  the  four  derivatives 
D+  0  (x),  D+  <f)  (x),  D~  (f)  (x),  D_<f>  (x)  (finite  or  infinite)  are  measurable  functions 
of  x.  In  particular,  if  </>  (x)  have  almost  everywhere  a  differential  coefficient 
(f)'  (x),  then  <j>' (x}  is  a  measurable  function  of  x. 

We  shall  consider  the  case  of  D+  <f>  (x)  and  D+  (j>  (x) ;  the  case  of  the  other 
functions  being  then  deducible. 

D+  (f)  (x)  is  the  upper  limit  lim  /  (x,  x  +  h),  when  h  >  0,  and  /  (x,  x  +  h) 

h~D 

denotes  the  incrementary  ratio,  {<f>(x  +  h)  —  (j)(x}}/h.  It  will  be  shewn  that  a 
sequence  of  positive  values  of  h,  converging  steadily  to  zero,  can  be  deter- 
mined, such  that  the  upper  and  lower  limits  of  /  (x,  x  +  h),  when  h  has 
successively  the  values  in  the  sequence,  are  for  every  value  of  x  the  same  as 
when  h  is  not  restricted  to  have  such  values.  Let  V,  h-z>  •••  h-n,  •••  be  a 
sequence  of  diminishing  positive  numbers,  converging  to  zero,  and  let 
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€lt  €2,  ...  fn, ...  be  another  such  sequence.  Since  I (x,  x  +  h)  is  continuous 
with  respect  to  (x,  A),  for  all  values  of  x  in  (a,  6),  and  for  all  values  of  h  that 
are  >  0,  it  follows  from  the  uniformity  of  convergence  of  I  (x,  x  +  h),  when  h 
is  confined  to  the  interval  (tin+l,  h'n),  that  this  interval  can  be  divided  into  a 
definite  number  rn  of  parts,  such  that  |  I(x,  x  -f  h)  —  I  (x,  h')  \  <  en,  for  every 
value  of  x,  provided  h  and  h'  both  lie  in  one  and  the  same  part  of  the  interval 
(h'n+l,  h'n}.  Let  this  sub-division  of  (hfn+1,  A'n)  be  made  for  each  value  of  n, 
and  let  Alt  A2,  h3, ...  denote  the  end-points  of  all  the  parts  of  all  the  intervals. 
The  sequence  hlt  h2>  hA,  ...  converges  to  zero,  and  it  is  a  sequence  such  as 
satisfies  the  required  condition;  for  we  have  I  (x,  x  +  hm)  —  I(x,  A)]<e«, 
provided  hm  £  h  ^  hm+l ;  the  integer  s  being  determinate,  corresponding  to 
each  value  of  m.  It  follows  that  the  upper  and  lower  limits  of  the  sequence 
I(x,x  +  A^,  I(x,x  +  h.2), ...  I (x,x  +  hm), . . .  are  identical  with  those  of  any  other 
sequence  /  (x,  x  +  A/),  /  (x,  x  +  A./),  ...  I  (x,  x  +  hm'), ...,  when 

h^hj^ht,  h,^h2'^h3,  ...; 

and  generally  hm  =  Am'  =  hm+l ;  and  this  is  the  case  for  every  value  of  x  in 
(a,  b).  Therefore  the  sequence  {hn}  has  the  required  property. 

If  we  identify  the  functions  fn  (x),  of  §  400,  with  the  functions  /  (x,  x  +  hn), 
where  the  sequence  hn  is  formed  as  indicated  above,  we  see  that  the  two 
derivatives  D+(f>(x),  D+(f>(x)  are  measurable  functions,  since  the  functions 
{(j>  (x  +  hn)  —  (f)  (o?)}/An  are  measurable  for  each  value  of  n. 

402.  If  E  be  a  set  of  points  in  the  interval  (a,  b),  at  which  Df(x),  one  of 
the  derivatives  of  a  function  f(x),  continuous  in  (a,  b),  has  a  fixed  sign  (and  is 
not  zero),  the  set  E,  when  it  exists,  is  unenumerable,  and  contains  a  perfect  set. 

It  will  be  sufficient  to  consider  the  set  of  points  E,  at  which  D+f(x)  <  0 ; 
the  other  cases  may  be  treated  in  a  similar  manner. 

If  a  be  a  point  of  E,  another  point  /3  ( >  a)  can  be  determined  such  that 
/(/3)—  /(«)  has  a  negative  value  —  X. 

Let  77  be  a  fixed  positive  number  such  that  77  (ft  —  a)  =  X3  <  X,  and  let  k  be 
a  positive  number  such  that  Xx  +  k  <  X ;  we  shall  consider  the  continuous 
function  <f>  (x)  =f(x)  —/(a)  +  f]  (%  —  a)  +  k,  in  the  interval  (a,  /3),  of  x.  Since 
(f>  (a)  is  positive,  and  0  (/3)  is  negative,  </>  (x)  must  have  the  value  0  at  one  or 
more  points  within  (a,  /3)  ;  let  £  be  the  point  nearest  to  /3  at  which  this  is 
the  case.  Since  <£  (£)  =  0,  and  (f)  (/3)  is  negative,  <£  (x)  is  negative  for  all 
values  of  x  such  that  £  <  x  ^  @.  Since  <f)(x)  —  <f>  (% )  is  negative,  it  follows 
that  D+  (f)  (|)  ^  0,  whence  we  have  D+f^)  ^  —  ?j  <  0,  and  therefore  £  belongs 
to  E. 

The  number  X,  being  kept  fixed,  k  may  vary  continuously  within  the 
interval  (0,  X  —  Xj) ;  and,  to  each  such  value  of  k,  there  corresponds  a  single 
value  of  f,  which  belongs  to  E.  It  follows  that  the  set  E  contains  a  single 
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point  corresponding  to  each  value  of  k  within  the  interval  (0,  X  —  \).  There- 
fore E  contains  an  unenumerable  set  of  points,  of  the  cardinal  number  of  the 
continuum,  and  thus  contains  a  perfect  set.  Accordingly  the  theorem*  has 
been  established. 

403.  Iff  (x)  be  a  function  defined  in  the  interval  (a,  b),  and  E  be  the  set  of 
points  at  which  one  of  the  derivatives  Df(x)  has  a  fixed  sign  (and  is  not  zero), 
then,  if  Df(x)  is  finite  (bounded  or  unbounded)  at  every  point  of  E,  the  exterior 
measure  of  E  is  greater  than  zero. 

It  will  be  sufficient  to  consider  the  case  in  which  E  is  the  set  of  points  at 
which  D+f(x)  <  0 ;  it  being  assumed  that,  at  no  point  of  Et  is  D+f(x)  =  —  oo  . 
Let  it  be  assumed,  if  possible,  that  me  (E)  =  0.  Let  En  be  that  part  of  E,  in 
the  points  of  which  n  —  1  ^  —  D+f(x)  <  n,  then  me  (En)  =  0.  The  set  En  may 
be  enclosed  in  a  set  An,  of  non-overlapping  intervals,  of  total  measure  en ;  and 
this  may  be  done  for  all  the  sets  En(n=  1,  2,  3,  ...).  The  numbers  en  can  be 
so  chosen  that  S  nen  converges  to  a  value  which  is  less  than  an  arbitrarily 

n=l 

chosen  number  e.  Let  Aw  (x)  denote  that  part  of  An  which  is  in  the  interval 
(a,  x),  and  let  <£ (x)  =  m  { Ax  (x)}  +  2m  { A2  (x)}  +  ...  +  nm  { An  (x)}  +  . . . ;  the 
function  <£  (x)  being  consequently  <  e,  for  all  values  of  x. 

Consider  the  function  i/r  (x)  =f(x)  -f  <£  (x).  Since  (f>(x)  is  monotone  and 
non-diminishing,  we  have  D+-fy~  (x)  ^  D+f(x),  at  all  points  of  C  (E).  At  every 
point  x,  of  E,  if  x  belongs  to  En,  the  increment  <£  (x  +  h)  —  $  (x),  for  sufficiently 
small  values  of  h,  is  at  least  nh ;  hence  D+^r  (x)^n  +  D+  <j>  (x)  >  0.  It  has 
thus  been  shewn  that  ty  (x)  is  monotone  and  non-diminishing  in  (a,  b) ;  thus 
•\Jr  (ft)  —  ^  (a)  ^  0,  for  any  two  points  a,  ft,  when  ft  >  a.  Since  <£  (ft)  —  </>  (a)  <  e, 
it  follows  that  f(ft)  —  /(«)  >  —  e.  But  if  a  is  a  point  of  E,  a  point  ft  can  be  so 
chosen  that  f(ft)  — /(«)  is  negative,  and  has  a  value  —k;  and  thus  k  <  e. 
Since  e  can  be  so  chosen  as  to  be  less  than  k,  the  hypothesis  that  me  (E)  =  0 
has  been  shewn  to  lead  to  contradiction.  It  has  therefore  been  shewn  that 
me(E)  >  0.  In  case  f(x)  is  continuous,  since  E  is  then  certainly  measurable, 
its  measure  must  be  >  0. 

INDEFINITE    INTEGRALS. 

404.  Iff(x)  be  summable,  whether  bounded  or  not,  in  the  linear  interval 
(a,  b),  it  is  also  summable  in  the  interval  (a,  x),  where  a  £  x  ^  b.  The  integral 

[x 
f(x)  dx  may  be  regarded  either  as  a  function  of  the  upper  limit,  or  as  a 

J  a 

function  of  the  set  of  points  which  constitute  the  interval  (a,  x). 

*  See  de  la  Vallee  Poussin's  Cours  d1  Analyse,  2nd  ed.,  vol.  i,  p.  80.  The  proof  there  given 
corresponds  to  the  case  ?7  =  0 ;  but  the  conclusion  there  drawn  that  £>+<£  (£)  <  0,  is  incorrect.  This  is 
remedied  above  by  the  introduction  of  the  positive  number  tj. 
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If  F(x)  be  a  function  defined  in  (a,  x},  and  such  that 
F(x)-F(a)=r f(x)dx, 


it  is  called  the  indefinite  integral  of  f(x),  and  is  determinate,  when  f(x}  is 
defined,  except  for  an  additive  constant. 

An  indefinite  integral  is  continuous,  and  of  bounded  total  variation,  in  the 
interval  for  which  it  is  defined. 

rx  +  h 

Since  F(x  +  h)-  F(x}  =         f(x}  dx, 

J  X 

F  (x}  -F(x-h)=r    f(x)  dx, 

J  x-h 

and  since  the  integrals  on  the  right-hand  side  converge,  in  accordance  with 
the  theorem  of  §392,  to  zero,  as  h  -*/  0,  it  follows  that  F(x)  is  continuous  at 
every  point  x,  of  (a,  b).  Again,  if  (a,  b)  be  divided  into  any  number  n  of 
parts. (xr_i )  xr},  where  x0  =  a,  xn  =  b,  we  have 


r=l 
r=n 


f(x)  dx 

L 

dx 


b\f(x}\dx. 

& 

Since  this  inequality  holds  for  every  possible  set  of  sub-division  of  (a,  by 
into  parts,  it  follows  that  F(x}  is  of  bounded  total  variation.  This  may  also 
be  proved  by  utilizing  the  fact  that  a  summable  function  f(x}  is  the  difference 
of  two  non-negative  summable  functions  /i  (x),  fz  (x}.  Then  F  (x)  is  the 
difference  of  the  two  monotone  functions,  Fl  (x),  Fz  (x},  which  are  the  indefinite 
integrals  of /i  (#), /2  (#).  Then  by  the  theorem  of  §244,  it  follows  that  F  (x)  is 
of  bounded  total  variation. 

405.  The  indefinite  integral  F  (x}  has  a  finite  differential  coefficient  almost 
everywhere  in  (a,  b). 

Since  F(x)  is  of  bounded  variation,  it  is  expressible  as  the  difference  of 
two  monotone  functions,  and  in  accordance  with  the  theorem  of  §  298,  it  con- 
sequently has  a  finite  differential  coefficient  almost  everywhere  in  (a,  b). 

This  theorem  is  included  in  the  following  more  general  theorem*: 

If  F  (x)  be  the  indefinite  integral  of  a  summable  function  f(x],  defined  in 
(a,b),  F(x)  has,  almost  everywhere  in  (a,b),  a  differential  coefficient  equal  to 

/(»> 

The  theorem  will  first  be  proved  in  the  case  in  which/(V)  is  bounded  in  (a,  b). 
*  See  Lebesgue,  Lemons  sur  I' integration,  p.  124. 
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It  has  been  shewn  in  §  140  that,  if  e  be  any  measurable  set  of  points 
in  (a,  b),  the  metric  density  of  e  is  1,  almost  everywhere  in  (a,  b),  and  is  0, 
almost  everywhere  in  C  (e).  Let  e  (x)  denote  that  part  of  e  which  is  in  the 
interval  (a,  x),  where  x  is  any  point  in  (a,  b).  It  follows  from  the  property  of 

e  just  quoted  that  -,—  e  (x)  exists,  and  has  the  value  1,  at  almost  all  points  of  e, 

\JLOC 

and  that  it  has  the  value  0,  at  almost  all  points  of  C  (e). 

If  L,  U  denote  the  lower  and  upper  boundaries  of  f(x)  in  (a,  b),  let 
(L,  U)  be  divided  into  parts  (a0,  a^,  (a1}  a2\  .  .  .  (a^  ,  an),  where  a0  =  L,  an  =  U, 
and  where  ar  —  ar_j  <  e,  for  all  values  of  r.  Let  er  be  the  set  of  points  at  which 
ar-  i=/(X)<  a>r>  except  that  en  is  the  set  at  which  an^  ^f(x)  £  an\  and  let 
er(x)  be  the  part  of  er  in  (a,  x).  Let  fa(x)  =  ar_1;  at  every  point  of  er,  for  all 
values  of  r,  and  let  fa  (x)  =  ar,  at  each  point  of  er,  for  all  values  of  r. 

rx  rx 

Also  let  F1  (x)  =  I    fa  («)  dx,     F2  (x)  =      fa  (x)  dx  ; 

J  a  J  a 

then 

F1  (x)  =  a0  mel  (x)  +  a1me2(x)  +  ...  +  an^  men  (x), 

F.2  (x)  =  a^  me1  (x}  +  «2  m  e3  (x)  +  ...  +  an  men  (x). 

The  function  Fl  (x}  has  a  differential  coefficient  equal  to  <^l  (x)  almost 
everywhere  in  (a,  b).  For  m  {er  (x)}  has  a  differential  coefficient  almost  every- 
where equal  to  1,  or  0,  according  as  x  belongs,  or  does  not  belong,  to  er. 
Similarly  F2  (x)  has  a  differential  coefficient  equal  to  fa2  (#)  almost  everywhere. 

Also 

Fi(iK)  <  F(x  +  h)-F(x}     F2  (x  +  h)  -  F2  (x) 


h  ~h~  ~~h~ 

for  positive  and  for  negative  values  of  x,  since  Fl  (x)  cannot  increase  more 
than  F  (x),  and  F  (x)  cannot  increase  more  than  F2  (x),  as  x  increasgs. 

Therefore  the  four  derivatives  of  F(x)  all  lie  between  (f>i(x),  </>2(X)  inclu- 
sively, at  each  point  x  which  does  not  belong  to  that  set,  of  measure  zero,  at 
which  Fl  (x),  F2  (x)  do  not  have  differential  coefficients  equal  to  fa  (x),  (f)2  (#) 
respectively.  Now  fa  (x),  fa  (x)  differ  from  each  other,  and  from  f(x),  by  less 
than  e;  therefore,  almost  everywhere,  the  four  derivatives  of  F  (x)  differ  from 
one  another  by  less  than  e.  Taking  a  sequence  of  values  of  e,  which  converges 
to  zero,  we  see  that  F  (x)  has  *  differential  coefficient  equal  tof(x),  almost 
everywhere  in  (a,  b). 

The  theorem  has  now  been  established  for  the  case  of  the  indefinite  in- 
tegral of  a  bounded  function. 

In  order  to  prove  the  theorem  for  the  case  in  which  f(x)  is  unbounded, 
the  following  Lemma*  will  be  required: 

*  See  de  la  Vallee  Poussin,  Cours  d'  Analyse,  2nd  ed.,  vol.  i,  p.  266. 
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If  4>(x)  be  a  continuous  monotone  non-diminishing  function,  <fi  (x)  is  sum- 

mable  in  (a,  b),  and 

rb 
</>'  (x)  dx^(f>(b)-(f>  (a). 


f 

j  u 


The  fact  that  $'  (x)  does  not  exist,  or  is  not  finite,  at  points  of  a  set  of 
measure  zero,  has  no  effect  on  the  integral,  which  is  taken  over  that  set  of 
points  at  which  <£'  (x)  exists,  and  is  finite. 

Let  Xjv(#)  ~  $  (*)>  f°r  values  °f  x  such  that  </>'  (x)  £  N,  and  XN(X}  —  N, 
when  <j)'  (x)  >  N.  In  accordance  with  the  result  obtained  in  §401,  <j>'  (x)  and 

rx 
XN(X}  ai%e  measurable  functions.    The  function  I    %N  (x)  dx,  being  an  indefi- 

•1  a 

nite  integral  of  a  bounded  function,  has  a  differential  coefficient,  equal  to 

XN  (x),  almost  everywhere  in  (a,  b).   At  all  points  of  (a,  b)  its  derivatives  are 

rb 

all  £  N.   Let  the  points  of  the  set  G,  at  which  I    ^N  0&)  dx  does  not  possess 

J  a 

a  differential  coefficient  equal  to  %#•(*'),  be  enclosed  in  a  set  of  intervals  A,  of 
measure  e/N,  and  let  A^  be  the  part  of  A  in  the  segment  (a,  x). 

We  shall  consider  the  function  <£  (x)  +  Nm  (A^)  =  ty  (x).  In  G,  we  have 
D^r(x)'^N]  and  at  points  not  in  G,  D-^r  (x)  ^  D(f>(x)  ^  XN(%\  where  D 

denotes  any  one  of  the  four  derivatives.   The  increase  of  the  function  -fy  (x), 

rx 
with  x,  is  not  less  than  that  of  I    p£y  (#)<&»;  therefore 

J  a 


rb 
It  follows  that      ^jy  (x)  dx  is  bounded,  for  all  values  of  N,  and  it  is  also 

•  a 

greater  than  Nm(Hj^),  where  Hy  is  that  set  of  points  at  which  D$(x}  >  N. 
We  infer  that  lim  m  (HN)  =  0.    If  EN  is  the  set  complementary  to  HN,  we 

JV~30 

have 

f6 
cf)  (x)  dx  ^       Y  v  (x)  doc£d>(b)-d>  (a)  +  e. 


If  N~  co  ,  we  see  that      D<f>(x}  dx  is  finite,  and  ^  0  (b)  —  cf)  (a). 

.  a 

From  this  Lemma  we  deduce  that: 

If  (f>  (x)  be  a  continuous  function,  of  bounded  variation  in  (a,  b),  tlien  Dcf)  (.r) 
is  summable,  when  taken  over  the  set  of  points  of  (a,  b)  at  which  it  is  finite;  Dcf)  (x) 
denoting  any  one  of  the  four  derivatives  of  (f>(x).  The  integral  of  D  (f)  (x)  has 
the  same  value  for  all  the  four  derivatives. 

For  <£  (x)  is  the  difference  of  two  monotone  non-diminishing  functions 
(/>!  (x),  </>2  (x).  Since  <£/  (x\  (/>2'  (x)  exist  almost  everywhere  in  (a,  b),  and  are 
summable  over  the  set  of  points  at  which  they  exist,  we  see  that  <£'  (x)  exists 
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almost  everywhere,  and  is  summable.    The  theorem  follows  from  the  fact  that 
(x)  =  (f>'  (x),  almost  everywhere  in  (a,  b);  thus 

=\  D<f>(x)dx. 

J  a 

We  are  now  in  a  position  to  prove  the  general  theorem,  when  f(x)  is 
unbounded. 

Consider  the  function  fx(x)  which  =  f(x),  when  f(x)£  N,  and  is  equal 
to  N,  when/(#)  >N;  it  being  assumed  that  f(x)  is  a  non-negative  function. 


Since      f(x)dx  and      f'N(x)dx  are  monotone  functions,  they  both  have 

.'a  J  a 

fx 

finite   differential  coefficients    almost   everywhere,  and  that  of  I  fN(x}dx 

J  a 

rx 
clearly  does  not  exceed  that  of  I  f(x)dx.     Since  fy(x)  is  bounded,  the 

J  a 

rx 
differential  coefficient  of  I  fN(x)dx  is/jv(a?),  almost  everywhere;  therefore 

J  a 

fx  (x}  £  F'  (x},  almost  everywhere.  Since  N  is  arbitrarily  great,  it  follows  that 
f(x)  =  F'  (x),  almost  everywhere. 

(b  fb 

Now  I   /(a?)  dx  —  F  (b)  —  F  (a)  ^  \    F'  (x)  dx,  as  is  seen  by  employing  the 

J  a  J  a 

Lemma  proved  above;  but  since  f(x)  ^  F'  (x),  we  have 

f  f(a)dee3  \  F'  (x)  dx. 

•'a  J  a 

rb 

It  follows  that       [F'  (x)  -/(#)]  dx  =  Q;  and  since  F'  (as)  -f(x)  ^  0,  it  then 

J  a 

follows  that  F'  (x)=f(x),  almost  everywhere.  The  theorem  has  therefore  been 
established  for  a  non-negative  unbounded  function. 

Since  every  summable  function  is  the  difference  of  two  non-negative  sum- 
mable functions,  the  theorem  is  completely  established. 

It  is  easily  seen  that,  at  every  point  of  continuity  of  the  function  f(x), 

rx 

I   f(x)  dx  has  a  differential  coefficient  equal  to  f(x).    For  at  such  a  point,  if 

.'  a 

I,  u  be  the  lower  and  upper  boundaries  of  the  function  in  (x  —  h,  x  +  A), 
F(x  +  h)-F(x)  and  F(x)-F(ae-h) 

are  between  Ih  and  uh  inclusively,  and  since  I  and  u  both  converge  to  f(x\ 
as  h  ~  0,  it  follows  that  F(x)  has  as  its  differential  coefficient  f(x).  The  set 
of  points  at  which  F(x)  has  f(x)  for  its  differential  coefficient  consequently 
includes  all  points  of  continuity  off(x),  when  such  points  exist. 

The  important  theorem  here  established,  which  is  due  to  Lebesgue,  throws 
light  on  one  of  the  fundamental  questions  which  arise  as  to  the  reversibility 
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of  the  two  processes  of  integration  and  differentiation;  as  it  asserts  that  the 
process  of  integration  of  any  summable  function  can  be  reversed  by  differentia- 
tion, at  almost  all  points  of  the  interval  for  which  the  function  is  defined,  and 
certainly  at  every  point  of  continuity  of  the  function. 

406.  The  necessary  and  sufficient  condition  that  a  function,  defined  for  an 
interval,  may  be  an  indefinite  integral  is  that  it  should  be  absolutely  continuous 
in  the  interval. 

To  shew  that  the  condition  stated  in  the  theorem  is  necessary,  consider 


If  A  be  any  set  of  non-overlapping  intervals  such  that  m  (A)  <  rj,  we  also 
have  m  ( Aj)  <  77,  m  (A2)  <  77,  where  Aj  consists  of  those  intervals  of  A  for  which 
the  variation  of  f(x)  is  positive,  and  A2  those  for  which  it  is  negative. 

r 
The  integral         f(as)  dx  is  the  sum  of  those  variations  of  F  (x)  that  are 

•/(A,) 

•  •         r 

positive,  and  I      f  (x)  dx  the  sum  of  those  that  are  negative. 

J  (A,) 


Since 


f(x)dx 


<  e,  provided  m  (E)  <  77,  where  77  is  determined  when 


(E) 

e  is  given,  we  see  that  the  sums  of  the  positive  and  of  the  negative  variations 
in  the  intervals  of  A  are  <  e,  for  all  sets  of  intervals  A,  such  that  m  (A)  <  ?;. 
The  condition  of  absolute  continuity  (see  §218)  of  F  (x}  is  therefore  satisfied. 
To  prove  that  the  condition  stated  is  sufficient,  let  us  assume  that  it  is 
satisfied  by  F  (x);  we  shall  first  prove  that  F  (x)  is  of  bounded  variation  in 
(a,  b). 

If  the  function  F  (x)  is  not  of  bounded  variation  over  (a,  b),  a  system  of 
nets  can  be  fitted  on  to  (a,  b)  such  that  the  sum  of  the  absolute  variations  of 
F  (x)  in  the  meshes  of  Dn  increases  indefinitely  with  n.  There  must  be  at 
least  one  mesh  of  Dl}  say  dl)  such  that  the  total  variation  over  d^  is  indefinitely 
great.  Further,  dl  must  contain  at  least  one  mesh  d2,  of  D2,  which  has  the 
same  property;  and  so  on.  For  a  large  enough  value  of  n,  dn  is  less  than  an 
arbitrarily  chosen  positive  number.  In  dn  there  is  a  set  of  intervals  of  total 
measure  <  vj,  for  which  the  sum  of  the  absolute  variations  of  F  (x)  is  greater 
than  an  arbitrarily  chosen  positive  number  A.  Hence  a  set  of  non-overlapping 
intervals  of  total  measure  arbitrarily  small  exists  such  that  the  sum  of  the 
absolute  values  of  tho  differences  of  F  (x)  at  the  end-points  of  an  interval  is 
greater  than  A,  and  this  is  contrary  to  the  hypothesis  that  F  (x)  is  absolutely 
continuous.  Therefore  F(x)musi  be  of  bounded  variation  in  (a,  b).  Hence 
F  (x)  has  a  finite  differential  coefficient  f(x)  almost  everywhere.  It  follows 
from  the  Lemma  in  §405  that,  F(x)  being  the  difference  of  two  monotone 
H.  35 
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functions,   its    differential    coefficient  f(x)   is   summable    in   (a,  b).      Also 

f* 
f(x)dx,  which  has  thus  been  shewn  to  exist,  has  a  differential  coefficient 

J  a 

P  3C 

f(x),  almost  everywhere.  Therefore  F(x)  —  I   f(x)  dx  =  (f>  (x)  has  a  differential 

J  a 

coefficient  equal  to  zero,  except  in  a  set  G,  of  measure  zero.  Enclose  G  in 
non -overlapping  intervals  of  a  set  A,  where  m  (A)  <  e.  Each  point  a,  at  which 
<f)'  (x)  exists,  and  has  the  value  0,  is  such  that,  if  x  is  in  a  sufficiently  small 
neighbourhood  of  a,  </>(«)  —  </>  (a)  |  <  e  x  —  a.  \.  A  Lebesgue  chain  (see  § 78) 
may  be  defined  as  follows,  so  as  to  reach  from  a  to  b.  To  a  point  a,  of  G,  attach 
the  part  of  that  interval  of  A  that  is  on  the  right  of  a;  to  a  point  a,  not  belonging 
to  G,  attach  an  interval  (a,  o^)  such  that  |  <£  (x)  -  </>  (a)  <  e  (x  —  a),  if  a  <  x  <  c^. 
This  interval  may  be  defined  uniquely,  by  taking  (a,  o^)  to  be  the  maximum  of 
all  the  intervals  which  satisfy  the  condition.  There  is  one  and  only  one  chain 
from  a  to  x  (^  b)  composed  of  the  intervals  so  defined  for  every  point  a  such 
that  a  =  a  <  x. 

Now  |  <f>  (x)  —  <f>  (a)  |  cannot  exceed  the  sum  of  the  absolute  values  of  the 
differences  of  the  functional  values  at  the  ends  of  an  interval  of  the  chain 
from  a  to  x.  To  this  sum  those  of  the  intervals  that  are  parts  of  the  intervals 
of  A  contribute  a  part  that  is  <  77,  dependent  on  e,  since  <f>(x)  is  absolutely 
continuous.  The  other  intervals  of  the  chain  contribute  a  part  <  e  (b  —  a). 
Hence  |  <£  (x)  —  $  (a)  |  <  77  -f  e  (b  —  a);  and  since  e  is  arbitrarily  small  and  T;,  e 
converge  together  to  zero,  we  have  <f>  (x)  =  <£  (a). 

•  x 

Therefore  F(x)  =  F(a)  -f      f(x)  dx ;  and  thus  F(x}  is  an  indefinite  integral. 

-   a 

The  following  theorem  has  been  proved: 

rx 
If  F  (x)  be  an  indefinite  integral  in  (a,  b),  then  F(x)  —  F(a)  =      F  (x)  dx; 

.'  a 

the  set  of  points  at  which  F'  (x)  may  not  exist  being  ignored. 

407.  The  necessary  and  sufficient  conditions  that  F  (x)  should  be  an  in- 
definite integral  have  been  stated  by  Lebesgue  in  another  form: 

The  necessary  and  sufficient  conditions  that  F  (x}  should  be  an  indefinite 
integral  in  (a,  b)  are  that,  (1)  it  should  be  of  bounded  total  variation,  and  (2), 
the  total  variation  over  any  set  of  points  of  measure  zero  shoidd  be  zero. 

By  the  total  variation  of  a  continuous  function  F(x)  over  any  set  E  is 
meant  the  limit  of  the  sum  of  the  absolute  variations  of  F  (x)  in  a  set  of 
non-overlapping  intervals  A  which  contains  E,  as  m(&)~m(E),  whenever 
such  limit  exists. 

It  has  been  shewn  that,  if  the  condition  of  the  former  theorem  is  satisfied, 
the  condition  (1)  is  satisfied.  Also  the  condition  of  absolute  continuity  ensures 
not  only  that  the  condition  (2)  is  satisfied,  but  also  that  it  is  satisfied  uniformly 
for  all  sets  of  points  of  measure  zero.  Thus  the  conditions  (1)  and  (2)  are  necessary. 
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That  the  conditions  are  sufficient  is  shewn  by  the  proof  in  §406,  if  we  apply 
(2)  to  the  set  G,  in  which  </>  (x)  has  not  a  differential  coefficient  equal  to  zero. 
The  above  theorem  may  also  be  stated  in  the  following*  form : 

If  x  =f(t),  where  f(t)  is  a  continuous  monotone  function  of  t,  and  a  =/(£0), 

rt 
the  necessary  and  sufficient  condition  that  f(t)—  f(t0)  =      f'(t)dt  is  that,  to 

Jt0 

every  set  of  points  on  the  t-interval,  of  measure  zero,  there  shall  correspond  a 
set  of  points  on  the  x-interval,  of  measure  zero. 

For  the  total  variation  of  f(t)  over  any  set  of  points  G(t},  on  the  ^-interval, 
is  the  lower  limit  of  the  measure  of  a  set  of  non-overlapping  intervals  on  the 
^-interval  which  encloses  the  set  G(x]  of  points  that  corresponds  to  G(t);  and 
this  lower  limit  is  me(G(x}).  Thus  the  condition  that  the  total  variation  of 
f(t)  over  a  set  G(t),  of  measure  zero,  should  be  zero,  is  equivalent  to  the  con- 
dition that  m(G(x))  should  be  zero. 

A  continuous  function  of  bounded  variation  is  an  indefinite  integral,  if 
both  the  monotone  continuous  functions  of  which  it  is  the  difference  satisfy 
the  condition  of  the  above  theorem. 

It  will  be  observed  that  the  functions  which  are  indefinite  integrals  form 
a  sub-class  of  the  class  of  functions  of  bounded  variation. 

A  function  which  is  expressible  as  the  sum  of  an  indefinite  integral  and 
of  a  bounded  monotone  (not  necessarily  continuous)  function,  and  which  is 
therefore  of  bounded  variation,  has  been  named  byf  W.  H.  Young  an  upper 
semi-integral,  or  a  lower  semi-integral,  according  as  the  monotone  function  is 
non-diminishing  or  non-increasing. 

It  has  been  shewn  by  W.  H.  Young  that  an  integral  is  both  an  upper,  and 
a  lower,  semi-integral.  It  is  accordingly  sufficient,  in  order  that  a  given  function 
be  an  indefinite  integral,  that  it  be  less  than  some  indefinite  integral  by  a 
monotone  non-diminishing  function,  and  also  greater  than  some  indefinite 
integral  by  a  monotone  non-diminishing  function. 

It  can  be  shewn  that  the  necessary  and  sufficient  conditions  that  F  (x) 
should  be  an  indefinite  72-integral  are  that  it  should  have  bounded  derivatives 
in  (a,  b),  and  that  a  derivative  should  be  continuous  almost  everywhere. 

408.  If  Fl  (x),  F.,  (x}  are  indefinite  integrals,  and  F(x]  be  the  function  that 
is  equal  to  Fl  (x),  for  all  values  of  x  such  that  F^  (x)  ^  F.2  (x),  and  to  F.2  (x)  when 
Fz(x}  >Fl(x),  then  F(x}  is  an  indefinite  integral. 

The  theorem  can  be  at  once  extended  to  the  case  of  any  finite  number  of 
indefinite  integrals  f\  (x),  F2  (x),  . . .  Fn  (x\  The  function  W(x)  has  at  each 
point  x  the  value  of  the  greatest  of  the  given  functions. 

*  See  Halm,  Monatalirfte  tier  Muili.  u.  Phi/sik,  vol.  xxm,  p.  103. 
t  Proc.  Loud,  ^fath.  Sue.  (2),  vol.  ix,  p.  294. 
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To  prove  the  theorem,  it  will  be  observed  that,  in  any  interval  (#,,  x2),  the 
variation  F(xz)  —  F(x1)  \  is  equal  to  that  of  F1(x),  if  f\  fa)  ^  F2  (a^)  and 
F-i  (#2)  ^  F2  (#2) ;  it  is  equal  to  that  of  F2  (x)  if  both  the  inequalities  are  reversed, 
and  to  |  F^  (x^  —  F2  (a?2)  I,  or  to  F^  (x.2]  —  F2  (x^}  \,  if  only  one  of  the  inequalities 
is  reversed.  It  is  clear  that  |  F^  (X)  —  F2  (x2)  \<\F-i  (x^)  —  F1  (x2)  \,  in  case 
F,  (X)  >  F, (ars), and  F1  (x,)  -Fa(a;a)\<  F, (x,)  -  F2 (ara)  | , in  case  F2 (x2\  > F1  (^) ; 
and  a  similar  remark  applies  to  [  Fl  (x2)  —  F2  (x^  \.  Thus  the  variation  of  F  (x) 
in  (x-i,  #2)  is  in  any  case  not  greater  than  the  sum  of  the  separate  variations  of 
F1(x),F2(x)  in  the  same  interval;  hence  the  sum  of  the  absolute  variations 
of  F  (x),  in  any  set  of  intervals,  is  not  greater  than  the  sum  of  the  absolute 
variations  of  F^  (x),  and  of  F2  (x},  in  the  same  set  of  intervals.  It  then  follows 
that,  if  Fl(x),  F2(x)  are  absolutely  continuous  in  (a,  b),  so  also  is  F(x),  which 
is  therefore  an  indefinite  integral. 

409.  The  criterion  that  a  function  f  (x),  defined  in  the  interval  (a,  b),  may 
be  the  indefinite  integral  of  a  function  of  bounded  variation  has  been  obtained* 
by  F.  Riesz.  It  is  contained  in  the  following  theorem: 

The  necessary  and  sufficient  condition  that  the  function  F(x),  defined  in  the 
interval  (a,  b),  may  be  the  indefinite  integral  of  a  function  of  bounded  variation 
is  that  the  expression 


r—m—\ 

2 

r=l 


F(xr+l-xr)     F(xr}-F(xr_l] 


inhere  (x0  ,x1}...  xm)  defines  a  net  fitted  on  to  the  interval. (a,  b),  and  x0  =  a,  xm=b, 
should  be  less  than  a  fixed  positive  number,  independent  of  the  particular  net. 

That  the  condition  is  necessary  follows  from  the  fact  that 

F(^,)  -  F(*r)  =  _J__  i^, 

xr+l-xr  xr+1-xrjxr 

where  ftr  is  some  number  between  the  upper  and  lower  boundaries  off(x)  in 
the  interval  (xr,  xr+1).  When/(#)  is  of  bounded  variation,  the  expression  in 
the  theorem  is  seen  to  be  bounded  for  all  possible  nets.  For  the  proof  of  the 
sufficiency  of  the  condition,  reference  may  be  made  to  Riesz's  memoir. 

THE  FUNDAMENTAL  THEOREM  OF  THE  INTEGRAL  CALCULUS  FOR  A 
LEBESGUE  INTEGRAL. 

410.  Let  F(x)  be  a  continuous  function,  defined  for  the  interval  (a,  b), 
and  let  it  be  assumed  that  F(x)  has,  at  every  point  of  (a,  b),  a  differential 
coefficient  F'  (x),  that  is  bounded  in  (a,  b).  It  has  been  shewn,  in  §  401,  that 
F'(x)  is  a  measurable  function;  being  bounded,  it  is  consequently  integrable  (L). 

F  (x  +  hn)  -  F  (x} 

Let  fn(oc)=  ~        — }  '  where  \hn\  is  a  sequence  ot  numbers  con- 

"n 

*  Annales  de  I'ecole  normale  (3),  vol.  xxvm,  p.  3G. 
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verging  to  zero;  then  \fn(x)  is  bounded,  for  all  values  of  n  and  x,  as  it  has 
the  same  boundaries  as  F'  (x).  By  the  theorem  of  §  398,  we  have 

rx  rx 

lim  I   fn  (x}  dx  =     F'  (as)  dx. 

n-^ao  '  <t  Jo 

Thus         fV  (x)dx  =  lim  i  I  f*  +  *"  F  (x)  dx  -  f  °  +  *"  ^ (a?)  <tol 

.'a  hn~Q'ln  (Jx  Ja  ) 

=  F(x)-F(a}. 
The  following  theorem  has  been  established: 

//  F(x)  be  a  function  which  possesses,  at  every  point  of  the  interval  (a,  b),  a 
differential  coefficient  F'(x},  bounded  in  that  interval,  then  F'  (x)  possesses  an 
L-integral  in  the  interval  (a,  x},  which  differs  from  F(x)  by  a  constant  only. 

This  theorem  corresponds  to  the  theorem  (B),  of  §  343.  For  the  /^-integral, 
the  theorem  is  subject  to  the  condition  that  F'  (x)  should  be  integrable  (R); 
but  it  has  here  been  shewn  that  the  corresponding  theorem  holds  without 
restriction,  when  Lebesgue's  definition  of  an  integral  is  employed,  so  long  as 
the  differential  coefficient  is  a  bounded  function. 

411.    The  following  theorem  was  given*  by  Lebesgue: 

If  a  function  <£>(%),  defined  in  a  given  interval,  be  such  that  one  of  its 
derivatives  (say  D+cj)(x))  is  finite  at  every  point,  the  necessary  and  sufficient 
condition  that  the  derivative  is  summable  in  the  interval  is  that  the  given 
function  be  of  bounded  variation  in  the  interval. 

The  indefinite  integral  of  such  a  summable  derivative  is  the  function  of 
which  it  is  the  derivative. 

To  prove  the  theorem,  we  assume  that,  for  each  value  of  x  in  (a,  b),  the 
derivative  D+  <£  (x),  of  the  given  continuous  function  <$>  (x},  has  a  finite  value. 
Let  the  unbounded  interval  (—</>,  oo )  be  divided  into  intervals  (ct,  cl+1), 
where  the  integer  t  has  all  positive  and  negative  values,  including  zero,  for 
which  we  assume  c0  =  0;  also  let  cl+1  —  ct<e,  for  all  values  of  t.  Let  e, 
denote  that  set  of  points  x,  in  (a,  b),  for  which  ct  <  D+  (f>  (x)  £  ct+1 ;  and 
arrange  the  sets  et  in  the  order  e0,  elt  e^,  e2,  e_2, ...,  en,  e_n,  .... 

Let  k0,  k1}  &_i,  &2,  k_2,  ...  kn,  k_n, ...  be  a  sequence  of  positive  numbers,  all 
less  than  1,  so  chosen  that  the  limiting  sum  of 

KO  -p  KI      G!     -\-  n?_i   I  C_i      T  .  .  .  T  Kn  I  Cn  |  -f-  /t_n  j  C_7l     ~r   •  •  • 

is  less  than  e.  Let  the  set  e0  be  enclosed  within  non-overlapping  intervals  of 
a  set  A0,  and  the  complementary  set  C  (e0)  within  intervals  of  a  set  A0',  so 
that  the  measure  of  that  set  of  intervals  which  is  common  to  A0,  A0'  does  not 

*  Lemons  sur  V integration,  p.  123.  For  a  discussion  of  his  proof,  and  for  corrections  to  its 
original  form,  see  Atti  del  Lincei,  Rendiconti,  (5)  vol.  xv(l);  B.  Levi,  pp.  433,  551,  07-i;  (5)  vol.  xv 
(2),  Lebesgue,  p.  3,  B.  Levi,  p.  358.  Also  (5)  vol.  xvi  (1),  Lebesgue,  p.  92. 
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exceed  k0.  Enclose  ex  in  a  set  of  intervals  At,  and  G(e0  +  ei)  in  a  set  of 
intervals  A/;  where  An  A/  are  both  within  A0',  and  have  in  common  a  set  of 
intervals  of  measure  not  exceeding  klf  Proceeding  in  this  manner,  we  enclose 
ep,  where  p  is  positive  or  negative,  in  a  set  of  intervals  Ap,  and 

C(e0  +  el  +  e_!+  ...  +  ep) 

in  a  set  of  intervals  Ap',  so  that  Ap,  A/  are  both  interior  to  A/,  where  q 
immediately  precedes  p,  and  such  that  Ap,  Ap'  have  in  common  a  set  of 
intervals  of  measure  not  exceeding  kp. 

We  have  m(&p)  —  m(ep)  ^  kp;  and  Ap  has  in  common  with  all  the  other 
sets  A  a  set  of  intervals  whose  measure  is  £  k0  +  ^  +  /c_:  +  .  .  .  +kp>  say  ^  if.p. 

m  (  Ap)  —  2  |  cp  j  m  (ep)  does  not  exceed  2  &p  cp  J  ,  or  is  <  e,  we 

p 
and  2    cp  m  (ep)  are  either  both  divergent,  or  both 


Since  H  \cp 

p 
see  that  2  cp  |  m 

p  P 

convergent;  and  in  the  latter  case  the  difference  of  their  limiting  sums  is 
less  than  e. 


Since 


—  e  <   D+  <f)  (x)  |  <  j  cp  |  +  e,  in  the  set  ep ,  we  have 


rb 

(ep)  —  €  (b  —  a)  <  I     D+  </>  (x)  \  dx  <  2  cp   m  (ep)  +  e  (b  -  a). 

J  a  p 

rb 
Hence.if  2  j  cp  m(ep}  is  convergent,     |  D+  $  (x)   dx  is  finite  ;  and  conversely. 

p  J  a 

It  has  now  been  shewn  that  the  necessary  and  sufficient  condition  that 

rb 

dx 


exists,  as  a  finite  number,  is  that  S    cp  m  (Ap)  should  be  convergent. 

p 

Any  point  x,  in  (a,  b),  belongs  to  one  of  the  sets  ep;  let  8P  be  that  interval, 
of  the  set  Ap,  which  contains  x.  Let  (x,  sc  +  h)  be  the  longest  interval,  on  the 
right  of  x,  contained  in  8P,  of  length  not  greater  than  e,  and  satisfying  the 
condition  cp  —  e  ^  7  (x,  x  -\-  h)  ^  cp+l  +  e.  It  follows  that 


h  {!  cp  \  -  2e]  <  |  </>  (x  +  h)  -  (j>  0)  1  <  h 


2 


A  unique  Lebesgue  chain,  reaching  from  a  to  b  (see  §  78),  may  be  defined 
by  means  of  these  intervals  (x,  x  +  h).  For  all  left-hand  end-points  xa,  of  the 
chain,  that  belong  to  one  and  the  same  set  ep,  let  Bp  denote  the  intervals  of 
the  corresponding  part  of  the  chain.  Then  2  |  0  (xa  +  h)  —  <£  (xa)  \  is  between 

the  two  numbers    cp    tn(Bp)  ±  *26m(BP)',  therefore,  for  the  whole  chain, 

2  j  0  (x  +  h)  -  $  (x) 
is  between  the  two  numbers  S  cp   m  (Bp)  ±  2e  (b  —  a). 


Now 


m  (Bp}  £  2   cp  m  (  A^),  and  therefore  2,   cp  j  m  (Bp)  converges  if 


D+  <j>  (x)  dx  is  finite. 
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It  has  now  been  shewn  that,  if  D+  <f>  (x)  is  summable  in  (a,  b), 


the  total  variation  of  <£  (x)  over  the  Lebesgue  chain,  is  less  than 


or  than 


2  I  cp   m  (ep)  +  e  +  2e  (b  -  a), 


rb 
and  this  is  less  than         j  D+  <f>  (x)  \  dx  +  e  +  3e  (6  -  a). 

J  a 

We  have  also       |  <j>  (b)  -  <f>  (a)  \  ^  2  \  $  (x  +  A)  -  </>  (x)  |, 
and  therefore 

-0(a)|<  f   \D+<j>(x)\dx+e+3e(b-a). 

.' 


Since  |  D+  <j>  (x)  is  integrable  in  any  interval  (a,  /3)  contained  in  (a,  b),  the 
last  inequality  may  be  applied  to  the  interval  (a,  /3).  If  (a,  b)  be  divided  in 
m  parts  (orr,  /3r),  where  r  =  1,  2,  3,  ...  m;  we  have 


f  )  -  </>  (ar)  j  < 

r  =  l 

and  since  e  is  arbitrary,  we  have 


r=m 

2 

r=l 


From  this  we  see  that  <j>  (x)  must  be  of  bounded  variation  in  (a,  b),  in  case 


r 

D~*~  <f>  (x)  dx  exists. 


Next,  let  it  be  assumed  that  <f>  (x)  has  bounded  variation  in  (a,  6),  then 
S  |  <j)  (x  +  h)  —  <J)  (x)  taken  for  the  intervals  of  the  chain  is  convergent  (see 

00 

§  243),  and  therefore    S     cp  \  m  (Bp)  converges  to  a  finite  number. 
f~\ 

Take  a  fixed  value  P,  of  p.  All  points  of  Ap  that  are  not  in  an  interval  of 
Bp  necessarily  belong  to  intervals  of  A3,  where  qi=p,  and  their  measure  does 
not  exceed  Kp;  hence 


P-P 


-  2 
P=i 


P=P 


K 


P=I 


The  numbers  k  can  be  subjected  to  the  condition  2  Kp  cp  j  <  e,  for  every 

i (Ap)  —  e  <    2    \cp\m (Bp). 


value  of  P;  we  then  have     2 

P=I 
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2  |  Cp  \  m  (ep)  —  e  <  2     cp  m  (Bp)  <  2,    cp  m  (Bp)  +  e, 

/>=•!  p  =  l  p  =  l 

oo  p  =  P 

and  therefore,  if  2     c^  |  m  (Bp)  is  finite,  so  also  is  2    cp  \  m  (ep),  and  this  last 
P=I  p-i 

/" 
sum  is  bounded  for  all  values  of  P;  it  therefore  follows  that  /    |  D+  <j>(x)  \  dx 

J  a 

exists.    It  has  thus  been  shewn  that  it  is  a  sufficient  condition  for  the  exist- 

rb 
ence  of  I    D+  <f>  (x)  dx,  that  $  (x)  should  be  of  bounded  variation  in  (a,  b). 

•'  a 

In  any  interval  of  the  chain,  we  have 

h  (cp  —  e)  ^  (f>  (x  +  h)  —  4>(x)^  h  (cp+l  +  e); 
therefore,  for  that  part  of  the  chain  for  which  p  ^  P, 

*S  cpm  (Bp)  -e(b-a)^^,{<f>(a;  +  h)-tj)  (x)}  ^  ?S  cpm  (Bp)  +  e(b-  a). 
/>=!  P=I 

p 

It  now  follows  that  2  {<£>  (x  +  h)  —  </>  (as)}  lies  between  the  numbers 


2   cpm(ep)  ±  (b  —  a  +  1)  e. 
p-i 

P-P 

Now  P  may  be  so  chosen  that  2  cpm(ep)  differs  from  2  cpm  (ep)  by  less 

P=I  P=I 

than  e;   then  2  {</>(#  +  /*)  —  <f>(%)},  for  the  whole  chain,  is  between  tlje  two 

00 

numbers  2  cpw  (e^,)  +  (6  —  a  +  2)  e.    Therefore  0  (6)  —  <£  (a)  is  between 
P  =  I 


and  since  e  is  arbitrary,  we  have 
[b 

J  a 

and  in  general 

J  a 

if  x  is  in  (a,  b). 

The  theorem  may  be  proved  for  the  case  in  which  any  one  of  the  other 
derivatives  is  employed,  in  a  precisely  similar  manner.  We  have  then  the 
following  theorem: 

If  (f)(x)  be  of  bounded  variation,  and  have  its  four  derivatives  finite  at  each 
point,  we  have 

rb  rb  rb  rb 

<j>(b)  —  <f>(a)—.\    D+  <f> (x)  dx  =      D+  $  (x)  dx  =     D~  (f>  (x)  dx  =  I   D_  <f>  (x)  dx. 

J  a  J  a  -a  J  a 
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This  holds  for  every  interval  («,  0)  contained  in  (a,  6);  in  every  such  interval 
{D+  <f>  (x)  -D+<j>  (as)}  dx=0;  hence  (see  §  394)  we  have  D+  <f>  (x)  =  D+(j>  (x), 


I. 


almost  everywhere.  Similarly,  by  taking  the  other  derivatives  in  pairs,  we 
see  that,  almost  everywhere,  all  four  derivatives  are  equal,  and  thus  a  finite 
differential  coefficient  exists.  This  is  a  particular  case  of  the  theorem,  estab- 
lished in  §  298,  that  any  function  of  bounded  variation  has,  almost  every- 
where, a  finite  differential  coefficient. 

We  have,  further,  the  following  theorem: 

If  <f>(x)  be  of  bounded  variation,  and  one  of  its  derivatives  D<f>  (x)  is  finite 
at  every  point  of  the  interval  (a,  b),  then 

y  (x)  dx  =  (j)  (b)  —  (f>  (a); 

the  integral  being  taken  over  the  set  of  points  at  which  <$>'  (x)  exists,  and  is  finite. 
In  particular,  the  theorem  holds  if  (f)'  (x)  exists  at  every  point,  and  is  finite. 

412.  The  theorem  of  §  411  may  be  applied*  to  obtain  a  proof  of  the 
theorem  already  established,  in  §  298,  that  a  function  of  bounded  variation, 
and  in  particular,  any  monotone  function,  has  a  finite  differential  coefficient 
almost  everywhere. 

As  any  function  of  bounded  variation  is  the  difference  of  two  monotone 
non-diminishing  functions,  it  is  sufficient  to  consider  the  case  of  a  monotone 
non-diminishing  function  f(x).  All  the  derivatives  of  f(x)  are  =0,  hence 
those  of  f(x)  -f  kx  =  <f>  (x)  are  all  =  k,  where  k  is  any  chosen  positive  number. 
It  will  be  sufficient  to  prove  the  theorem  for  <f>  (x).  Let  <f>  (x)  =  £,  and  consider 
the  inverse  function  x  =  F(^),  regarded  as  defined  on  the  ^-segment  (</>(a)> 
(f>  (b)),  which  is  also  monotone  non-diminishing.  Whether  (f>  (x)  is  continuous 
or  not,  F  (£)  is  a  continuous  function  of  £.  All  the  derivatives  of  F(g)  are  in 
the  interval  (0,  k~l) ;  and  thus,  as  these  are  all  finite,  F  (£)  has  a  differential 
coefficient  for  almost  all  values  of  £.  If  K  be  any  fixed  number  >  A;"1,  we  have 
A#  <  -STA£,  and  thus  we  have  2A#  <  /f  A£.  From  this  it  follows  that,  to  any 
set  of  points  of  measure  zero  on  the  ^-segment,  there  corresponds  a  set  of 
points  on  the  ^'-segment,  of  measure  zero.  Therefore  x  =  F(%)  has  a  differ- 
ential coefficient  F'  (£),  for  almost  every  value  of  x ;  and  thus  0  (x)  has  a 
differential  coefficient  almost  everywhere.  Each  point  on  the  ^-segment  at 
which  F'  (|)  =  0  can  be  enclosed  in  an  interval  A£,  such  that  A#  <  eA£,  where 
e  is  arbitrarily  chosen.  It  follows  that,  if  E  be  the  measure  of  the  set  of 
points  on  the  ^-segment  at  which  F'  (£)  =  0,  the  measure  of  the  corresponding 
set  on  the  ^-segment  is  <  e  \m  (E}  +  77},  where  77  is  arbitrarily  small ;  and  since 
e  is  arbitrary,  it  is  zero.  Thus  the  points  on  the  #-axis,  for  which  F'  (£)=  0, 
and  consequently  <f>'  (x)  =  +  oc  ,  form  a  set  of  measure  zero.  Therefore  $'  (x) 
exists,  and  is  finite,  almost  everywhere  in  (a,  b). 

*  See  W.  H.  Young,  Quarterly  Journal  of  Math.,  vol.XMi,  p.  79. 
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413.    There  remains  for  consideration  the  case  in  which  a  continuous 
function  <£(#),  of  bounded  variation,  is  such  that  one  of  its  derivatives,  say 
(x),  is  not  everywhere  finite. 


If  (f>  (x)  is  continuous,  and  D<f>  (x)  is  finite,  except  at  the  points  of  a 
reducible  set  G,  and  if 

rx 

Z)</>  (x)  dx 

.  a 

exists  as  an  //-integral,  then 


Consider  the  closed  enumerable  set   G'.     If  (a',  x}   be  interior  to  an 

interval  contiguous  to  G',  then 

rx 
(f)(x)-(f>  (a')  =       D(f>  (x}  da;  ; 

J  a' 

and  therefore 

$  (x}  -  !X  D(f>  (x}  dx  =  (f>  (a)  -  r  Dcf)  (x)  dx. 

J  a  J  a 

It  follows  that  the  continuous  function 

<f>  (x)  -  I    D(f>  (x)  dx 

J  a 

is  constant  in  each  interval  contiguous  to  G',  and  therefore,  since  G'  is 
enumerable,  it  is  constant  in  the  whole  interval  (a,  6),  and  equal  to  <£  (a).  We 
have  therefore  the  following  theorem  : 

If  </>  (x)  be  a  continuous  function,  and  if  one  of  its  derivatives  D0  (x)  be 
infinite  only  at  the  points  of  a  reducible  set  in  (a,  b),  and  is  summable  in  (a,  b}, 
when  those  points  at  which  it  is  infinite  are  ignored,  then 


rx 

I    Dcf)  (x)  dx  =  (f)  (x)  —  <f>  (a),  in  (a,  b). 

J  a 


The  condition  that  D(f>  (x}  is  summable  may  be  replaced  by  the  condition  that 
<j)  (x)  should  be  of  bounded  variation. 

If  the  set  G  were  irreducible,  but  non-dense  in  (a,  b),  the  set  G'  would 
contain  a  perfect  component  H.    The  reasoning  given  above  would  shew  that 


rx 

(x)  —  I    D(j>  (x)  dx 

.!  a 


is  constant  in  each  interval  contiguous  to  H,  but  it  would  not  follow  that  it 
would  be  constant  in  (a,  6).  In  fact  it  may  be  a  continuous  function  with  an 
everywhere  dense  set  of  lines  of  invariability  (see  §  269).  In  this  case  the 
relation 


(x)  dx  =  (f>  (x)  —  (f>  (a) 
does  not  in  general  hold  good ;  but  we  have  the  following  theorem : 
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If<f>  (x}  be  a  continuous  function,  with  one  of  its  derivatives  D<f>(x)  infinite 
at  the  points  of  a  non-dense  set  G,  such  that  G'  consists  of  a  perfect  set  H  and 
an  enumerable  set  (which  may  be  absent),  and  if  D<f>  (x)  possesses  an  L-integral 
in  (a,  b),  then 

I    Dd> (x) dx  =  <t> (x)  —  <f> (a)  +  U(x)  —  U(a), 

J  a 

where  U(x}  is  a  continuous  function  with  an  everywhere  dense  set  of  lines  of 
invariability.  The  condition  that  D<f>  (x)  is  summable  may  be  replaced  by  the 
condition  that  <£  (x)  should  be  of  bounded  variation. 

It  has  already  been  shewn,  in  §  405,  that,  if  <f>  (x)  be  continuous,  and  of 
bounded  variation,  D<j>(x)  is  summable,  when  taken  over  the  set  of  points 
Cnecessarily  of  measure  equal  to  that  of  the  whole  interval)  at  which  it  is 
finite. 

414.  It  has  been  shewn  in  §405,  that,  for  a  function  f(x),  that  is  con- 
tinuous, and  of  bounded  variation,  /'  (x)  is  summable  in  the  interval  for  which 

rcc 

f(x)  is  defined.    The  indefinite  integral  I  /'  (x)  dx  is  however  not  necessarily 


equal  tof(x)—f(a),  unless  f(x)  satisfies  the  further  condition  that  it  should 
be  an  indefinite  integral.  In  accordance  with  §  407,  this  condition  may  be 
expressed  in  the  form  that  the  total  variation  of  f(x)  over  any  set  of  points 
of  measure  zero  should  be  zero. 

In  the  general  case  of  a  continuous  function  of  bounded  variation  which 
is  not  subject  to  this  last  condition,  the  relation  of  the  function  to  the 
indefinite  integral  of  f  (x)  is  expressed  by  the  following  theorem,  due*  to 
de  la  Vallee  Poussin  : 

If  f(x)  be  a  continuous  function,  of  bounded  variation  in  the  interval 
(a,  b),  then 

f(x)  -/(a)  -  f"f  (x}  dx  +  V,  (x)  -  T2  (x), 

J  a 

where  V1  (x)  is  the  total  variation  of  the  monotone  continuous  function  fi  (x)  over 
the  set  of  points  in  (a,x),  at  which  one  of  'its  derivatives  D/j  (x)  is  not  finite,  and 
Vz(x)  is  similarly  defined,  relatively  to  /2  (x);  and  /i  (x),  f.2(x)  are  two  con- 
tinuous monotone  non-diminishing  functions  whose  difference  is  f(x). 

It  will  be  sufficient  to  assume  that/(#)  is  monotone,  either  non-increasing 
or  non-diminishing  ;  the  result  will  be  obtained  by  considering  the  difference 
of  two  such  functions. 

Let  </>A-(V)  =  |  Df(x) !,  at  all  points  at  which  j  Df(x)  \  >  N,  and  $#(#)  =  0, 
when  I)f(x)  ^  N;  thus  <f>N(x)  -  Df(x)  ^-N.  Let  the  set  of  points  E,  at 
which  Df(x)  is  not  finite,  be  enclosed  in  a  set  A,  of  non-overlapping  intervals  ; 

*  See  the  Cours  d' Analyse,  2nd  ed.,  vol.  i,  p.  2G9. 
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the  set  A  can  be  divided  into  a  finite  set  Ax,  and  an  infinite  set  A2,  such 
that  the  sum  of  the  variations,  S  {/(#»•)  —  /(#V-i)}>  taken  for  the  intervals 
(#r_i,  xr)  of  A2  is  <  e ;  this  variation  is  taken  with  its  proper  sign. 

Let  FI  (x)  be  the  sum  of  the  variations,  each  with  its  proper  sign,  of  the 
function 


for  the  intervals  of  Ax  that  lie  in  (a,  x)',  and  let  F2  (x)  be  the  sum  of  the  varia- 
tions of/(#),  taken  positively,  for  the  intervals  of  A2  that  lie  in  (a,  x}  ;  we  thus 
have  F2  (x)  £  F2  (b)  <  e.    If  x  is  within  an  interval  of  A1}  or  of  A2,  the  variation 
in  the  part  of  that  interval  on  the  left  of  x  is  counted  in  V1  (x),  or  F2  (x). 
Let  us  consider  the  function 


x  (x)  =/(*)  -     {D/(«)  -  ^  («)i  dx  -  V,  (x)  +  F2  (x). 

J  a 

In  an  interval  of  the  set  A]  ,  Vl  (x)  and 


both  increase  by  the  same  amount,  or  both  diminish  by  the  same  amount,  and 
F2  (x)  does  not  vary  ;  thus  such  an  interval  is  one  of  invariability  of  x  (x)- 
In  a  finite  interval  complementary  to  A1;  V1  (x)  does  not  vary,  and 

rx 

I    <t>t?(x'}dx    V.  (x) 

J  a 

do  not  diminish.    The  function 

f(x)-fXDf(x)dx 

J  a 

has  a  derivative  which  is  equal  to  zero,  for  almost  all  values  of  x\  and  it 
follows  that  the  points  where  the  derivative  of  x  (x)  is  negative  form  a  set  of 
measure  zero,  in  case  there  are  any  such  points. 

Moreover,  DX  (#)  has  nowhere  the  value  —  oo  ,  for  it  is  the  sum  of 

[<t>N(x)-Df(x)}  dx, 
of  which  the  derivative  is  =—N,  and  of 

of  which  the  derivative  is  finite,  except  at  points  in  A2,  and  at  such  points 
cannot  be  negative,  as  in  any  interval  (x,  x  +  fi)  contained  in  an  interval  of 
A2  in  which  f(x)  diminishes,  f(x)  +  V.2(x)  does  not  vary,  and  also  Vl(x)  does 
not  vary.  Since,  in  an  interval  complementary  to  Ax,  DX(X)  has  nowhere  the 
value  —  oo  ,  and  is  certainly  =  0  almost  everywhere,  it  follows  from  the 
theorem  in  §  403,  that  it  is  nowhere  negative,  and  therefore  x  (x}  never 
diminishes.  It  follows  that  x  (x)  ^s  a  monotone  non-diminishing  function ; 
thus  x  (#)  =  X  (ft^'  or 

f(x)  -f(a)  2:  !"{J)f(e)  -  <j>N  (x)}  dx  -  V,  (x)  +  Vz  (x\ 

J  a 
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Since  this  holds  for  an  arbitrary  value  of  e,  let  e~0,  then  F2(#)~0,  and 
F,  (x)  converges  to  the  total  variation  of 

f(x}  -  I    \Df(x)  -  (f>N(x}}  dx, 

J  a 

over  the  intervals  of  A,  each  variation  having  its  proper  sign.    Now  let 
N~  oo ,  then 

'X 


and  F!  (x)  converges  to  the  variation  of 

f(x)-  f*Df(x)dx  in  A; 

.  a 

this  may  be  denoted  by   FA  (x).    Now  let  m  (A)  ~  0,  then  the  variation  of 

fx  —  — 

I    Df(x)  dx,  in  A,  converges  to  zero,  and  KA  (#)  converges  to  V  (x\  the  total 

variation  of/(V)  over  the  set  E.    We  then  have 

f(x)-f(a)  £  \XDf(x]  dx  +  V(x)  ; 


we  can  now  change  f(x)  into   —f(x),  when  V(x)   changes  to  —  F(#);  \\?e 
then  find  that 

f(x)  -/(a)  ^  t"  Df(x)  dx  +  V(x). 

J  a 

From  the  two  inequalities,  we  have 


f(x}  -/(«)=      Df(x)  dx  +  V(x)  =      f  (x)  dx  +  FO), 

.  a  J  a 

when/(«)  is  monotone. 

When  f(x)  is  of  bounded  variation,  and  the  difference  of  two  monotone 
non-diminishing  functions,  we  have 


f(x)  -/(a)  =      f  (x)  dx  +  V,  (x}  -  V,  (x). 

J  a 
THE  TOTAL  VARIATION  OF  AN  INDEFINITE  INTEGRAL. 

415.  It  has  been  shewn  in  §  404  that  the  indefinite  integral  F  (x)  of  a 
function  f(x),  summable  in  (a,  6),  has  its  total  variation 

VlF(x)^  \h\f(x)\dx. 

.   a 

It  can  however  be  proved  that 

VbaF(x)=    b\f(x)   dx. 

-  a 

Let  the  set  El}  of  points  at  which  f(x)  =  0,  be  enclosed  in  a  set  of  intervals 
Aj,  of  which  the  measure  exceeds  that  of  El  by  an  arbitrarily  small  amount; 
a  finite  set  Aj  of  these  intervals  can  then  be  chosen  such  that  the  difference 
of  m  (Aj)  and  m  (E^  is  arbitrarily  small.  Similarly  the  set  E.2>  of  points  at 
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which  f(x)  <  0,  can  be  enclosed  in  a  finite  set  A2,  of  intervals,  so  that  m(A2) 
and  m  (E2)  differ  by  an  arbitrarily  small  number.  Further,  Aj,  A2  can  be  so 
chosen  that  the  measure  of  the  common  part  of  A1}  A2  is  arbitrarily  small. 

We  have/(X)  =/+(#)  —f~(x\  where  f+(x)  =  0,  over  Ez,  and/~(#)  =  0,  over 
E!.  Remove  from  A1?  and  from  A2,  the  intervals  of  the  set  D(&i,  A"2),  which 
they  have  in  common;  there  remains  then  a  finite  set  of  intervals  A,  consisting 
of  the  remaining  parts  of  the  intervals  of  Ax  and  A2.  The  sum  of  the  absolute 
variations  of  F  (x)  over  the  intervals  A  differs  from  the  sums,  added  together, 
of  the  absolute  variations  in  A1;  A2,  by  not  more  than 

f 
9  I  I f(T\    dr • 

-J      |  /     \w/         \JU\KJ    J 

and  this  is  arbitrarily  small.  The  sum  of  the  absolute  variations  over  the  in- 
tervals of  Aj  and  A2  is 


f(x)  dx 


f(x}  dx 


where  A1  = 


=  {S2|;  and  this  differs  by  an  arbitrarily  small  number  from 


|_  f+(x}dx+\_f-(x}dx, 


•    (A,) 


which  differs  by  an  arbitrarily  small  amount  from         |/(#)  |  dx.    Therefore  a 

J  a 

set  of  intervals  A  can  be  determined  such  that  the  sum  of  the  absolute'varia- 
tions  of  F(x)  over  them  differs  by  an  arbitrarily  small  amount  from 


f  \f(x)\dx; 

J  a 


consequently  this  integral  is,  in  accordance  with  the  definition  of  §  243,  the 
total  variation  of  F  (x)  over  (a,  b),  since  that  total  variation  cannot  exceed 
the  integral.  We  have  further 


and  thus 


A 

dx 


exists,  and  is  equal  almost  everywhere  to  f(x) 

The  variation  of  F(x),  over  any  set  E,  has  been  defined  (see  §  252)  as  the 
lower  boundary  of  the  sum  of  the  variations  of  P  (x)  and  N  (x) ;  P  (x)  and 
—  N  (x)  denoting  the  positive,  and  the  negative,  total  variations  of  F  (x)  in 
(a,  x),  taken  for  a  set  of  intervals  (an,  ftn)  which  enclose  the  points  of  E, 
when  all  possible  such  sets  of  intervals  are  taken  into  account. 

We  can  shew  that  the  variation  of  F  (x)  over  E  is  equal  to 


For  the  integral  of  \f(x}\  over  (an,  /3n)  is  the  sum  of  the  variations  of  P  (x) 
and  N  (x). 
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The  above  theorem  holds  also  *  in  the  case  of  functions  of  two,  or  more, 
variables,  denned  in  a  rectangle,  or  cell.  In  the  two-dimensional  case,  the 
total  variation  of  F  (XM,  #(2))  over  a  measurable  set  E  is  the  lower  boundary  of 

2  |  {*W>,  /3n<»)-*W,  aw(2')-*W,  £««)  + JW>f  «„<»)}  |, 
where  the  rectangles  enclosing  E  are  denoted  by 

(a  (1    a  (2)  •  fi  (1)  ft  <2'^ 

VHn    >  "M     3    Pn    >  Pn    )• 
THE  GENERALIZED  INDEFINITE  INTEGRAL. 

416.  If  f(x)  be  measurable  in  a  fundamental  interval,  or  cell,  in  which  it 
is  defined,  and  E  denote  any  measurable  set  of  points  in  that  interval,  or  cell, 


f(x)dx  =  <f>(E)  may  be  called  the  generalized  indefinite  integral  of  f(x). 

(E) 

It  has  been  shewn  in  §  392  to  be  a  function  of  E  which  converges  to  zero, 
as  m  (E)  does  so,  uniformly  for  all  such  sets ;  and  it  has  been  shewn  to  be  a 
completely  additive  function. 

Let  E  +  BE  be  a  measurable  set  which  contains  E,  and  let  E  —  BE  be  one 
which  is  contained  in  E.  A  function  <£  (E)  is  said  to  be  continuous,  for  the 
particular  set  E,  if  <£  (E+BE),  $(E-  BE)  converge  to  <£  (E),  as  m  (BE)  con- 
verges to  zero.  This  is  the  case  if,  for  every  positive  number  e, 

4>(E±BE)-<f>(E)   <e, 

*- 

provided  m(BE)<ije;  where  i/e  depends  upon  e,  and  converges  to  zero  as  e 
does  so. 

If,  for  every  value  of  6,  a  value  of  rje  can  be  so  chosen  as  to  be  the  same  for 
all  measurable  sets  E  in  the  given  domain,  the  function  (f>(E)  is  said  to  be 
uniformly  continuous  in  the  given  domain. 

We  obtain  now  the  theorem  that : 

If  f(x)  be  a  summable  function,  defined  in  a  fundamental  cell,  or  interval, 

f(x)  dx  is  a  uniformly  continuous  function  of  E. 


•1  (E) 

For 


f(x}dx 

(E+SE)  (E)  (SJK) 


r  r  r 

and  I      f(x)dx  —  \  f(®)  dx  =         f(x]  dx  ; 

J  (E)  J  (E-6E)  J  (&E) 

and  the  integral  on  the  right-hand  side  is  numerically  <  e,  for  all  sets  BE,  of 
which  the  measure  is  less  than  a  number  rjf,  dependent  only  on  e. 

rx  r(xw,  x&) 

That       f(x)dx,  f(x(l),  x('2})  d(xw,  xt2>)  are  absolutely  continuous 

Ja  ./  («(!>,  a(2)) 

with  respect  to  x,  or  (x(l},  x{-}),  are  particular  cases  of  this  theorem. 
*  Lebesgue,  Ainiales  de  I'ecole  normale  (3),  vol.  xxvn,  p.  3Si>. 


560  The  Lebesgue  integral  [CH.  vn 

417.  The  following  theorem  is  the  analogue  of  the  property  of  a  continuous 
function  of  a  single  variable,  given  in  §  214,  that  the  function  takes  every 
value  between  its  upper  and  lower  limits  in  the  interval  for  which  it  is  denned: 


If  (f>(e)  denote  the  generalized  indefinite  integral         f  (x}  dx,  for  all 

.'(•) 
measurable  sets  e,  contained  in  a  given  bounded  measurable  set  E,for  luhich  the 

summable  function  f(x)  is  defined,  e  can  be  so  determined  that  (j>(e)  has  a  pre- 
scribed value  lying  between  the  upper  and  lower  boundaries  of  <£  (e)  for  all 
sets  e,  in  E. 

In  the  first  place  we  see  that,  if  E  is  a  measurable  set,  in  any  number  of 
dimensions,  a  component  F,  of  E,  can  be  so  determined  that  m  (F)  has  any. 
prescribed  value  between  0  and  m(E).  For  if  #(1>  be  one  of  the  coordinates 
which  determine  the  position  of  a  point  in  space,  the  section  E(%),  ofE,  between 
a  plane  x(l}  =  f,  and  a  fixed  plane  x(l)  =  a,  where  those  points  of  E  for  which 
xw  <  a  form  a  set  which  has  a  measure  less  than  the  prescribed  number,  is  such 
that  its  measure  mE  (£)  is  a  continuous  function  of  £,  and  thus  the  theorem 
follows  from  the  theorem  of  §  214. 

f  f  f 

We  have  now       f(x}dx=\     f(x)dx  +        f  (x)  dx,  where  e1  is  a  com- 

J  (</)  ./(«,)  J  (e.2)' 

ponent  of  Elt  that  part  of  E  in  which  f(x)  is  positive,  and  where  e%  is  a  com- 
ponent of  E2,  that  part  of  E  in  which  f(x)  is  negative.  The  upper  and  lower 

f  f  f 

limits  of       f(x)  dx,  in  E,  are  clearly         f(x)  dx,  and  /      f(x)  dx,  which  may 

J  (e)  J  (E,)  J  (E,} 

be  denoted  by  m(G1)  and  —  m(G.2),  respectively,  where  G1}  G2  are  sets  whose 
dimensions  exceed  that  of  E  by  unity.  Any  number  G  between  these  two  can 
be  expressed  by  A  —  B,  where  A  <  m(G1),  B>m(G.2).  The  set  el  can  be  so 

determined  that         f(x)  dx  —  A,  and  e2  can  be  so  determinedwthat 
J  « 


then  if  e  =  el  +  e.,,  we  have        f(x)  dx  =  C.    It  should  be  observed  that  G^  and 

•>  (e) 

Gz  are  unbounded  in  case  f(x)  is  unbounded  in  E,  but  they  have  finite  measures. 


THE  INDEFINITE  INTEGRAL  OF  A  FUNCTION  OF  TWO  VARIABLES. 
418.    It  should  be  observed  that  the  indefinite  integral 

F(x®,  x®}  =  f  /V1',  «(2))  d(xU,  a®) 

J  (a'1',  a<->) 

can  be  expressed  as  the  difference  of  two  monotone  non-diminishing  functions, 
which  are  also  quasi-monotone  (see  §  255). 
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For,  let  /!  (xw,  #(2))  =f(xw,  #<2)),  at  the  points  at  which  the  function  /  is 
=  0,  and  let  /8(#(1),  #(2))  =  —f(xw,  x{2>),  at  the  points  at  which  /  is  negative ; 
thus  /(#<'>,  #<2>)  =/,  (a?<»,  x®)  -ft  0(1),  x®\  and 

F '(x^  #(2))  =  F  (x^  x1®)  —  F  (xw  ic(2') 
where  F^  and  F2  are  the  indefinite  integrals  of  f^  and  f2  respectively. 

The  two  functions  F1(x(l),  x®),  F2(xw,x®)  are  both  monotone  and  non- 
diminishing,  in  accordance  with  the  definition  of  §  253.  Again,  with  the  nota- 
tion of  §254, 


/•( 
=  I 


(*«>,  xMy 

with  a  similar  result  for  F2.    It  follows  that  the  functions  Fl}  Fz  are  quasi- 
monotone,  in  accordance  with  the  definition  of  §  255. 

419.    If  the  function  f(xw,  x{2))  be  summable  in  the  cell  (a(1),  a(2);  b(1\  6(2)) 
it  has  been  shewn*  by  Fubini  and  Torelli  that  the  indefinite  integral 

l>,  a;'2>) 


denoted  by  F  (x(l\  x^),  has  the  property  that      (1)     (2)  exists,  and  has  the  value 

,  at  almost  all  points  of  the  cell  ;  and  thus  that 
d2F  d*F 

V  V  ± 


_        x        «, 

~J{      '       >' 

almost  everywhere  in  the  cell.  This  result  has  been  however  deducedf  by 
W.  H.  Young  from  the  more  general  theorem  which  he  established,  that  any 
function  F(xw,  *(2))  that  is  of  bounded  variation  in  the  cell,  in  accordance  with 

d*F 
the  definition  in  §  254,  has  the  property  that      (2)     (1)  exists  almost  everywhere 

in  the  cell.    This  property  we  proceed  to  establish. 

It  is  sufficient  to  consider  the  case  of  a  quasi-monotone  function  F(x(l\  #(2)). 

rlf/1 

Since  F  is  monotone  with  respect  to  x(l),  ^-^  exists,  for  each  fixed  value  of 


#(2),  for  all  values  of  x(1\  with  the  exception  of  those  belonging  to  a  set  S(Xw) 
of  linear  measure  zero.  Let  x(2]  have  all  rational  values  in  the  interval  (a(2>,  6<2)), 
then  all  the  linear  sets  S(x®)  taken  together  form  a  set  S,  of  linear  measure 

1  E7 

zero.    For  every  value  of  x(l)  that  does  not  belong  to  S,  ^—  ^  exists  for  all 

GOC 

rational  values  of  x^. 

Consider  any  value  of  x(l)  that  does  not  belong  to  S,  then 


*  Eend.  del  Circ.  Mat.  di  Palermo,  vol.  XL,  p.  295. 
t  Comptes  Rendus,  Paris,  vol.  CLXIV,  1917,  p.  622. 

H.  36 
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is  monotone  non-diminishing  with  respect  to  #(2),  and  the  two  functions 
J)F(x{1},  x(2}),  DF(xw,  x(2})  which  denote  the  greatest  and  least  extreme  de- 
rivatives of  F(x(l),  #(2)),  with  respect  to  x(l),  are  also  monotone  non-diminishing 
with  respect  to  x{2}.  Consequently,  for  the  fixed  value  of  x(l\  they  are  both 
continuous  functions  of  x(2),  except  for  an  enumerable  set  of  values  of  x(2}.  For 
a  value  of  x(2)  for  which  they  are  both  continuous,  each  of  them  is  the  limit  of 
a  sequence  obtained  by  giving  x(2]  rational  values  only.  But  when  #(2)  is 

r\J? 

rational,  DF(xw,  x{2))  and  DF(x(1\  x{'2))  are  equal,  since  ~—^  exists  at  such 

GOG 

points.    Hence,  for  any  value  of  x('2\  not  belonging  to  the  enumerable  set,  the 

two  functions  DF,  DF  are  equal,  as  they  are  the  limits  of  one  and  the  same 

dF 
sequence  of  numbers  ;  and  therefore  ^-^  exists.    It  thus  appears  that,  for  each 


rlW 

value  of  x(l),  not  belonging  to  a  set  S  of  linear  measure  zero,  ^-m  exists,  for  all 

\)9u 

values  of  #(2),  except  those  of  an  enumerable  set. 

When  x(l]  has  a  fixed  value  not  belonging  to  S,  the  two  functions 

1),  tf<2>),  DF(x(l\  x(2)), 


being  monotone  functions  of  x(2\  have  differential  coefficients  with  respect  to 
x(2],  for  almost  all  values  of  x(2)  ;  thus 

l\  x(2)  +  k)  -  7)F(x(l\  x(2))     DF  O(1),  x™  +  k)  -  DF  (x(l\  #(2)) 


both  have  definite  limits,  as  k  -  0,  if  x(2)  does  not  belong  to  a  certain  set  of 
measure  zero.  A  sequence  of  values  of  k,  converging  to  zero,  may  be  so  chosen 
that  none  of  the  values  of  x(2)  +  k  belong  to  the  exceptional  set,  which  consists 

7)  Jf 
of  the  enumerable  set  for  which  ^-,T,  does  not  exist,  and  of  the  set  of  measure 

dxw 

zero  at  which  the  limits  are  not  definite.  For  this  sequence  of  values  of  k, 
T>(x(l\  xW  +  k}  =  D_(xw,  «<-'>  +  &);  and  also  DF(x(l\  x^)  =  DF(x(l\  x{2)),  if  x^ 
does  not  belong  to  the  exceptional  set  ;  hence  the  two  sequences  are  identical, 

1    (?)F(<rW    r(2)  _L  JA       2EVr(1)    r(2h) 

,,.  T'j.'  1  !•  -"-    \OJP    li*      ,    •*        T^   nj)          OJ7    \Jr      f   ill       II  , 

and  their  common  limit  is  also  lim  y<-         ~   ,.,  —  —>:  and 

fc~0  K  [  OXW  dxw 

d2F 
therefore  -  r^-^—n^  exists.    When  all  values  of  x(l]  are  considered,  the  exceptional 

mw 


sets  all  belong  to  a  set  of  plane  measure  zero.    The  following  theorem  has 
now  been  established  : 

If  F(x(l\  x(2))  be  a  quasi-monotone  function,  defined  in  a  given  cell, 

\x(2))     o*F(x(l\  x®) 


both  exist,  almost  everywhere  in  the  cell. 
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If  F  (x(l\  #(2))  be  an  indefinite  integral,  it  may  be  expressed  (see  §  386)  as 
the  difference  of  two  functions  F1  (x(l\  #(2)),  F2  (x(1\  #(2))  which  are  the  indefinite 
integrals  of  two  non-negative  functions  /i  (x(l\  x(2)),f2(xw,  #(2)),  whose  difference 
is  the  function  f(x(l},  x(2}),  of  which  F(x(l\  #(2))  is  the  indefinite  integral. 

Consider  the  quasi-monotone  function 

rtfv  rx(v 
F,  (x(l\  tf  (2))  =  J       I      /  (x{1\  #(2>)  dxm  .  dx®\          (see  §  255). 

dF  p:(2) 

For  a  fixed  value  of  x(2\  ^—^,  exists,  and  is  equal  to  I       /,  (x(1},  xw)  dx&,  for 

OX(l}  Jaf-'> 

almost  all  values  of  xw  ;  and  when  this  exists,  its  differential  coefficient  with 
respect  to  #(2)  exists,  and  is  equal  to/i  (xw,  x{2}),  for  almost  all  values  of  #(2).    It 


thus  follows  from  the  proof  of  the  foregoing  theorem,  that  5—  72T;r(i)  exigts 

030       (jOC 

is  equal  to  ft  (x(l\  x(2])  almost  everywhere  in  the  cell.    As  the  corresponding 
result  holds  for  the  function  f2,  we  have  the  theorem  that  : 

/•(£<!),  X(2») 

//  F  (%w,  tf  (2))  =  J    ^    (2)  f(x^\  a-<2>)  d  (xw,  x(v\  where  f(x(l\  x™)  is  summable 
in  the  cell  (a(1),  a(2)  ;  b(l\  fe(2)),  then,  almost  everywhere  in  the  cell, 


This  is  the  analogue  of  the  theorem  for  an  integral  of  a  function  of  a  single 
variable,  given  in  §  405. 

INTEGRATION  BY  PARTS  FOR  THE  Z-INTEGRAL. 

rx  rx 

420.    Let  the  two  i-integrals  I    U  (x}  dx,   I    V(x)dx  be  denoted  by  u(x), 

J  a  J  a 

v  (x)  respectively.  The  functions  u  (x},  v  (x)  are  absolutely  continuous,  and  of 
bounded  variation  in  (a,  b).  In  accordance  with  the  theorem  established  in 
§405,  u(x)  and  v  (x)  have  differential  coefficients  equal  to  U  (x\  V  (x)  re- 
spectively, almost  everywhere  in  (a,  b).  Let  E  denote  the  set  of  points  at 

,.,,,..,,  •        du    dv  dv       du 

which  this  is  the  case  ;  then,  since  -,-  ,  -7-  are  summable  m  a,  u  -y-  ,  v  ^—  are 

dx    dx  dx      dx 

also  summable  in  E,  since  u  and  v  are  bounded.    We  now  have 

[     d  (uv)  7  du  ,        f         dv   , 

—  7-  -   dx  =         v  -j-  dx  +  \       u  -=-  dx  ; 
J  (E}    dx  J  (E]     dx          J  (E}     dx 

and  since  m  (E)  =  b  —  a,  we  may  write  this  equation 

r     ,6       'b    du  7        [b    dv  . 
\uv\  =      v  -j-  dx  +  I   u  -j-  dx; 

a       .  a      ^"^  J  a      ^^-' 

and  this  is  equivalent  to 

x=  !bU(x)dxfbV(x)dx-  fV(«)|  F  U  (x) 

J  a  J  a  J  a  (  J  a 

which  is  the  general  form  for  the  integration  by  parts,  for  //-integrals. 

36—2 
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MEAN   VALUE   THEOREMS. 

421.  It  is  frequently  of  importance  to  be  able  to  assign  upper  and  lower 
limits  between  which  the  value  of  a  definite  integral  lies,  in  cases  where  the 
exact  determination  of  the  value  is  not  required.  Such  estimates  of  the  value 
of  a  definite  integral  may  frequently  be  made  by  the  employment  of  theorems 
known  as  mean  value  theorems ;  the  most  important  of  these  will  be  here 
investigated. 

If  f(x)  be  a  bounded  function,  summable  in  the  measurable  set  of  points 
E,  we  have,  denoting  by  U  and  L  the  upper,  and  the  lower,  boundary  of  f(x) 
iuE, 

Lm  (E)  g  (     f(x)  dx  £  Urn  (E)  • 

J(E) 

and  therefore   I      f(x)  dx  =  Mm  (E),  where  M  is  some  number  which  satisfies 

J(E) 

the  condition  U  ^  M  ^  L. 

This  holds  good  whatever  be  the  number  of  dimensions  of  the  set  E. 

In  case  f(x)  is  a  continuous  function  in  the  linear  interval  (a,  b),  there  must 
be  some  point  in  (a,  b)  at  which  f(x)  =  M.  If  that  point  be  denoted  by 
a,  +  6  (b  —  a),  we  obtain  the  following  theorem  : 

If  f(x)  be  continuous  in  the  linear  interval  (a,  b),  then 
f(x)  dx  =  (b-  a)f{a  +0(b-  a)}, 


f 

.'  a 


where  6  is  some  number  such  that  0  ^  6  £  1. 

Next,  let  f(x)  and  <£  (x)  be  summable  in  the  measurable  set  E,  and  suppose 
f(x)  is  bounded  in  E,  and  $  (#)  everywhere  ^  0  in  J£.  We  find  immediately 
from  the  definition  of  the  integral  off(x)  $  (x)  in  E,  that 

f  f  f 

i  I      0  (#)  a«  g  I      f(tt)  </>  (x)  dx  ^  U  \      (f>  (x)  dx, 

J  (E)  J  (E)  J  (E) 

where  U  and  L  are  the  upper,  and  the  lower,  boundary  off(x)  in  E. 
It  follows  at  once  that 

f(x)  </>  (x)  dx  =  M1  I      (/>  (x)  dx, 

J  (E)  J  (B) 

where  ML  is  a  number  such  that  U^Ml^L. 

In  case  f(x)  be  a  function  that  is  continuous  in  the  linear  interval  (a,  b), 
we  obtain  the  following  theorem  : 

If  f(x)  be  continuous  in  the  linear  interval  (a,  6),  and  <j>  (x)  be  a  summable 
function  that  is  =  0, 

rb  rb 

?)  <4  (x)  dx  —f{a  +  6l  (b  —  a)\       <£>(%)  dx, 

J  a 

where  6l  is  some  number  such  that  0  ^  #;  ^  1. 


/; 

J  a 
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This  theorem,  including  also  the  more  general  case  in  which  the  integration 
is  over  any  measurable  set  E  in  which  f(x)  is  continuous,  is  known  as  the 
First  Mean  Value  Theorem  of  the  Integral  Calculus. 

An  extension  to  the  case  in  which  $  (x)  is  not  necessarily  everywhere  of 
the  same  sign,  but  has  a  finite  lower  boundary,  is  obtained  by  applying  the 
theorem  to  <f>(x)  +  C,  where  C  is  such  that  (f>(x)  +  C^O  in  (a,  b),  or  in  the 
set  E. 

For  the  case  of  a  function  of  two  variables,  where  f(xw,  #(2))  is  continuous 
in  the  cell  (a'1',  a<2)  ;  6(1),  6<2'),  and  <£  (x(l),  *-(2))  is  summable,  and  ^  0,  in  that  cell, 
the  theorem  may  be  written  in  the  form 


/• 

J  ( 


(2))  d  O(1),  x®) 

(ad),  aW) 


O(l),tf(2)), 

•),  oO»)   ' 

where  0l}  02  satisfy  the  conditions  0  ^  0l  ^  1,  0  =  02  =  1. 

422.  If  the  bounded  function  f(x)  be  monotone  and  non-increasing  in  the 
linear  interval  (a,  b),  and  everywhere  =  0,  and  if  <f>  (x)  be  summable  (whether 
bounded  or  not)  in  (a,  b),  then 

Cb  r( 

f(x)  </>  0)  dx  =f(a)       (j>  (x)  dx, 


where  f  is  some  number  such  that  a  ^  £  =  b. 

Also,  iff(x)  be  monotone  and  non-diminishing  in  (a,  b),  and  everywhere  =  0, 

rb  rb 

then  f(x}  0  (x) ax  =f(b)      <f>  (^)  dx, 

J  a  J  f 

where  %  is  some  number  such  that  a  ^  g^  b. 

This  theorem  was  first  given*  by  Bonnet,  for  the  case  in  which  <£(#)  has 
an  .R-integral,  and  was  applied  by  him  to  the  theory  of  Fourier's  series. 

Another  form  of  the  theorem  was  obtained  by  Weierstrass,  and  also  by 
Du  Bois  Reymondf,  for  the  case  in  which  </>(#)  has  an  .R-integral,  and  it  is 
generally  known  as  the  Second  Mean  Value  Theorem.  When  generalized,  so 
that  $  (x)  is  only  restricted  to  be  summable  in  (a,  b),  the  theorem  may  be 
stated  as  follows : 

Iff(x)  be  monotone  and  bounded  in  the  linear  interval  (a,  b),  and  if  (j>  (x)  be 
summable  (bounded  or  unbounded)  in  that  interval,  then 

rb  '£  rb 

I    f(x)  <j>  (x)  dx  =f(a)       <f>  (x)  dx  +f(b)        (f>  (x)  dx, 

J  a  J  a  J  f 

where  £  is  some  point  in  (a,  b). 

*  Mem.  Acad.  Belg.,  vol.  xxin  (1850),  p.  8 ;  also  Liouville's  Journal,  vol.  xiv  (1849),  p.  249. 
t  Crelle's  Journal,  vpl.  LXIX  (1869),  p.  81. 
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The  theorem  in  this  form  is  deducible  immediately  from  Bonnet's  theorem, 
by  writing  f(x)  —f(b),  orf(x)  —f(a),  in  the  two  cases,  instead  of  f(x).  Bonnet's 
theorem  is  however  not  immediately  deducible*  from  the  second  theorem. 

rb 
It  is  clear  that,  since  the  value  of      f(x)  $  (x)  dx  is  unaltered  by  changing 

J  a 

the  values  of  f(a)  and  f(b),  we  may,  in  the  above  statement,  take,  instead  of 
f(a),f(b),  any  two  numbers^  A,  B  which  are  such  that  the  function  ^r(x), 
defined  by  -v|r  (a)  =  A,  ty  (b)  =  B,  -vjr  (x)  =f(x),  for  a  <  x  <  b,  is  monotone.  We 
have  thus  the  generalized  form  of  the  theorem  : 

rb  rt  rb 

I    f(x)  (f)  (x)  dx  =  A  \    $  (x)  dx  +  B  I    tf>(ai)  dx, 

J  a  J  a  J  f 

where  A  =f(a  +  0),  B  ^f(b  —  0),  iff(x)  is  non-diminishing,  and  A  =f(a  +  0), 
B^f(b  —  Q),  iff(x)  is  non-increasing. 

The  value  of  £  depends  in  general  upon  the  chosen  values  of  A  and  B.  In 
this  generalized  form,  the  theorem  includes  Bonnet's  theorem  as  a  particular 
case.  For  we  may  take  A  =  0,  B  =f(b),  if /(#)  is  positive  and  non-diminishing ; 
or  A  =f(a),  5  =  0,  in  case/(#)  is  positive  and  non-increasing. 

Various  proofsj  of  the  theorem  of  Weierstrass  and  Du  Bois  Reymond§ 
have  been  published.  In  these  proofs  the  function  <£(&•)  has  usually  been 
restricted  to  change  its  sign  only  a  finite  number  of  times,  and  sometimes  to 
be  differentiable ;  but  a  proof  free  from  the  former  restriction  was  given  by 
Du  Bois  Reymond.  A  proof  has  been  given  by  Pringsheim||  in  which  <j>(x) 
is  not  restricted  to  be  a  bounded  function,  but  may  be  any  function  such  as 
possesses  an  absolutely  convergent  integral,  or  in  certain  cases  it  may  have  an 
integral  that  is  not  absolutely  convergent. 

423.  The  following  proof  of  the  second  mean  value  theorem,  in  its  general 
form,  in  which  the  only  restriction  imposed  upon  the  function  $  (x)  is  that  it 
is  to  be  summable,  whether  bounded  or  unbounded,  was  given  IT  by  Hobson. 

Let  <f)  (x}  be  a  function  which,  whether  it  be  bounded  or  not,  is  summable 
in  the  interval  (a,  b).  Let/(V)  be  monotone  and  non-increasing  in  the  interval, 
and  suppose  it  to  have  no  negative  values. 

Let  er  be  an  arbitrarily  chosen  positive  number,l'ess  than  f(a  +  0)  —f(b  —  0); 
and  let  the  function  fr  (x)  be  defined  for  the  interval  (a,  b)  as  follows : 

*  This  was  pointed  out  by  Pringsheim,  Miinch.  Ber.,  vol.  xxx,  where  an  account  of  various 
proofs  of  the  theorem  is  given. 

t  Du  Bois  Reymond,  Schlomilch's  ZeitscJir.,  vol.  xx;  Hist.  Lit.  Abtg.,  p.  126. 

J  For  example,  by  Hankel,  Schlomilch's  Zeitschr.,  vol.xiv;  by  Meyer,  Math.  Annalen,  vol.  vi; 
by  C.  Neumann,  Kreis- Kugel-  und  Cylinderfunctionen,  Leipzig,  1881,  p.  28;  by  Holder,  Gottinger 
Anzeiger,  1894,  p.  519;  by  Netto,  Schlomilch's  Zeitschr.,  vol.  xi;  by  Kowalewski,  Math.  Annalen, 
vol.  LX. 

§  Crelle's  Journal,  vol.  LXXIX  (1875),  p.  42.    See  also  Kronecker's  Vorlesungen,  vol.  i. 

||  Miinch.  Her.,  vol.  xxx.  IF  Proc.  Land.  Math.  Soc.  (2),  vol.  vu,  p.  14. 
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An  interval  (a,  a?j)  can  be  so  determined  that  f(a  +  0)  —/(a?)  <  er,  for 
a  £  oc  <  a?i,  and  so  that  /(a  +  0)  —f(xl)  =  er.  In  case  a^  is  a  point  of  continuity 
of  f(x\  we  shall  have  /"(a  +  0)  —  f(x1)  =  ef,  but  this  will  not  be  the  case  if  xl 
is  a  point  of  discontinuity.  Next,  an  interval  (xl,xt)  can  be  so  determined 
that  /(#a  +  0)  —f(x)  <€r,  for  xl  £  x  <  a?2,  and  that  f(x^  +  0)  —f(x2)  =  er.  Pro- 
ceeding in  this  manner  to  determine  intervals  (x2,  x3),  (a?3,  #4),  ...  ,  for  some 
finite  value  of  n,  not  exceeding  {/(a  +  0)—  f(b  —  0)}/er,  we  must  have 

/(>„_!  +  0)  -/(a?)  <  er,  for  a?B_i  ^  x  <  b  ; 
we  then  take  a>n  to  coincide  with  b. 

Let  /r  (a;)  =f(a  +  0),  for  a  ^  a?  <  a^  ;  let  fr  (x)  =f(xl  +  0),  for  xl  £  x  <  a?2;  and 
in  general  let  fr  (a;)  =f(xs  +  0),  for  xg^x<  xs+l  .  The  function  /r  (a;),  so  defined, 
has  only  a  finite  number  of  values  in  the  interval  (a,  b)  ;  it  is  monotone  and 
non-increasing,  and  is  =  0.  Moreover,  we  have  0  ^fr(x)—f(x)<  e,  for  every 
value  of  x  except  for  the  values  a,x1,x2,  ...  xn-i,  b,  at  which  it  is  not  necessarily 
the  case. 

We  have  now 


[ 

J  a 


Denoting  I    <£  (a;)  das  by  F  (x),  we  have 

J  a 

£  /,  0)  </>  (a;)  dx  ={f(a  +  0)  -/(«,  +  0)}  F  (x,)  +  {/(x,  +  0)  -/<>,  +  0)}  F(x2) 


+  . . .  +  {/<>n_2  +  0)  -f(xn_l  +  0)}  F(xn_l)  +f(x1l_l  +  0)  F(b). 

Since  /(a  +  0)  -f(x,  +  0),  /(^  +  0)  -f(x2  +  0),  . . . ,  f(xn^  +  0)  are  all  >  0, 
the  expression  on  the  right-hand  side  will  be  unaltered  in  value  if  F(x:), 

F(x2), F(xn_1),  F(b)  be  all  replaced  by  a  definite  number  N  which  lies 

between  the  greatest  and  the  least  of  these  numbers.  The  expression  then 
becomes  Nf(a  +  Q).  Moreover,  since  F(x)  is  continuous  in  the  interval  (a,  b), 
some  value  %r,  of  a?,  exists  such  that  N=F(i;r').  It  has  therefore  been  proved  that 

"6  [Sr 

fr(x)  (f>(x)  dx=f(a  +  0)        (f)(x)dx, 

\  J  a 

where  gr  is  some  point  in  the  interval  (a,  6). 

rb  rb  rb 

Also          I    fr  (x)  <j>  (x)  dx  —       f(x)  <f>  (x)  dx   <er  I      (f>  (x)  j  dx. 

•la  .a  J  a 

It  follows  that 

"'  f  f  r 

f(x)  </>  (a;)  dx  -f(a  +  0)        0  (a?)  dx 


T 

J  a 
rb 
where  77,.  =  er  I      <£  (a;)   dx. 

J  a 
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Let  r  =  1,  2,  3,  . . . ,  where  ej ,  e2,  e3,  . . .  is  a  sequence  which  converges  to  zero; 
then  also  771, 773,  rjs, ...  is  a  sequence  w^hich  converges  to  zero.  The  set  of  points 
£i>  &>  £g>  •••  has  at  least  one  limiting  point;  and  it  is  clear  that  the  sequence 
|er}  may  be  so  chosen  by  neglecting,  if  necessary,  a  part,  that  the  sequence 
{£,.}  has  a  single  limiting  point  f . 

We  have  then 


If  £be  an  arbitrarily  chosen  positive  number,  a  value  r1}  of  r,  may  be  so 


/•g 

I    <f)(x)dx    <  !£,  provided  r^rj.    Then 

./  f ,. 


chosen  that  r)r  <  |  £,  and  f(a  +  0) 

we  have 

r?>  ri 

f(x)  (f>  (x}  dx  —f(a  +  0)  I    (f)  (x}  dx    <  £; 

.'a  J  a 

and  therefore,  since  £  is  arbitrarily  small,  we  must  have 

K*)<fc    (!)• 


In  a  precisely  similar  manner,  wheny(#)  is  non-diminishing  in  (a,  b),  and 
is  never  negative,  it  may  be  shewn  that 

(«)<to     ...............  (2), 


where  rj  is  some  point  in  (a,  b). 

In  case  /(a)  =f(a  +  0),  f(b)  =f(b  —  0),  these  results  are  equivalent  to' 
Bonnet's  form  of  the  second  mean  value  theorem. 

Next,  let  f(x)  be  only  restricted  to  be  bounded  and  monotone  in  (a,  b), 
but  to  be  unrestricted  as  regards  sign.  In  case  f(x)  is  non-increasing,  we  may 
apply  the  theorem  (1)  to  the  function  f(x)  —f(b  —  0),  and  we  thus  have 

[  /(»  </>  0)  dx  =f(a  +  0)   [  <j>  (x)  dx  +  f  (b  -  0)  f  $  (x)  dx. 

J  a  Ja  J  f 

In  case/(a;)  is  non-diminishing  in  (a,  b),  we  may  apply  the  theorem  (2)  to 
the  function  /(#)—  f(a  +  0),  and  we  then  have 

f  /(a?)  <j>  (x}  dx  =f(a  +  0)     *  </>  (*)  ^  +/(6  -  0)  [    $  (a)  <&?. 

J  a  ~  a  J  1 

It  has  thus  been  shewn  that,  if  f(x)  be  bounded  and  monotone  in  (a,  b), 
and  (f)  (x)  be  summable  in  the  same  interval, 

t  f(x)  <j)  (x)  dx  =/(«  +  0)  !    $  (x)  dx  +f(b  -  0)  \  (f>  (x)  dx, 

J  a  -a  J  X 

where  X  is  some  point  in  the  interval  (a,  b). 
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In  order  to  obtain  the  more  general  form  of  this  theorem,  let  A  and  B  be 
numbers  such  that  A  ^  f(a  +  0),  B  ^  f(b  —  0),  when  f(x)  is  non-increasing  ; 
or  else  let  A  ^/(a  +  0),  B^f(b—  0),  when/(#)  is  non-diminishing. 

Consider  an  interval  (a  —  A,,  b  +  \)  which  contains  (a,  b)  in  its  interior,  and 
let  f(x)  =  A,  for  a  —  \  ^  x  <  a  ;  and  f(x)  =  B,  for  b  <  x  ^  b  +  \  ;  the  function 
f(x)  being  already  defined  for  a  ^  a;  ^  b.  Let  <f>  (x)  =  0,  for  a  —  \  ^  x  <  a,  and 
for  b  <  a;  ^  b  +  \,  where  </>  (x)  has  already  been  defined  for  a  ^  x  ^  b.  Now  apply 
the  theorem  above  established,  for  the  interval  (a  —  \  b  +  \),  for  which 
f  (a  -  \  +  0)  =  A,  and  /(&  +  \  -  0)  =  B.  We  then  have 

rb  rX  rb 

f(x)  <j>  (x)  dx  =  A  I    4>(x)  dx  +  B  /    <f>  (x)  dx, 

J  a  J  a  J  X 

where  X  is  some  point  in  (a  —  \,  b  4-  X),  and  which  clearly  lies  in  (a,  6). 

The  general  form  of  the  second  mean  value  theorem  may  now  be  stated 
as  follows  : 

Iff(x}  be  bounded  and  monotone  in  the  interval  (a,  b),  and  if  </>(#),  whether 
bounded  or  not,  be  summable  in  (a,  b),  then,  if  A,  B  be  numbers  such  that 


or  ^a  +     , 

according  as  f(x)  is  non-increasing,  or  non-diminishing,  in  (a,  b), 

rb  rX  rb 

I    f(x)  (f>  (x)  dx  =  A    I     (j)  (x)  dx  +  B  I    <j>  (x}  dx, 

J  a  J  a  J  X 

where  X  is  some  number  in  (a,  b).  The  number  X  will  depend  upon  the  values 
of  A  and  B.  In  particular,  we  may  have  A  =f(a),  B  —f(b),  or  also  A  =f(a  +  0), 
B=f(b-0). 

In  case  the  function  f(x}  is  never  negative  in  (a,  b),  we  may  take  B  =  0, 
iff(x)  is  a  non-increasing  function  ;  and  we  may  take  A  =  0,  if  f(x)  is  a  non- 
diminishing  function.  We  obtain  thus  the  following  generalized  form  of 
Bonnet's  theorem  : 

If  f(x)  be  a  bounded  monotone  function,  never  negative  in  (a,  b),  and  if  (f>  (x} 
be  any  function  summable  in  (a,  b),  then 

rb  rx 

I    f(x)  </>  (x)  dx  =  A         0  (x)  dx 

J  a  J  a 

where  A  is  any  number  =f(a  +  0),  and  X  is  some  number,  dependent  on  A,  in 
the  interval  (a,  b),  provided  f(x)  is  non-increasing.  If  f(x}  is  non-diminishing, 

we  have 

rb  rb 

f(x)  (j)  (x)  dx  —  B\    (f)  (x)  dx 

J  a  J  x 

where  B  is  any  number  ~  f(b  —  0),  and  X  is  some  number  in  (a,  b),  dependent 
on  the  value  of  B.  In  particular,  we  may  take  A  =f(a),  B  =/(&),  in  the  two 
cases. 
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424.    If  the  function  f  (x)  be  of  bounded  variation  in  the  linear  interval 

r/3 
(a,  b),  and  M  be  the  upper  boundary  of    I    <£  (x)  dx  ,  for  all  pairs  of  values  of 

J  a 

a,  @  in  the  interval  (a,  b),  we  may  make  use  of  the  fact  that 


where  P(x),  N(x)  are  monotone  non-diminishing  functions. 
We  have 

b{P(x)  -P(a)}<f>(x)dx  =  {P(b)-P(a)}  l\(x)dx 

a  ^  f 

with  a  similar  equation  for  Q  (x).    By  subtraction,  we  have 

,-P(a)) 


/: 


Hence  we  have*  the  following  result: 

"b  Cb 

f(x)  (f>  (x)  dx  —  f(( 


dx 


<M.VJ>f(x), 


Vab  f(x}  denoting  the  total  variation  in  (a,  b);    where  f  (x)  is  of  bounded 


0  (x)  dx 


,  for  all  in- 


variation  in  (a,  b),  and  M  is  the  upper  boundary  of 
tervals  (a,  /3)  contained  in  (a,  b). 

This  result  is  of  considerable  use  in  estimating  the  value  of  the  integral  of 
the  product  of  a  function  of  a  bounded  variation  and  a  summable  function. 

425.  Let  (j>  (x)  denote  a  function  of  any  number  of  variables,  that  is  sum- 
mable in  a  bounded  measurable  set  E;  and  let  f(x)  denote  a  function  that 
is  bounded  and  summable  in  E,  and  is  everywhere  =  0.  If  U  and  L  denote 
the  upper,  and  the  lower,  boundary  of /(a1),  in  E,  let  a0,  a1}  a2,  •••  an  be  a  set  of 
decreasing  numbers,  where  a0  =  A,  a  number  =  U,  and  L  =  an  =  0.  Let  e.r_^ 
denote  that  set  of  points  for  which  ar^  =/(«)  >  ar,  for  r  =  1,  2,  3, ...  n. 

Let/(m)  («)  =  «,._!,  in  the  set  er_1?  for  each  value  of  r;  we  have  then 

r  r  r  r 

I     f(m)  (x) (f> (x) dx  =  a0\     (f>  (x) dx  +  a}  I     (f>(x)dx+  ...  +  an^          0  (x)  dx 

J(E)  •/(«„>  •>(«,)  hen-,) 


)  dx , 


and  therefore        f(m](x}d>(x)dx=AMm,  where  Mm  is  some  number  between  the 

(E) 

*  See  Lebesgue,  Annales  de  Toulouse,  (3),  vol.  i,  p.  37. 


(E) 
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greatest  and  the  least  of  the  numbers  /      <£  (x)  dx;  EB  denoting 

J{E») 

&0  ~T  &1  ~T  •  •  •     I    &8—1  ) 

for  5  =  1,  2,  ...  n.    The  set  Es  is  that  set  for  which  U±f(m]  (x)  >  as. 

The  numbers  a0,  a1(  a2,  ...  an  can  be  so  chosen  that  the  greatest  of  the 
differences  of  the  successive  numbers  is  <  ew,  where  |em]  is  a  sequence  of 
decreasing  numbers  converging  to  zero.  We  have  now 

if     f^(x)<t>(x}dx-l      f(x)<j>(x)dx 
J(E)  J  (E) 

which  converges  to  zero,  as  m  ~  oo  .    The  numbers  AMm  consequently  con- 

f  r 

verge  to        f(x)<j>  (x)  dx,  as  m  ~  oo  ;  and  thus  I     f(x)<j>  (x)  dx  =  AM,  where 

J  (E)  J  (E) 

M  is  between  the  greatest  and  least  of  the  numbers  I     (f>  (x)  dx,  and  e  belongs 

J  (e) 
to  a  family  of  sets  such  that  in  any  one  of  them  f  (x)  exceeds  a  certain  number 

between  A  and  zero.    By  the  theorem  of  §  417,  there  exists  a  set  E±  of  the 

family,  for  which        6  (x)  dx  =  M. 
•  CE.) 

The  following  theorem*  has  thus  been  established: 

If  E  be  a  set  of  points  which  is  measurable  and  bounded,  and  in  any  number 
of  dimensions,  <j>(x),f(x)  are  summable  in  E,f(x}  being  bounded  therein,  and 

^  0,  then 

r  r 

/(#)  6  (x)  dx  =  A  \       (f>  (x)  dx, 

J  (E)  J  (Ei) 

where  A  is  any  assigned  number  =  the  upper  boundary  off  (x)  in  E,  and  E1  is 
a  part  of  E  dependent  on  the  value  of  A,  and  belonging  to  a  family  of  sets  of 
increasing  measure,  such  that  for  any  one  of  them  f(x)  is  greater  than  some 
fixed  number. 

This  theorem  may  be  regarded  as  the  complete  generalization  of  Bonnet's 
theorem,  for  any  number  of  dimensions,  and  for  integration  through  any 
measurable  and  bounded  set  of  points. 

If/  (x}  be  no  longer  restricted  to  be  ^  0,  we  may  apply  the  above  theorem 
to  the  function  f(x)  —  B,  where  B  is  any  number  ^  L,  the  lower  boundary  of 
f(x)  in  E. 

The  theorem  then  takes  the  form: 

r  r  r 

/  (x)  <f>(x)dx  =  A'  I       d>  (x)  dx  +  B'  I  rf>  (x)  dx, 

•J(E)  J(Ej  HE-EI) 

where  A'  and  B'  are  such  that  A'  ^  U,  B'  ^  L  ;  U,  L  being  the  upper,  and  the 
lower,  boundary  of  f(x)  in  E,  and  El  is  a  part  of  E  such  that  f(x)  is  therein 
greater  than  some  number  dependent  on  A'  and  B'. 

*  See  Lebesgue,  Annales  de  I'ecole  normale,  (3),  vol.  xxvn,  p.  443. 
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If  we  take  the  set  E  to  consist  of  the  points  of  a  linear  interval  (a,  6);  and 
/  (x)  to  be  monotone,  non-increasing,  and  ^  0,  the  set  El  will  be  the  set  of 

rfi 
points  of  some  interval  (a,  £),  and  thus  M  =  I   </>(#)  cfo,  for  some  value  of  £  in 

./  a 

(a,  6).  Similarly,  if  f(x)  be  ^  0,  and  monotone,  non-decreasing,  we  must  take 
for  E1  some  interval  (£',  6).  Thus  the  first  theorem  given  above  reduces  to 
Bonnet's  theorem. 

426.  In  case  the  [set  E  is  the  plane  rectangle  (a(1),  a(2);  b(l\  6(2)),  and  the 
function f(x(l},  #(2))  is  monotone,  and  non-increasing  (see  §  253),  and  ^  0,  the  set 
E1  may  be  replaced  by  the  set  of  points  in  a  rectangle  (a(1),  a(2);  f (l),  £(2)),  where 
(£(1),  £(2))  is  some  point  in  E ;  we  thus  have 


•I,  a  (2>) 

.(id),    £(2)) 


/•( 

J  ( 


(a(D,   a®) 

where  A  ^/(a(l)  +  0,  a(2)  +  0). 

In  case  the   monotone    function   is   non-diminishing,  and   =  0,  we   find 
similarly  that 


where  5  ^/(6<J)  -  0,  6(2)  -  0). 

These  theorems  are  the  extension  of  Bonnet's  theorem  to  the  case  of 
functions  of  two  variables. 

In  case  f(x(l\  #(2))  is  a  quasi-monotone  function  =  0,  in  the  rectangle 
(a(1),  a('2)',  i(l),  i(2))  the  form  of  the  theorem  depends  upon  the  type  of  the 
function  (see  §  255). 

If  f(x(l\  ac{'2))  is  monotone  non-increasing  with  respect  both  to  x(l)  and 
to  «(2),  the  theorem  takes  the  form  (1). 

If  the  function  be  monotone  non-diminishing  with  respect  both  to  x(l]  and 
to  x('2\  the  theorem  takes  the  form  (2). 

If  the  function  be  monotone  non-increasing  with  respect  to  x(l],  and 
monotone  non-decreasing  with  respect  to  x('2\  the  theorem*  takes  the  form 

r(bw,  ft*2)) 

f(x^\  x(*}  6  (x^\  x®}  d  (x(1),  x(»} 
J  (aW,  a  I2') 


*  See  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  273,  where  the  case  of  quasi- 
monotone  functions  of  any  number  of  variables  is  treated. 
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where  (7^/(a(1)  +  0,  6(2)  —  0).  When  the  function  is  monotone  non-decreasing 
with  respect  to  XM  ,  and  monotone  non-increasing  with  respect  to  x(z\  the 
theorem  takes  the  form 


(aW.aW)* 

""~~  -W  I  O  \OC     )  00     )  CL  \(K     *  3C     /  . . .  (  T  ) . 

J  (£<D,  a<2>) 

where  D  ^/(6(1)  -  0,  a(2)  +  0). 

It  is  easy  to  obtain  the  formulae  which  correspond  to  the  general  form  of 
the  second  mean  value  theorem,  wheny(#(1),  a?(2))  is  no  longer  restricted  to  be 
non-negative. 

REPEATED  LEBESGUE  INTEGRALS. 

427.  The  ./[/-integral  of  a  function  of  any  number  of  variables  has  been 
defined  as  the  limit  of  a  sequence  of  simple  sums;  but  it  is  convenient  to 
represent  such  an  integral  as  a  set  of  repeated  limits  of  sums  with  respect  to 
the  separate  variables,  and  thus  to  express  the  integral  as  a  repeated  integral 
with  respect  to  the  separate  variables,  taken  in  any  order.  As  an  Z-integral 
with  a  non-negative  integrand  is  the  measure  of  a  set  of  points  in  space  of 
dimensions  one  greater  than  the  dimensions  of  the  space  in  which  the  function 
is  defined,  we  first  shew  that  the  measure  of  a  set  of  points,  measurable  in 
p  + 1  dimensions,  can  be  exhibited  as  the  single  //-integral  of  the  measure 
of  the  ^-dimensional  section  of  the  set  by  planes  perpendicular  to  one  of 
the  coordinate  axes,  or  more  generally  as  the  Z-integral  of  the  g-dimensional 
section  by  a  (p  —  g)-dimensional  section  of  the  set. 

Let  E  denote  a  measurable  and  bounded  set  of  points,  in  the  p-dimen- 
sional  space  between  the  two  planes  x(l}  =  a(l],  x(l]  =  b(l},  perpendicular  to  the 
x(l  -axis.  Denote  by  E(x(l))  the  ja-dimensional  set  which  is  the  section  of  E  by 
the  plane  corresponding  to  any  fixed  value  of  x(l)  in  the  interval  (a(l),  6(l)). 

The  set  E  is  contained  in  a  set  of  closed  cells  A,  of  measure  <  m  (E)  +  e, 
where  e  is  an  arbitrarily  small  positive  number  (see  §  127).  Giving  to  e  the 
values  in  a  sequence  of  decreasing  numbers  that  converges  to  zero,  we  have 
a  sequence  {At|,  of  sets  of  closed  cells;  and  the  sequence  may  be  so  chosen 
that  each  set  contains  the  next.  The  inner  limiting  set  of  {At|  is  a  set  E1 
which  contains  E,  and  is  such  that  m  (E^  =  m  (E). 

Let  Atl,  A12,  ...  Ain  be  the  cells  of  the  set  At;  we  have  then 

oo      rb 

mi  /\    \  ~~*     ^       f     •>»  J  /\        I  /r1  (1)  \  [  fi  wW 
\L*(.)  —  I  1*™»*1  \"^      /J  "'"'      J 

n  =  1  J  a 

where  Atn(#(1))  is  the  section  of  the  cell  Ain  by  a  plane  perpendicular  to  the 
#(1)-axis,  and  the  measure  m  {A,w  (#(1))|  is  the  jo-dimensional  measure  of 
Aul(#(1)).  Since  an  Z-integral  is  completely  additive,  we  have 
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In  virtue  of  the  theorem  of  §  398,  we  now  have 
m  (E)  =  m  (E,)  =  lira  m  (At)  =  f  m  {E, 

t~oo  .'  a 

since  lim  m  {At  (#(1>)}  =  m  {E1 


Again,  the  set  El  —  E,  which  is  of  measure  zero,  is  contained  in  each  set 
of  a  sequence  of  sets  of  cells  {A/},  where  A/  can  be  so  determined  as  to  be 
contained  in  the  set  At.  This  sequence  of  cells  has  an  inner  limiting  set  Ez 
which  contains  E^  —  E. 

We  have  now,  as  before 

m(At')=[  m{^(x^}}dx(l\ 

J  a 

and  therefore,  taking  the  limit  of  both  sides  of  the  equation,  we  have 
[  m  [E2  (a"')}  dacM  -  lim  m  (A/)  =  0. 

J  a  t~ao 

It  follows  that,  for  almost  all  values  of  x(l\  m  {Ez(xM}}  =  0;  and  since 
Ez(x(l)}  contains  the  section  of  El  —  E,  in  {E(x(l])\  exists,  for  almost  all  values 
of  x(i),  and  is  equal  to  m  {El(x(l})},  which  exists  everywhere,  as  El(x(l')}  is  the 
limit  of  a  sequence  of  measurable  sets. 

We  have  now 

m  (E)  =  !bm  {E,  (^•<1))}  dx®  =  f  m  {E(x(l)}}  dxw. 

J  a  J  a 

The  following  theorem  has  now  been  established  : 

If  E  be  a  bounded  measurable  set  of  points  in  (p  +  \}-dimensional  space,  its 
section  E{x}  is  measurable,  as  a  p-dimensional  set,  for  almost  all  values  of  x. 
The  function  m{E(x}},  of  x,  defined  for  almost  all  values  of  x,  is  linearly 
measurable,  and  its  L-integral  with  respect  to  x  is  equal  to  in  (E).  The  coordi- 
nate x  may  be  any  one  of  the  p  +  1  coordinates  which  determine  the  points  of  E. 

Instead  of  taking  x  to  be  one  of  the  coordinates  x(l],  x(2\  ...  x(v+l\  it  may 
be  taken  to  be  typical  of  a  group  (x(l\  x('2\  ...  x(q]),  of  these  coordinates,  where 
q  has  any  value  <  p  +  1.  The  section  E(x)  will  then  denote  that  (p  +  l—  q)- 
dimensional  set  which  is  the  section  of  the  set  E  obtained  by  giving 

x(l\  x(*\  ...#<«> 

fixed  values.   This  will  then  entail  no  essential  modification  in  the  above  proof 
of  the  theorem;  and  accordingly  we  obtain  the  following  more  general  result: 

If  E  be  a  bounded  measurable  set  of  points  in  (p  +  l)-dimensional  space,  its 
section  E  (x(l},  x('2\  .  .  .  x(<*]},  where  q  <  p  +  1,  is  measurable  as  a  set  in  (p  +  l  —  q)- 
dimensional  space,  for  almost  all  points  (x(l},  x{2\  .  .  .  x(i}}.  The  function 


is  measurable  as  a  q-dimensional  set,  and  its  L-integral  with  respect  to  (x(1\  x^\  .  .  . 
#(2>)  is  equal  to  m  (E).  Instead  of  x(l\  x(<i],  .  .  ,  x(<*\  we  may  take  any  q  of  the  p+l 
coordinates  x(l),  x{2},  ...  #(p+1). 
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428.  Let  f(x(1\  x(z\  ...  x^)  be  any  bounded  function,  defined  in  the  cell 
(a(l),  a(2),  ...  a{P};  b(1\  6(2),  ...  6(^),  and  measurable  in  that  cell.  We  first  assume 
that  the  function  is  at  all  points  ^  0.  The  integral  of  the  function  may  be 
regarded  as  the  measure  of  a  (p  +  l)-dimensional  set,  defined  in  a  (p  +  1)- 
dimensional  cell  (a(l),a(2),  ...  a(^+1);  b(1\  b(2\...b{P+1)),  where  a(P+1)  is  any  number 
less  than  the  lower  boundary  of  the  function  in  the  cell,  and  b(p+1}  is  any 
number  greater  than  its  upper  boundary. 

We  may  suppose  the  function  to  be  defined  to  have  the  value  zero  at  any 
point  of  the  (p+  l)-dimensional  cell,  for  which  it  was  not  originally  defined. 

We  now  have,  from  the  foregoing  theorem, 


f 
J 


(ad),  0(2), 

/•(fed),  6<2>,      6<«>)  ( 
J  (ad),  a  <2>,...  a  (8>)  (J  (a<«+1>,  a  (9+2),  ..  .«(»)) 


for  every  value  of  q  (<  p). 
If  we  take  q  =  1,  we  have 


r 

J 


(a)"  Jam  (J(a(2),  a(3),...a(P)r 

/•(&) 
By  repeated  use  of  this  result,  we  express  I     f(x)  dx  as  a  repeated  integral, 

J  (a] 

taken  successively  with  respect  to  each  coordinate;  and  it  is  clear  that  the 
order  in  which  the  integrations  are  taken  is  immaterial. 


In  particular,  we  see  that 


f    r&(2) 

J 

[J  a&) 


ad) 

If  the  function  f(x(l\  ...  x^)  is  no  longer  restricted  to  be  non-negative,  it 
can  be  expressed  as  the  difference  of  two  measurable  non-negative  functions, 
and  the  general  theorem  may  be  applied  to  each  of  these  functions.  We  have 
then  the  following  result. 

If  f(x^\  x^}  be  defined  in  the  cell  (a(l),  a(2);  b(l),  6(2)),  and  be  bounded  and 
measurable,  the  two  repeated  integrals 


dx®,  f  (xU,  a® 

JaW.  aWJ 

r(bW,  6<2)) 

both  exist,  and  are  equal  to  f  (x(l\  x(z]}  d  (x(l),  x®\ 

J(a(D,  a«r. 
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,-&(!)    ,~b  <2) 

The  form  /(#(1),  #(2))  dacw.  dx(2)  is  employed  to  denote  the  repeated 

J  a'1*  '  a<2> 

rfcll)    f    /-ft  (2)  J 

integral          j  I       f(x(l\  #(2))  <&e(2)  f  c&e(l),  in  which  the  integration  is  taken, 

Ja<i>  (]aMJ  ) 

first  with  respect  to  # (2),  and  then  with  respect  to  x(l). 

It  has  been  shewn  that  I  f  \x^ ,  #(2))  dx®  has  a  definite  value  for  almost 
all  values  of  x^\ 

More  generally  we  have  the  theorem  that: 

The  L-integral  of  a  function  ofp  variables,  defined  in  a  cell  (a,b),  and  bounded 
and  measurable  in  that  cell,  is  equivalent  to  the  repeated  integral  obtained  by 
integrating  the  function  successively  with  regard  to  the  p-coordinates,  taken  in 
any  order. 

429.  In  order  to  extend  the  results  of  §  428  to  the  case  of  an  unbounded 
function,  it  will  be  sufficient  to  consider  the  case  in  which  the  function  is  non- 
negative,  since  any  summable  function  can  be  treated  as  the  difference  of  two 
such  functions. 

In  the  two-dimensional  case,  let  f(x^\  #(2))  be  a  non-negative  function, 
summable  over  the  measurable  set  E.  Let  [kn]  be  a  monotone  sequence  of 
positive  numbers  without  upper  limit,  and  \etfn  (#(l),  #(2))  =f(x(l),  «(2)),  when 
f(x(l\  a?(2))  ^  kn,  and  let/n(a?W,  x®)  =  kn,  when /(a;  W,  a?W)  >  kn. 

We  have  then,  from  §  428, 

r  f         r 

f  (r^>    T^\  d  (T^    T^\—  I  da1®  I  f  (r^  irW\fJjr® 

f  11    V  w         %    tv  /     Ll/    \  w          .    \AJ  I   I      \AjtAJ  I     I  ¥L  \W         «t//  /  IX/w          • 

./  **  \  '  /  \  *  /  I  I/  "  \  J  / 

JCE)  J  J 

If  [     /(«(1);  «(a))  rf  («(1),  «(2))  exists,  it  is  defined  as 
J(E) 


which  is  equal  to 

lim 


Denoting  I  /w  (ic(l),  ^c(2))  c?a;(2)  by  sn(xM),  we  see  that  (sM(^(l))}  is  a  monotone 
non-decreasing  sequence,  for  each  value  of#(l);  and  on  the  assumption  that 
I  /(#(l),  #(2))d(#(l),  «(2))  exists,  we  conclude  that  lim  I  sn  (%(l))  dx(l}  exists, 

J(£)  n~x>J 

and  has  the  same  value  as  the  integral. 

Applying  the  theorem  of  §  399,  we  see  that  {sn  («(1))}  converges  for  almost 
all  points  #(l),  and  also  that 

fs  O(l))  dx®  =  (     f(xU,  x®)  d  (a;W,  x®)] 
J  J(E) 

where  s  («W)  =  lim  sn  («(l))  =  lim  f/n  («W, 

n~oo  n~aoJ 
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Applying  again  the  theorem  of  §  399,  we  see  that,  for  each  value  of  #w  for 
which  lim  f/n(«(1),  x®)dx®  is  finite, 

n~oo  J 

f/(««,  #(2))  dx®  =  lim  [/„  (#<",  «  W)  <fo(2). 

J  «~00  J 

Therefore  f'd&pW  f/O(1),  #(2))  rfa?w 

exists,  and  is  equal  to 

f  '/(*»,«»)  <*(*»,•»). 

•/(£) 

The  complete  result  may  be  stated  as  follows: 

Iff(xb\  x®)  be  any  function,  bounded  or  unbounded,  that  is  summable  in 
the  measurable  and  bounded  set  E,  then  the  repeated  integrals 

I  dx®  \f(x®,  a®)  dx®,     I  dx®  J/0(1),  x®)  dx® 

both  exist,  and  have  the  same  value  as  the  L-integral  function  over  E. 

It  is  clear  that  this  statement  includes  the  case  of  a  function  of  any 
number  of  variables,  as  x®  and  x®  may  each  be  regarded  as  typical  of  a  set 
of  variables.  The  first  proof  of  this  result  was  given  by  Fubini*,  who  de- 
duced the  theorem  from  a  definition  due  to  Lebesgue,  of  the  superior  and 
inferior  integrals  of  a  function,  which  are  not  identical  with  Darboux's  upper 
and  lower  integrals,  defined  in  §  331.  The  above  proof  of  the  theoremf  was 
given  by  Hobson,  who  also  gave  the  following  extension: 

If  f(x^\  #(2))  be  summable  in  the  measurable  and  bounded  set  E,  and  if 

dx® 


has  a  definite  value,  then         f(x®,  #('2')  d(x(-l\  #(2))  exists,  and  is  consequently 

J(P) 
equal  to  the  repeated  integrals  off(x('i\  #(2)). 

Let  us  consider  the  function  \f(x^\  #(2))  |,  and  apply  the  last  part  of  the 
theorem  of  §  399.    Thus,  if  Is  (xM)dx®  exists,  it  follows  that  lim  I  sn  (#( 

J  w~oo  J 

exists,  and  is  equal  to 

Also 

Wda?W=     d»W  f     nxb\  x^dx^  = 

(E) 


where  8n(x®)  denotes  \\fn  (x(l\  x($)  \  dx®. 


*  Rend.  del.  Real.  Accad.  dei  Lincei,  vol.  xvi. 
+  Proc.  Land.  Math.  Soc.  (2),  vol.  vin,  p.  22. 

H.  37 
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f         f 

If  we  assume  that  I  dx™  \  \f(x^\  x^)  j  dx^  exists,  we  see  that 

I          /V-r'W    -r(2h    d  fvb)    ffeh 

/    \tjUj   w       )     Uf  \  JU       j   Jj        ) , 

J  (E)  ; 

which  is  defined  as 

r  r 

lira         fn  (a?W,  x(2))  d  0(1),  «(2)),  or  as  lim     sn  («(l))  rfa;^), 

w~oo  J  (£)  n~oo  J 

f  f  /" 

exists,  and  is  equal  to  ls(a;(1))  c?«(1),  or  I  c?a;(l)  |    f(x(l\  x®)  dx®. 

It  then  follows  that/(a;(l),  #(2))  is  summable  in  ^". 

In  the  case  in  which  f(x(l\  #(2))  is  bounded  in  E,  the  theorem  is  an  imme- 
diate corollary  from  the  preceding  theorem. 

It  may  happen  that  the  repeated  integrals  of  /(#(l),  x^)  exist,  but  that 
I  dx^  \  \  f  (x^\  x®)  dx^   does  not  exist.    In   that   case   the  Z-integral   of 

f  (x(l\  x®)  does  not  exist,  and  the  two  repeated  integrals  of  f(x(l\  x(2))  may 
have  unequal  values. 

nl          ,p2    y'2  fl       f-1          y,2    _    yZ 

-r- — ~^adxdy,  I    I    — — ?—-dydx  both  exist,  but  have 
>  (x*  -hy2)2          y  Jo  Jo  (x2  +  y2)2 

^qual  val 
(0,  0;  1,  1). 


yO,  _  ^.2 

unequal  values,  the  function  -r    —  '—  -  not  being   summable   in  the  square 

O2  +   2)2 


A  FUNDAMENTAL  APPROXIMATION   THEOREM. 

430.  The  following  theorem  affords  the  means  of  relating  the  Z-integral 
of  a  summable  function  with  a  sequence  of  integrals  of  continuous  functions, 
in  such  a  way  that  various  properties  of  the  Z-integral  may  be  obtained 
by  considering  the  special  case  in  which  the  integrand  is  a  continuous 
function  : 

If*  f(x)  be  summable  in  a  given  interval,  or  cell,  (a,  b),  a  continuous 
function  (j>  (x)  can  be  constructed  which  satisfies  the  condition  that 


is  less  than  an  arbitrarily  prescribed  positive  number.  Moreover,  in  case 
f(x)  =  0,  in  (a,  6),  the  function  <f>  (x)  can  be  so  determined  that  <f)(x)  ^0,  in 
(a,  b). 

The  proof  of  this  theorem  may  be  exhibited  as  a  process  which  has  several 
stages. 

*  See  Hobson,  Proc.  Lond.Math.  Soc.  (2),  vol.  xn,  p.  156.  . 
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(1)  Consider  the  case  in  which  f(x)  =  1,  in  the  cell,  or  interval,  (a,  /3) 
contained  in  (a,  b),  and  is  elsewhere  zero. 

Let  the  continuous  function  <f>(x)  be  defined  to  be  equal  to  1  in  the  cell 
and  to  be  equal  to  1  —  0  on  the  boundary  of  the  cell 

for  each  value  of  d  such  that  0  ^  6  ^  1 ;  and  outside  the  cell  for  which  0  =  1, 
let  0  (#)  =  0.  We  may  disregard  all  points  that  are  not  in  the  fundamental 
cell  (a,  b). 

The  function  <f>  (x)  has  values  between  0  and  1  in  the  set  of  points  between 
the  two  cells  (a,  /3)  and  (a  —  e,  (3  +  e) ;  and  it  is  continuous  in  (a,  b).  Since 
f(x)  —  <j)(x)  vanishes  in  the  cell  (a,  /3),  we  see  that 

rb 

I     f(x}  —  <f>  (x)  |  dx 

J  a 

is  less  than  Ae,  where  A  is  a  fixed  number  independent  of  e;  and  this 
is  arbitrarily  small. 

(2)  Let/(#)  =  1,  in  each  cell  of  a  finite  set  A1(  A2,  ...  An,  of  non-overlapping 
cells  contained  in  (a,   b),  and  let  f(x)  =  0  at  all  other  points  of  (a,  b).    Let 
<f>r(x)  be  the  continuous  function,  defined  as  in  (1),  such  that,  if  fr(x)  =  l  in 
Ar,  and  everywhere  else  =0, 

[b   I    S    <  /     N      J  1 

f     (  ffi\  —  m      (  (¥}  1      (JiOC  ^ €.  ' 

J  a      '  n 

also  suppose  </>,•(#)  defined  for  r  =  1,  2,  3,  ...  n.    Let 

(£  (x)  =  <pl  (x)  +  (f)2  (x)  +  •  •  •  +  <j>n  (#)  j 

we  have  then 

rb  r-=n  r  b 

\f(x)  —  (j)  (x)  dx  <  2        \fr  (x)  —  <f>r  (x)    dx  <  e. 

J  a  r  =  l J a 

The  function  </>  (x)  is  continuous,  and  satisfies  the  condition  in  the  theorem 
for  the  particular  function  f(x). 

(3)  Let  f(x)  =  1  in  a  measurable  set  of  points  e,  contained  in  (a,  b) ;  and 
let/O)  =  0  in  C(e). 

A  set  of  non-overlapping  cells  Aj,  A2,  ...  A,n,  ...,  of  total  measure  exceed- 
ing m  (e)  by  less  than  |e,  can  be  so  determined  that  they  contain  all  the  points 
of  e  either  within  them  or  on  their  boundaries.  An  integer  n  can  be  so  chosen 

CO 

that    2     m  (Ar)  <  ^e,  and  that 

r  =n 

0  <    2  m  (A,.)  -  m  (e)  <  £e. 

r  =  l 

Let  fn  (x)  be  the  function  that  has  the  value  1  in  each  of  the  cells 

An  Aa,  ...  An, 

37—2 


580  The  Lebesgue  integral  [OH.  VH 

and  elsewhere  the  value  zero.     In  virtue  of  (2)  a  continuous  function  <£  (x) 
can  be  so  determined  that 

'6 

\fn(x)-<f>(x)\dx<^e. 


rb 

We  have  also  \f(x}~fn  (#)   dx  <  ^e ; 

J  a 

rb 

and  therefore  1/0*0  — $  (#)  |  <&c  <  e. 

.  a 

The  continuous  function  <f>  (x)  satisfies  the  condition  of  the  theorem  for  the 
particular  function  /  (x). 

(4)  Let  e1}  e2,  ...  em  be  m  measurable  sets,  no  two  of  which  have  a  point 
in  common,  and  all  of  which  are  in  the  cell  (a,  b).  Let  numbers  cly  c2,  ...  cm 
be  prescribed,  and  let/(#)  be  the  function  that  has  the  value  cr  in  the  set  er, 
for  the  values  1,  2,  3,  ...  m,  of  r,  and  has  the  value  zero  at  points  that  do  not 
belong  to  any  of  the  sets  er. 

Let  <f>r  (x)  be  the  continuous  function  determined  as  in  (3),  such  that,  if 
fr  (#)=!,  in  er,  and  is  elsewhere  zero, 

\fr  (x)  —  <f)r  (x 


and  let  these  functions  (f)r(x)  be  constructed  for  the  values  r  =  1,  2,  3, ...  m. 
If  <f>  (x)  =  c^  (x)  +  c2<$>2  (x)  +  . . .  +  cm<f>m  (x),  we  have 

•6 

f(x)-<f>(x)\dx 


r-l 


Therefore  <£  (x)  is  the  continuous  function  required  by  the  theorem,  which 
corresponds  to  the  function  /(#). 

(5)     Let  f(x)  be  a  bounded  and  summable  function.    In  accordance  with 
§  385,  a  function  f  (x)  can  be  determined  which  is  such  that 

\f(x)-J(x)    <v, 

and  that  f(x)  has  only  a  finite  number  of  distinct  values,  each  of  which  it 
takes  in  a  measurable  set  of  points.  Let  <£  (x}  be  the  continuous  function, 
constructed  as  in  (4),  such  that 

rb    _ 

l/0)-</>O)  dx<*- 

J  a 

We  now  have 

f "  f(x)  -<j>(x)\dx*  f  \f(x)  -f(x)\  dx+  f  |7fc)  -  0  (x)  I  dx 

J  a  J  a  J  a 

<  ijm  (a,  b)  +  e ; 
where  m  (a,  b)  is  the  measure  of  the  fundamental  cell  (a,  6). 
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Since  77  and  e  are  arbitrarily  small,  a  continuous  function  (ft  (x)  such  as 
the  theorem  requires  can  be  determined,  corresponding  to  any  bounded 
function  that  is  summable  in  (a,  6). 

(6)  Lastly,  let  f(x)  be  unbounded,  but  summable  in  (a,  b).  A  bounded 
summable  function  f$  (x)  can  be  determined,  as  in  §  386,  such  that 

•b 


is  arbitrarily  small,  say  <  ie;  then  if  <f)(x)  be  a  continuous  function,  obtained 

as  in  (5),  such  that 

rb 

\/N  (#)  ~~  $  (#)  |  dx  <  f  e, 


rb 
we  have  /    \f(x)  —  $  (x)\  dx<  e. 

J  a 

This  completes  the  proof  of  the  theorem,  for  the  general  case  of  any 
summable  function  f(x).  It  is  clear,  by  examining  the  successive  stages  of  the 
process  of  construction  of  (f>  (x},  that  if  f(x)  ^  0,  in  (a  b),  then  (f>  (x)  ^  0, 
in  (a,  b). 

We  observe  that 


f 

J  a 


dx 


If  e  have  successively  the  values  in  a  diminishing  sequence  {en}  that 
converges  to  zero,  and  if  <f>r  (x)  be  the  continuous  function  that  corresponds 
to  the  value  er,  of  e,  we  see  that 

rb  rb 

lim        (f)r  (x)  dx  =      f(x)  dx, 

r~<x>   J  a  .  a 

rb 
and  also  lim  I    \f(x)  —  <f>r  (x)  \  dx  =  0. 

r~ao  J  a 

431.  Various  theorems  of  importance  in  the  theories  of  integration  and 
of  series  can  be  deduced  from  the  theorem  of  §  430. 

Let  f(x)  be  summable  in  the  open  linear  interval  (a,  /3),  and  let  (a,  b)  be 
a  closed  interval  contained  in  («,  ft).  Consider 


where  t  is  <  /3  —  b,  so  that  the  integrand  is  defined  in  (a,  b).   Let  <£  (#)  be  the 
continuous  function,  determined  as  in  the  theorem  of  §  430,  so  that 

dx  <  £e. 

The  integral  of  \f(x  +  t)—f(x)\  over  (a,  6)  does  not  exceed  the  sum  of  the 
integrals  of 

—  <£(#)!,  and  |  (f>  (x)  —f(x)  \ 
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over  the  same  interval.  The  first  and  the  third  of  these  integrals  are  each 
<  |e,  and  the  second  integral  is  also  <  Je,  provided  t  is  less  than  a  fixed 
number  £€,  dependent  on  e.  Therefore 

[b 


if  t  <  Se  ;  and  since  e  is  arbitrary,  it  follows  that  the  limit  of  the  value  of  the 
integral,  as  t  ~  0,  is  zero. 

We  have  thus  obtained  the  following  theorem,  which  has  been  established 
in  a  different  manner  *  by  Lebesgue  : 

Iff(x)  be  summable  in  an  open  linear  interval  (a,  ft),  and  if  (a,  b)  be  a  closed 
interval  interior  to  (a,  ft),  then 

lim 


lim  fi 

t~Q  J  a 


The  corresponding  theorem  holds  for  a  summable  function  defined  in  an 
open  cell  (a,  ft)  of  any  number  of  dimensions ;  thus,  in  the  case  of  a  function 
f(xw,  x(z])  of  two  dimensions,  we  have 

lim        I      '          /•(«»>  +  <w,  x®  -f  t&)  -  f(a>®,  x®}  \  d  (xw ,  x®)  =  0. 

If     (1  \         ('2\  \ 

No  essential  change  in  the  proof  of  the  theorem  for  the  linear  case  is 
required  to  make  this  extension. 

432.  The  following  theorem,  also  due  to  Lebesgue f,  may  be  deduced  from 
the  theorem  of  §  430  : 

If  f(x]  be  summable  in  a  linear  interval  (a,  b),  then,  for  almost  all  points 
oc0,  in  (a,  b),  \f(x)  —  a.  is,  for  x  =  x6,  the  differential  coefficient  of  its  indefinite 
integral,  whatever  value  the  constant  a  may  have.  In  particular 

\f(x)-f(x,}\dx 

has  a  differential  coefficient  at  X  =  XQ,  equal  to  zero,  for  almost  all  values  of  #0, 
in  (a,  b). 

CXa+h 

The  integral  I         \f(x)  —  a\dx 

is  between  the  two  numbers 

|  (j>  (x)  —  a  dx  ±  \f(x}  —  4>  (#)  |  dx, 

where  0  (x)  is  a  continuous  function  which  we  may  take  to  be  such  that 

b 


*  See  the  Legons  sur  les  series  trigonometriques,  p.  15. 
t  Loc,  cit.,  p.  13. 
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It  follows  from  this  inequality  that  the  set  of  points  at  which 


583 


has  its  measure  less  than  e  ;  and  therefore,  if  x0  be  any  point  of  a  certain  set 
Ht,  where  m  (Ht)  >  b  —  a  -  e,  the  condition  |/(#0)  —  <J>  (#<,)  !  <  e  is  satisfied. 

Since    (f>  (x)  —  a  |  is  a  continuous  function, 

1  rx0+h 
Km  j-  I          0  (x)  —  a  da; 

A~O  h  J  x0 

exists,  and  is  equal  to    </>  (XQ)  —  a  ,  for  all  points  #„  interior  to  (a,  6). 

For  almost  all  values  of  x0, 

1 


exists,  and  is  equal  to    f(x0}  —  <f>  (x0)  ,  since  |/(#)  —  $  (x)  j   is  a  summable 
function  (see  §  405). 

We  now  see  that 

_  1   rzo+fc 

lira  j-  I         |  f(x)  —  adx 

7lM)  ^  -  ^o 

is  between  the  numbers 


for  almost  all  values  of  x0,  whatever  value  a  may  have.  In  a  set  £"e,  such  that 
m  (-ff'e)  =  m  (.£Q  >  b  —  a  —  e. 

it  follows  that  Km 


is  between  the  numbers  /(#o)  —  a  |  +  2e. 

Let  e  have  the  values  in  a  sequence  which  converges  to  zero,  then  m 
converges  to  b  —  a;  we  see  then  that,  for  almost  all  values  of  x0, 

f(x)  —  a\dx  • 


i  [x° 

lim  T 


exists,  and  is  equal  to  \f(®o)  ~  a 

whatever  value  a  may  have.    The  theorem  has  now  been  established. 

433.  Let  /j  (x)  be  a  function  which  is  summable,  and  is  S  0,  in  a  cell,  or 
interval,  (a,  6);  let  us  further  assume  that  [fi(x)}2  is  summable  in  (a,  b). 
A  continuous  function  ^(x),  at  every  point  ^  0,  can  be  so  determined  that 

' 


is  less  than  e/4.    We  have,  for  every  value  of  x, 

L/iW-fiWiK/iW- 

and  it  follows  that 


f  {fl(x)-<f>l(x) 

J  a- 
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Next,  let/(#)  be  no  longer  restricted  to  be  ^  0,  in  (a,  b),  but  let 
be  summable  in  that  cell,  or  interval  ;  f(x)  can  then  be  expressed  as  the 
difference  f^  (#)  —  fz  (x)  of  two  functions,  each  of  which  is  ^  0,  in  (a,  b),  and 
such  that  the  square  of  each  of  them  is  summable  in  (a,  6). 

Let  <j>.2(x)  be  a  continuous  function,  corresponding  to/2(#),  such  that 


and  let  <£  (#)  =  fa  (x)  —  <f>2  (x). 

We  have  now 

f  {f(x}  -  <£  (x)}*  dx  ^  2  I"  {,/•  (a?)  -  </>!  (*)}'  efe 

J  a  J  a 

+  2  f  {/,(*)  -&(*)}»(&, 

.'a 

and  the  expression  on  the  right-hand  side  is  <  e. 

The  following  theorem  has  thus  been  established  : 

If  [f(x)}2  be  summable  (whether  it  be  bounded  or  unbounded)  in  the  interval, 
or  cell,  (a,  b),  a  continuous  function  <j>(x)  can  be  so  determined  that 


is  less  than  a  prescribed  positive  number  e. 

From  this  theorem  a  result  can  be  deduced  which  has  been  otherwise 
obtained,  for  the  case  of  a  linear  interval,  by  A.  C.  Dixon*. 

If  {f(x}}2  be  summable,  in  an  open  cell,  or  interval,  (a,  /3),  and  (a,  b)  be  a  cell, 
or  an  interval,  contained  in  (a,  /3),  then 

lim  I  '{f(t  +  x)-f(t)}2dt  =  Q. 

In  the  case  of  »a  cell  of  two  dimensions,  x  represents  a  pair  (x(l),  x{2})  of 
numbers,  and  the  limit  is  taken  as  xw,  x{'2}  converge  independently  of  one  another 
to  zero. 

We  assume  x  to  be  such  that  t  4-  x  is  in  (a,  /S),  for  all  values  of  t  in  (a,  b). 
If  $  (t)  be  a  continuous  function  such  that 


we  have     I*  \f(t  +  x)  -f(t)}2  dt  <  3  f    [f(t  +  <c)-<f>(t  +x)}2  dt 

J  a  •>  a 


Proc.  Caml>.  Phil.  Soc.,  vol.  xv,  p.  210. 
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The  first  and  third  integrals  on  the  right-hand  side  are  each  <  e.  The 
function  <f>  (t)  being  uniformly  continuous  in  a  closed  interval  (a',  6')  which 
contains  (a,  6),  and  is  in  (a,  /3),  we  see  that  if  |  a;  \ ,  or  in  the  two-dimensional  case, 
and  |  x®  ,  be  sufficiently  small,  the  second  integral  is  <  e.  It  follows  that 

b{f(x+t)-f(t)Ydt<9e, 


if  |  #  |  be  sufficiently  small.     Since  e  is  arbitrary,  the  theorem  has  been 
established. 

From  this  result,  the  following  theorem*  may  be  deduced : 

If  {f(x)Y  be  summable  in  an  open  interval,  or  cell,  (a,  /3),  and  (a,  b)  be 
contained  in  (a,  ft), 


is  a  continuous  function  of  x,  for  all  points  x  in  (a,  b). 

In  the  two-dimensional  case,f(t  +  x)  denotes  f(t(l)  +  xw,  t{2} 
Let  F  (x)  denote 


then,  employing  Schwarz's  inequality,  we  have 

{F(x1)-F(x2}}^  \b  [f(t)}'dt  ["{/(t+xj 

J  a  J  a 

the  second  integral  on  the  right-hand  side  is  less  than 

f' {/(*+*') -/(OP  <^ 

J   ttj 

if  («!,  /3j)  is  an  interval  or  cell  contained  in  (a,  /3),  and  x'  denotes  xl  —  x2. 
This  integral  converges  to  zero,  with  x1  —  x<2;  hence  F  (xj)  —  F  (#2)  \  is 
arbitrarily  small,  for  all  sufficiently  small  values  of  x1  —  x2\,  and  therefore 
F(x)  is  a  continuous  function. 

APPROXIMATE   REPRESENTATION   OF   AN   L-INTEGRAL   AS   A   RIEMANN   SUM. 

434.  Let  f(x)  be  summable  in  the  interval,  or  cell,  (a,  6),  and  let  fy(x) 
denote  the  function  that  has  the  same  values  as  f(x)  at  all  points  at  which 
|  f(x)  \  ^  N,  and  which  has  the  value  N,  or  —  N,  at  all  other  points  of  (a,  6). 
Let  the  measure  of  the  set  E,  of  points  at  which  |  f(x)  \  >  N,  have  the  value  e. 

By  the  theorem  of  §  430,  a  continuous  function  <f>  (x)  can  be  so  determined  that 

fb 

\    I  /N  (x)  —  $  (#)  dx  <  e2.    In  a  certain  set  of  points  L,  such  that  m  (L)  <  e,  we 

J  a 

may  have     fN (x)  —  $(x)\  ^  e ;    in  the   complementary  set    C (L)   we    have 

*  For  the  case  of  a  linear  interval,  this  theorem  has  been  proved  in  a  different  manner  by  A.  C. 
Dixon,  loc.  cit. 
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I  /N  (#)  —  <£  (#)  I  <  6-     The   function   <£  (a?)   may   be   taken   to   be   such   that 
<£(#)   ^  JV;  for  if  it  have  values  numerically  >  N,  these  may  be  replaced  by 
N  or  —N,  without  affecting  the  conditions  that  $ (x)  is  to  satisfy. 


r=m 


A  net  can  be  fitted  on  to  the  interval,  or  cell,  (a,  b),  such  that  2  <£  (fr)  m 

r=l 

rb 

differs  from  I    <f>  (x)  da;  by  less  than  e;  where  B1}  B3,  ...  Bm  denote  the  m  meshes 
J  a 

of  the  net,  and  the  points  {£r}  are  chosen  in  any  manner  so  that  £r  is  a  point 
of  Sr.  If  there  are  meshes  of  the  net  such  that  all  their  points  belong  either 
to  L  or  to  E,  the  total  measure  of  such  meshes  cannot  exceed  2e. 

In  all  the  other  meshes  there  are  points  that  belong  neither  to  L  nor  to  E. 
In  these  latter,  the  points  %r  can  be  so  chosen  that  they  do  not  belong  to  L, 
or  to  E.  In  a  mesh  Br,  all  the  points  of  which  belong  to  L  or  to  E,  we  can 
choose  %r  so  that  /(£»•)  differs  from  the  lower  boundary  of  \f(x)  \  in  8,-,  by 
less  than  e.  Let  this  be  done  for  all  such  meshes  ;  then,  denoting  the  set  of 
all  such  meshes  by  A,  where  m  (A)  ^  2e,  we  see  that 

\f(x)\dx. 

(A) 

The  sum  on  the  left-hand  side  is  here  taken  for  all  the  meshes  of  the  set  A. 
We  have  also  2  $  (|v)  ni  (&,.)  <  2  t/e  ;  where  U  is  the  upper  boundary  of 
|  <f>  (x)  in  (a,  b). 

In  the  other  meshes  of  the  net  we  have 


where  £  is  the  measure  of  the  interval,  or  cell,  (a,  b). 

b  rb 


f  f  f 

Also       f(x)  dx  differs  from    /    <£  (x)  dx  by  less  than  e2  +  I       f(x)  \  dx ; 

J  a  J  a  J(E) 

fb  r=m  r 

hence  I    f(x)  dx  differs  from  2   (£(£,-)  m  (S,.)  by  less  than  e  +  e2  +  /     lf(a;)\dx. 

Ja  r=l  J  (E) 

r=m  r=m 

Now   2    (f>  (|v)  m  (£,.)  differs  from    2   /(&•)  in  (Sr)  by  less  than 

r=l  r=l 

le  +  2e*+2Ue+  I       f(x)   dx. 

J  (A) 

As./V'x/oo  ,wehavee^0,andthen  I     \f(oc)   d^andl      |/(^)   d*  converge 

J  (A)  J  (£•) 

to  zero.    Also  2  t/e  (=  2Ne)  converges  to  zero  ;  (see  §  388). 

/b  r=m 

f(x)  dx  differs  from  the  Riemann  sum    2  f(%r)  m  (Br) 
i  r=l 

by  an  amount  which  can  be  made  arbitrarily  small  by  choosing  N  sufficiently 
large. 

We  have  thus  proved  the  following  theorem,  of  which  Lebesgue  *  has  given 
another  proof: 

*  See  Annales  de  Toulouse,  (3),  vol.  i,  p.  33. 
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Iff(x)  be  any  function  that  is  summable  in  the  interval,  or  cell,  (a,  b),  a 

rb 
net  can  be  fitted  on  to  (a,  b)  such  that   I    f(x)  dx  differs  arbitrarily  little  from 

J  a 
r  =  m 

the  Riemann  sum   2   /(£/•)  m  (8r),  provided  the  points  gr  be  chosen  properly  in 

r=l 

the  meshes  8r  of  the  net. 

The  chief  interest  of  this  theorem  arises  from  the  fact  that,  if  we  have  a 
finite  set  of  functions  /(I)  (x),  /(2)  (x},  .  .  .  f(8)  (x),  all  summable  in  the  interval, 
or  cell,  (a,  b),  it  is  possible  to  determine  a  net,  and  a  definite  set  of  points 
£i>  £2,  •••  %m  in  the  meshes  B1}  82,  ...  8m,  respectively,  so  that,  for  each  of  the  s 
functions,  the  difference  of  the  integral  and  the  corresponding  Riemann  sum 
is  less  than  an  arbitrarily  chosen  number,  the  points  £  being  the  same  for  all 
the  s  functions. 

To  shew  that  this  is  the  case,  only  a  slight  modification  in  the  foregoing 
proof  is  required.  We  have  to  deal  with  s  sets  E1}  E.2,  ...  Es,  corresponding  to 
a  fixed  value  of  N  ;  the  number  e  can  be  taken  to  be  the  greatest  of  the 
numbers  m  (  -&\),  m  (E2),  ...m  (Es).  The  meshes  of  the  net  which  are  such  that 
every  point  of  one  of  them  belongs  to  one  at  least  of  the  2s  sets  m(E1), 
m  (E.2),  ...in  (Es),  m  (L^,  m  (L2),  ...  m  (Lg)  have  a  total  measure  ^  2s  e;  and  these 
meshes  must  be  considered  separately  from  the  others,  as  in  the  case  s  =  1  ; 
they  form  a  set  A,  such  that  m  (A)  ^  2se.  In  one  of  these  meshes  8r,  we  take 


ffr  so  that         |/w  (£,.)  |  differs  from  the  lower  boundary  of         \f(^(x)  I  in 

p=l     '  p=l 

Br,  by  less  than  e;  we  then  have,  for  the  set  of  meshes  A, 

*y  f     \f'W(x)\dx 

p=l    J(A) 

for  each  of  the  values  p=  1,  2,  ...  s,  of  p.    The  preceding  proof  may  then  be 
applied  to  each  of  the  functions  /(1)  (x),  ...f(8)  (x),  without  essential  change. 

We  have  now  obtained  the  following  theorem  : 

Iff(l]  (x},f(-]  (x),  ...  f(s]  (x)  be  functions  that  are  summable  in  the  interval 
or  cell,  (a,  b),  then  a  net  can  be  fitted  on  to  (a,  b),  and  a  set  of  points  £1}  £2,  ...  %m 
can  be  determined  in  the  m  meshes  of  the  net,  so  that,  for  each  of  the  s  functions, 

b  r=m 

f(x)  dx  differs  from  S  f(J~r)  m  (8r)  by  less  than  an  arbitrarily  chosen  number. 


435.    By  means  of  this  theorem,  various  algebraical  inequalities  can  be 
extended  so  as  to  obtain  corresponding  inequalities  which  involve  //-integrals 
The  simplest  case  we  shall  consider  is  that  of  the  well-known  inequality 
(<*!  6,  +  tta  6a  +  . . .  +  am  bmy  £  (a,2  +  a./  +...+  am2)  (b*  +b.i+...+  6m2). 
If  we  write 

«i)}*,  *-/»(fc)  (»&)}*•  .- a.,-/"  (f«)  {!»(«„)}*; 
(S,)i*.  ^  =/w  (&)  {«*(«,)}*»  •••  bm=f®  (?,„)  {m  (SM)}i 
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we  have 


r=l 


J 


2 

r=l 


Employing  the  theorem,  we  have  at  once  the  Schwarzian  inequality  (see 

§  396), 

(  rb  Y       rb  rb 

(J  a  )         J  a  J  a 

since  the  expressions  on  the  two  sides  are  given  to  any  required  degree  of 
approximation  by  the  corresponding  expressions  in  the  preceding  inequality, 
provided  £1,  £2,  ...  £r  ai>e  so  chosen  that  the  theorem  of  §  434  is  applicable  to 
the  three  functions  }/(1)(a?)jf,  {/(2)O)}2>  /(1)  0)/(2)O). 

The  following  more  general  theorems*  may  be  obtained   in   a   similar 
manner : 

ri>  ^  r  rb  ~\i/p    r  /•&  ~ii/g 

7        \  *-^  /  /        \*AS  )  ttw       =    I  i    /        \  vO  /i      CLOC  \   T       ( OC  If     €i  vC  I        * 

f          */  ^         '   •/  \        '  I   ^  \        /  J  \V\fl 

Ja  Ua  J          L'«  J 

where  p  and  q  are  any  positive  numbers  such  that  l/p  +  l/q  =  1 ;  and 

U[/(l)  (#)  +/(2)  («)  I p  dx  [     =\   I     /(1)  (x)  | P  c?«       +  I   /    I  f{2)  (x}  I p  dx 
a  )  \_J  a  J 

where  p  >  1. 

These  are  derived  from  the  known  inequalities 


ar  br 


r=m 
r=l 


r=m 

S 
r=l 


l/p 


r=l 

l/p        P   r=m 


l/p 


r=l  J  L    »•=! 

where  p  and  g-  are  positive  numbers  such  that  l/p  +  l/q  =  1. 

In  the  case  of  the  first  inequality  we  take  ar  =/(l)  (£r)  [m  ($rj\llp>  an^ 

This  method  has  been  applied  f  by  Lebesgue  to  obtain  a  proof  of  the  second 
mean  value  theorem  (§  423). 

436.  Another  method  has  been  indicated  J  by  F.  Riesz  of  expressing  the 
i-integrals  of  functions  approximately  by  finite  sums,  in  such  a  manner  that 
the  above  extensions  of  known  inequalities  to  the  case  of  integrals  can  be 
carried  out. 

It  will  be  sufficient  to  consider  the  case  of  the  summable  functions  /(1)  (as), 
/(2)  (#),  as  the  extension  to  the  case  of  any  finite  number  of  such  functions  will 
then  be  obvious. 

*  See  F.  Riesz,  Math.  Annalen,  vol.  LXIX,  p.  456. 
t  Annales  de  Toulouse,  (3),  vol.  i,  p.  36. 
1  loc.  cit. 
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rb 
It   has   been    seen,   in   §§  385 — 388,   that    the    integrals    I    /(l)  (x)  dx, 

J  a 


b 

/(2)  (x)  dx  differ  respectively  from  two  finite  sums 


«!  m  (gj)  +  a2  m  (e2)+  ...  +apm  (ep), 
at  m  fa')  +  a2m  (e^)  +  .  .  .  +  ap  m  (  ep'), 

by  less  than  an  arbitrarily  chosen  positive  number,  provided  the  increasing 
set  of  numbers  aly  a2,  ...  ap  is  properly  chosen,  and  er,  er'  are  the  sets  of  points 
for  which  ar  £  f(1}  (x)  <  ar+1  ;  ar  ^  f(2}  (x)  <  ar+l  ,  respectively. 

s=p  r=p 

If  we  denote  by  En  the  set  D(er,es),  we  have  er=   2   Ers>  eg=   2,   Ers\ 

*=1  r=l 

all  the  sets  Ers  can  be  arranged  in  order  as  a  single  sequence  {Fq},  and  the 
two  finite  sums  then  take  the  forms 


where  Fq  is  the  set  Ers,  and  Aq  =  ar,  Bq  =  as.    In  the  set  Fq,  we  have 

ar  2i/(1>  (as)  <  Or+l,  as  ^/(2)  (x)  <  as+l. 
It  is  clear  that  AlB1m  (F^  +  A2B2m  (F2)  +  .  .  .  +  Ak  Bk  m  (Fk)  is  an  approxima- 


f6 
tion  to 

.'a 


LEBESGUE   INTEGRALS   OVER   AN   UNBOUNDED   FIELD. 

437.  The  definition,  in  §§  385 — 388,  of  the  .L-integral  of  a  summable 
function  f(x)  over  a  measurable  set  E,  is  applicable  when  E  is  unbounded, 
provided  it  have  a  finite  measure,  in  accordance  with  the  definition,  given  in 
§  134,  of  the  measure  of  such  a  set.  We  proceed  to  consider  the  case  in 
which  E  is  a  set  such  that  the  portion  of  it  in  each  finite  cell,  or  interval, 
is  measurable,  but  in  which  E  has  not  a  finite  measure.  Let  f(x)  be  a 
function,  defined  over  the  set  E,  and  everywhere  —  0 ;  and  let  it  be  assumed 
that  f  (x)  is  summable  in  every  finite  cell,  or  interval,  A. 

If         f(x)  dx  converges  to  a  definite  number,  as  the  distances  of  all  the 

.'(A) 

boundaries  of  A  increase  indefinitely,  in  any  manner,  that  number  is  said  to 


define  the  I/-mtegral         f(x)  dx,  off(x),  over  the  unbounded  set  E. 

J  (E) 

If  the  set  E  be  in  p  dimensions,    I      f(x)  dx  is  the   measure  of  the 

J  (A) 

(p  +  l)-dimensional  set  of  points  (#(l),  x(-\  . . .  x(p\  y),  where  a(r)  <  x(r)  =  b(r\  for 
r=  1,2,  3,  ...jo,  and  0^  y  ^f(x(l\  x(*\  ...  x^);  the  cell  A  being  (aw,a{-2}, ...  a(?]; 
b(l\  6W, . . .  bW).  At  points  x,  not  in  E,  we  take  f(x)  =  0. 
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It  is  clear  that,  when  a  sequence  of  cells  A  is  taken,  each  one  of  which  is 
contained  in  the  next,  and  such  that,  as  we  progress  in  the  sequence,  all  the 
numbers  6(l),  6(2),  . . .  6(p)  become  indefinitely  great,  and  are  positive,  whereas 
the  numbers  a(l),  a(2),  ...  a(v)  become  indefinitely  great,  and  are  negative,  the 

values  of         f(x)  form  a  monotone  sequence.    Assuming  that  this  sequence 

.'(A) 

has  a  finite  upper  limit,  that  limit,         f(x}  dx,  is  the  measure  of  the  (p  +  1)- 

J  (E) 

dimensional  unbounded  set  for  which 

-oo  <tfw<oo,  (r=l,  2,  3,  ...p),  Q£y^f(x); 
it  being  assumed  that  f(x)  =  0  at  all  points  that  are  not  in  E. 

If  f(x)  is  not  ^  0,  at  all  points  of  E,  it  may  be  expressed  as  the  difference 
f+(x)—f~(x)  of  two  functions  /+  (x),  f~ (x},  each  of  which  is  ^0;  where 
y-(#)  =  0,  at  any  point  x  at  which  f  (x)  is  positive. 

In  tase  the  two  integrals  I      /+  (x)  dx,   I      f~  (x}  dx  both  exist,  in  ac- 

J  (E)  J  (E) 


cordance  with  the  above  definition,  the  Z-integral  /  .  f(x)  dx  is  defined  to  be 

c  f 

the  value  of  I      /+  (as)  dx  —   I      /~  (x)  dx ;    and  f(x)   is  said  to   have   an 

J  (E)  J  (E) 

absolutely  convergent  Z-integral  over  the  unbounded  set  E.    The  existence 
of  the  absolutely  convergent  integral  entails  the  existence  of  the  integral 

I        f(x}   dx. 

J  (E) 

It  may  however  happen  that,  although   I      /+  (x)  dx,   I      /~  (x)  dx  become 

./(A)  J  (A) 

indefinitely  great  as   the   cell,  or  interval,  A    becomes   indefinitely   great, 

r 

I       {f+(x)  —f~(x)}  dx  converges  to  a  finite  limit.    In  that  case,  the  limit  is 
J  (A) 
said  to  define  the  non-absolutely  convergent  Z-integral  of  f(x)  over  the  un- 

r 
bounded  set  E.    When         f(x)  dx  exists  as  a  non-absolutely  convergent 

integral,   I       I  f(x)  \  dx  does  not  exist,  because          f  f+  (x)  +  f~  (x]\  dx  does 

J  (E)  J(E) 

not  converge. 

rx, 

438.    In  the  case  of  integration  over  a  linear  integral   I     f(x)  dx  is  the 

J  a 

limit,  as  b  ~  oo  ,  of  I    f(x)  dx ;   it  being  assumed  that  f(x)  is  summable  in 

7  a 

/•oo  /•  oo 

(a,  6),  for  all  values  of  6(>a).    If   I    |  f(x)  \  dx  exists,  the  integral  I     f(x)dx 

J  a  J  a 

is  said  to  be  an  absolutely  convergent  .//-integral  over  the  unbounded  interval 
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foe  Too 

(a,  oo  ).    If        |  f(x)  \  dx  is  not  finite,  I    f(x)  dx  is  said  to  be  a  non-absolutely 

J  a  J  a 

convergent  //-integral  over  the  unbounded  interval  (a,  oc  ). 

fo>  /•« 

If  both   I    f(x)  dx  and     I      f(x)  dx  exist,  their  sum  defines  the  value  of 

Jo  .'-oo 

f°° 

I      f(x)  dx,  which  is  absolutely  or  non-absolutely  convergent  over  (—00,00), 

J    -00 

fm 

according  as    /       |/(#)  |  dx  is  finite  or  not. 

J    —  00 

The  following  theorem  provides  a  criterion  for  the  existence  of  an  .L-integral 
over  an  unbounded  interval  which  is  frequently  of  use : 

If  f(x)  and  f(x)  $  (x)  be  both  summable  in  every  interval  (a,  x),  and  if 

f* 

f(x)  dx  exists,  and  <f>  (x)  be  monotone  and  bounded  in  (X,  oo  ),  for  some 

J  a 

fee 

value  of  X  (^  a),  then   \    f(x)  (f)  (x)  dx  exists. 


A  value  £,  of  x,  can  be  so  determined  that 


f(x)  dx  \<€,  for  all  positive 


values  of  h,  where  e  is  an  assigned  positive  number ;  and  this  value  of  £  can 
be  chosen  to  be  ^  X.    We  have  then,  by  the  second  mean  value  theorem, 

rt+h  fi+eh  rt+h 

f(x)  $  (x)  dx  =  cf>  (£)  f(x)  dx  +  <f>(£  +  h)  f(x)dx, 

J*  J  t  J  t+eh 

where  6  has  some  value  in  the  interval  (0, 1).    If   <f>(x)   <  K,  for  every  value 
of  x  in  the  interval  (X,  oo  ),  we  have 


I       f  \x)  (f>  (x)  dx 

J  s 


<  2/0- ; 

and  since  e  is  arbitrary,  the  integral   /    f(x)  <f>  (x)  dx  exists. 

J  a 

A  similar  criterion  for  the  existence  of  an  absolutely  convergent  integral 
is  the  following : 

If*  f(x)  and  f(x)  <£  (x)  be  both  summable  in  every  interval  (a,  x),  and  if 
r°° 
I    f(.x}  ax  wists  as  an  absolutely  convergent  L-integral,  and  \  <f)  (x)  \  be  bounded 

J  a 

in  (X,  oo  ),  for  some  value  of  X  (=  a),  then  the  integral   I    f(x)  <£  (x)  dx  exists, 

J  a 

as  an  absolutely  convergent  L-integral. 

rco 

For,  since   I    f(x)  dx  is  absolutely  convergent,  a  number  f  (=  X}  can  be  so 

J  a 

rl+A 

determined  that   I       \f(°c}  \  dx  <  e,  for  all  positive  values  of  /i;  where  e  is  any 

assigned  positive  number. 

*  See  Eiemann's  Werke,  1st  ed.,  p.  229;  also  Pririgsheim,  Math.  Annalcn,  vol.  xxxvn,  p.  591. 
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We  have  then 

rt+h  rt+h  rt+h 

I      f(x)<j>(x)dx    £   \       \f(x)(f>(x)   dx<K   I        \f(x)\dx<ice, 

J  s  '   v  J  £ 

where  «  is  the  upper  boundary  of    <£  (x)  |  in  (f ,  oo  ).    Since  e  is  arbitrary,  the 

r°° 

condition  for  the  existence  of  I     |  f(x)  <f>  (x)   doc  is  satisfied. 

J  a 

439.    An  important  class  of  integrals  over  an  unbounded  interval,  of  which 
the  convergence  is  not  necessarily  absolute,  is  that  of  the  integrals 

roo  r  oo 

<j)  (x)  sin  xdx,         <f>  (x)  cos  xdx, 

.'o  -'o 

where  <£  (x)  is  monotone,  from  and  after  some  fixed  value  of  x,  and  converges 
to  zero,  as  x  is  indefinitely  increased. 

r**  /*£  .  r*2 

We  have    I     <f>  (x)  sin  xdx  =  (f>  (x^  I    sin  xdx  +  <f>  (x2}       sin  xdx,  where  x-^ 

J  Xi  J  Xi  .'£ 

is  so  large  that  <f>(x)  is  monotone  for  x^xl,  and  £  is  some  number  in  the 
interval  (^ ,  x2). 

From  this  we  have 


<j)  (x)  sin  xdx 


+  2|0(O   • 


Since    </>  (x)    converges  to  zero,  x1  may  be  so  chosen  that  |  </>  (x^  \  and  |  <f)  (#2)  | 
are  both  arbitrarily  small ;  hence  the  condition  is  satisfied  for  the  existence  of 

y*oo 

I    </>  (x)  sin  xdx,  as  an  i-integral  which  is  not  necessarily  absolutely  convergent. 
Jo 
The  case  of  the  second  integral  may  be  treated  in  the  same  manner. 

CHANGE  OF  THE  INDEPENDENT  VARIABLE  IN  A  LEBESGUE  INTEGRAL. 

rb 

440.    The  transformation  to  a  new  variable,  of  the  integral       f(x)dx,of 

J  a 

a  function  f(x)  that  is  summable  in  the  linear  interval  (a,  6)  will  now  be  con- 
sidered.   The  following  theorem*  will  be  established: 

Iff(x)  be  summable  in  the  linear  interval  (a,  b},and  if  </>(£)  be  a  monotone 
function  of  £  which  is  an  indefinite  integral  a  +  I   %  (£)  dg,  where  the  interval 

J  a 

(a,  /3),  of  £,  is  made  to  correspond  to  the  interval  (a,  b),  of  x,  by  means  of  the 
relation  x  =  <$>(%),  then   I    f(x)dx=\   ^(£)  %  (£)  c££,  where  F  (%)  denotes  the 

J  a  Jo. 

function  f  {$(!;)}. 

We  may  assume,  without  loss  of  generality,  that  (/>  (£)  is  non-diminishing 
*  See  Lebesgue,  Annales  de  Toulouse,  (3),  vol.  i,  p.  44. 
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in  (a,  /9).  It  has  been  shewn,  in  §  407,  that  the  necessary  and  sufficient  con- 
dition that  the  monotone  function  <f>  (£)  should  be  an  indefinite  integral  is  that, 
to  any  set  of  points  of  measure  zero  on  the  ^-segment,  there  should  correspond 
a  set  of  points  of  measure  zero  on  the  ^-segment. 

The  above  theorem  is  equivalent*  to  the  statement  that,  when  the  con- 
dition here  referred  to  is  satisfied,  then  the  equality 


holds  for  every  summable  function  f  (x). 

Let  Ante)  denote  a  set  of  non-overlapping  intervals  in  (a,  6),  and  An(f)  the 
corresponding  set  of  intervals  in  (a,  /3) ;  we  have  then 


m 


If  E(x]  be  the  inner  limiting  set  of  a  sequence  of  sets  of  intervals,  each  of 
which  sets  contains  the  next,  there  corresponds  to  E(x)  a  set  E^\  the  inner 
limiting  set  of  the  corresponding  sequence  of  intervals  in  the  ^-segment.  We 
have  then 

m  (E(*>)  =  lira  m  (  An<*>)  =  lira  f          y  (f  )  d  %  =  f         %  (f  )  d  £ 

n~«=  M~°O  ./   A*)  •/£<*) 


If  m  (E(x))  =  0,  it  follows  that  %  (f)  =  0,  at  all  points  of  E^\  with  the  possible 
exception  of  those  which  belong  to  a  component  of  measure  zero. 

The  set  of  points  on  the  ^-segment  that  corresponds  to  a  set  of  points  of 
measure  zero  on  the  ^-segment  has  not  necessarily  the  measure  zero,  and  is 
conceivably  not  a  measurable  set  ;  but  in  any  case  that  component  of  the  set 
at  which  %  (£)  is  not  zero  is  of  measure  zero. 

A  set  of  points  on  the  ^-segment  which  is  measurable  (B)  corresponds  to 
a  set  on  the  £-  segment  that  is  also  measurable  (B).  Any  measurable  set  E(x], 
on  the  ^-segment,  contains  a  set  Efx\  which  is  measurable  (B),  and  such  that 
EM  -  Ep*  has  measure  zero.  If  E(x\  E^  correspond  to  E^\  E^\  on  the  £- 
segment,  E^}  contains  the  measurable  set  E^\  and  at  all  points  of  the  set 
E^\  with  the  exception  of  a  component  of  measure  zero,  the  condition 
=  0  is  satisfied.  Thus  E^  is  the  sum  of  a  measurable  set,  arid  of  a  set 
of  points  in  which  ^  (£)  =  0. 

If  E(x]  is  the  set  of  points  at  which  /(#)  >  A,  E^  is  the  set  of  points  at 
which  F(%)  >  A.  If  F1  (£)  is  a  function  defined  only  at  those  points  of  the 
interval  (a,  fi)  in  which  ^  (£)  =£0,  and  is  equal  to  F  (%)  at  the  points  for  which 
it  is  defined,  the  set  of  points  at  which  Fl  (£)  >  A  is  measurable.  It  follows 
that  F!  (|)  %  (£  )  is  a  measurable  function,  defined  over  that  set  of  points  at 
which  ^  (£)  =/=  0  ;  and  thus  F(£)  %  (£)  is  measurable  over  (a,  /3). 


*  See  Hahn,  Monatshefte  der  Math.  u.  Physik,  vol.  xxm,  p.  167. 
H.  38 
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Let  us  first  consider  the  case  in  which  f(x)  is  continuous  in  (a,  6);  then 
F  (£)  is  continuous  in  (a,  /3).  Let  a  net  (a,  #1}  #2,  ...  xn-i,  b)  be  fitted  on  to 
(a,  6);  and  let  (a,  fa,  f2,  •••  fw-i»  /3)  be  corresponding  points  of  (a,  /:?).  If  Ur, 
Lr  are  the  upper  and  lower  boundaries  of  f(x)  in  (xr_-lt  xr),  or  of  F  (^}  in 
(£r-i,  £/•),  we  have 


Ir-i 

Therefore 

T  Z7r  (av  -  av-i)  2;  f  ^(1)  %  (?)  d£  ^Lr  (xr  -  x^}. 

r=l  J  a  r=l 

If  the  net  be  one  of  a  system  of  nets  fitted  on  to  (a,  b),  the  two  sums  con- 

rb 
verge  to  I  f(x)  dx,  when  the  nets  of  the  system  are  taken  successively.    It 

J  a 

then  follows  that 


Since  6  and  /3  may  be  replaced  by  x  and  f,  the  theorem  has  been  established 
for  the  case  of  a  continuous  function  f(x). 

Next,  let/(#)  be  any  summable  function,  bounded  in  (a,  b).    A  continuous 

rb 
function  f,  (x)  can  be  so  determined  that  I    \f(x)—ff(x)  dx<ez,  and  so  that 

J  a 

the  upper  boundary  of  \ff(as)    does  not  exceed  that  of   f(x)\  (see  §430). 
Denoting  by  Fe(%)  the  function  that  is  equivalent  to/e(#),  we  have 


where  f/  is  the  upper  boundary  of  \f(x)  \  in  (a,  6),  and  H1  is  that  set  of  points 
at  which  %  (£)  ^=  0,  and  which  is  a  component  of  the  set  that  corresponds  to 
the  set  H  on  the  ^-segment,  in  which  f(®)—f<(®}\  >e.  The  measure  of  H 
cannot  exceed  e  ;  and  if  we  assign  to  e  successively  the  values  in  a  diminishing 
sequence  that  converges  to  zero,  the  set  H^  which  corresponds  to  a  value  er 
in  the  sequence  contains  the  set  H^er+l\  The  inner  limiting  set  of  the  sequence 
[TfMJ  has  the  measure  zero,  and  therefore  the  inner  limiting  set  of  the  corre- 
sponding sequence  {H^^}  has  the  measure  zero,  as  it  contains  no  points  at 
which  %  (|)  =  0. 

It  follows  that  m  (H^  ~Q,  as  e  ~  0  ;  and  thus  we  have 


F(%)  %(%}&};  =  lim     Fe  (£)  x  (£)  c££  =  lim 

.    a  e~0  -   a  c~0    - 

It  has  now  been  shewn  that 

•b 


f3  f 

I   F(g)'%(%)d%=     f(x)dx, 

J  a  J  a 

where  f(x)  is  any  bounded  summable  function. 
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Lastly,  let  f(x)  be  unbounded  and  summable ;  since  it  can  be  expressed 
as  the  difference  of  two  non-negative  summable  functions,  it  will  be  sufficient 
to  consider  the  case  in  which  f(x)  =  0,  in  (a,  6).  Let  fN  (x)  =/(#),  at  points 
where  f(x)  ^  N,  and  let  fN  (x)  =  N,  at  points  where  f(x)->  N',  the  positive 
number  N  being  arbitrary.  We  have  then 


where  -^v(£)  is  equivalent  to  fN(x).    Since  the  integral  on  the  right-hand 

side  is  non-diminishing  as  N  is  increased,  and  since  it  has  the  finite  limit 
b 
f(x)dx,  as  N~  oo  ,  it  follows  from  the  theorem  of  §  399  that 


We  may  replace  b  by  any  point  x,  in  the  interval  (a,  b),  and  /3  by  the  point  in 
(a,  /3}  that  corresponds  to  x. 

If  E  be  any  measurable  set  in  (a,  b),  and  EI  be  the  measurable  component 
of  that  set  in  (a,  ft)  which  corresponds  to  E,  and  in  which  x  (£)  =£  0,  we  may 
replace  f(x)  by  a  function  which  has  the  same  value  as/(#)  in  E,  and  is  zero 
in  G(E)]  we  have  then 


f     f(x)dx  =  f 

J(E)  J(El) 

rf 
In  accordance  with  the  theorem  of  §  405,  since  x  =  a  +  I   x  (f)  d%,  we  have 

J  a 
Ci  IT 

-,-£  =  x  (IX  f°r  almost  all  values  of  f  in  (a,  /3).    Hence  the  above  theorem  may 
be  written  in  the  form 

f(x)dx  = 

J  a 

dx 
it  being  assumed  that  -^  is  integrable  (L),  in  (a,  /3). 

441.  The  extension  of  the  theorem  of  §  373  to  the  case  in  which  the 
function  f(x,  y)  is  any  summable  function  will  here  be  considered. 

It  will  be  assumed,  as  in  §  372,  that  a  (1,  1)  correspondence  between  the 
points  of  a  perfect  connex  domain  H,  in  the  plane  of  (x,  y),  and  the  corre- 
sponding domain  H,\  in  the  plane  of  (£,  ij),  is  defined  by  x  =/j  (£,  77),  y  =f.2  (%,  77), 
where  the  partial  differential  coefficients  of /i  and  fz  are  continuous  in  H1 ,  and 

f(  (  f    f\  __ 

the  Jacobian      / «!     !  =  J  does  not  vanish  in  H.    It  was  shewn,  in  §  372,  that, 


to  a  closed  set  h,  in  H,  of  content  zero,  there  corresponds  a  closed  set  h,  in  H, 
of  content  zero. 

38—2 
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If  8  denote  a  closed  cell  in  H,  and  8  the  corresponding  closed  domain  in 
H,  by  taking  f(x,  y)  =  1,  in  .ff",  we  have,  in  accordance  with  the  result  ob- 
tained in  §  373, 


(«)  '  (fi) 

where  k  is  some  number  between  the  upper  and  lower  limits  of  |  J  \  in  8. 

If  G  be  any  connex  perfect  domain  of  which  the  frontier  has  its  plane 
content  zero,  G  being  contained  in  H,  and  if  G,  in  the  plane  of  (£,  17),  corre- 
spond to  G,  we  see  that,  J  \  being  continuous  and  never  vanishing,  it  has  a 
lower  limit  1/X,  in  G,  where  X  >  0  ;  therefore  m  (IT)  <  \m  (8),  for  every  cell  8, 
contained  in  G. 

If  A  be  a  set  of  non-overlapping  cells,  contained  in  G,  and  A  be  the  set  of 
domains  in  G  that  corresponds  to  A,  we  have  m  (A)  <  \m  (A). 

Let  E  be  any  measurable  set  of  points  in  G,  then  E  is  contained  in  a  set 
E1}  of  measure  m(E),  which  is  the  inner  limiting  set  of  a  sequence  {A(w)},  of 
sets  of  non-overlapping  cells.  Let  {A(n)}  be  the  corresponding  sequence  of 
non-overlapping  domains  in  G.  The  set  E1  —  E  =  F,  which  has  measure  zero, 
is  contained  in  a  set  Fl}  also  of  measure  zero,  which  is  the  inner  limiting  set 
of  a  sequence  |A'(n)}  of  sets  of  cells.  Let  F1  be  the  set,  in  G,  which  corresponds 
to  F!;  then  m  (f()  =  lim  m  (A(n))  ^  \  lira  m  (A(n))  ;  hence  m  (Fj)  =  0.  The  set 

n~<x>  w~oo 

E  is  contained  in  Elf  and  contains  E1  —  F1,  and  hence  the  set  E,  corresponding 
to  E,  is  contained  in  El}  and  contains  E1  —  F1.  Since  m(Fl)  =  0,  and  El  is 
measurable,  it  follows  that  E  is  measurable,  and 


)  =  m  (E,)  ^  \m  (E,)  ^  \m  (E). 

It  has  thus  been  shewn  that  the  set  E,  in  G,  that  corresponds  to  a  measurable 
set  E,  in  G,  is  measurable,  and  that  its  measure  does  not  exceed  \m(E). 

If  f(x,  y}  be  a  measurable  function,  denned  over  the  set  G,  the  function 
F(%>  v)  which  has  the  same  value  at  a  point  (£,  77),  of  G,  as  f(x,  y)  has  at  the 
point  (x,  y}  that  corresponds  to  (£,  77)  is  measurable  in  G. 

First,  let  f(x,  y}  be  bounded  in  G',  then  a  continuous  function  <£e(#,  y) 
can  be  so  determined  (see  §  430)  that 

>  y)  -  &  (®>  y}  \  d  (as,  y}  <  e2. 

(G) 

In  a  set  L,  of  points  of  measure  >  m  (G)  —  e,  we  have  \f(x,  y}  —  <£e  (x,  y}\^e. 
Moreover  </>e  (x,  y)  can  be  so  defined  that  its  numerical  value  never  exceeds 
the  upper  boundary  of  \f(x,  y)  \  in  G. 
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From  the  theorem  established  in  §  373,  we  have 


where  Ft  (£,  ?;)  =  (/>e  (x,  y),  at  corresponding  points. 
To  estimate  the  value  of 


f 

J 


we  divide  the  set  G  into  the  two  parts  L  and  C  (L),  where,  in  L,  the  condition 
I  f(^>  v)  —  ft  (£>  "»?)  |  =  e  is  satisfied.  The  absolute  value  of  the  difference  is 
then  seen  to  be 


J 


_ 

(G) 


J 


where  77  is  the  upper  boundary  of  \f(x,  y)   in  6r. 

Since  in  {G  '(L)}  ^  \m  {C(L)}  <  Xe,  we  see  that  this  expression  is  arbitrarily 
small,  if  e  be  taken  small  enough. 

It  has  thus  been  shewn  that 

limf   J*.( 

«~0./(8J 

and  therefore  we  have 


f 

j 


,  17). 


Next,  lety(«,  y)  be  unbounded.  Since  it  can  be  expressed  as  the  difference 
of  two  summable  functions,  each  of  which  is  non-negative,  it  will  be  sufficient 
to  assume  that  /(#,  y)  ^  0.  Let  N  be  an  arbitrarily  chosen  positive  number 
and  let  fy(x,  y)  =f(x,  y},  at  all  points  where  f(x,  y)  ^  N,  and  elsewhere  let 
fN  (x,  y)  =  N.  We  have  then,  since  fN  (x,  y)  is  bounded, 

fN(x,y)d(x,y}= 

) 

where  FN  (f  ,  17)  has  the  same  value  as  fN  (x,  y\  at  the  point  (x,  y)  which 
corresponds  to  (f  ,  77). 

Since 

lim        fN  (x,  y}  d  (x,  y)=\     f(x,  y)  d  (x,  y\ 

N~<*>J(G)  J  (G) 

it  follows  that  lim  I     FN  (%,  rf)\J  d  (£,  tj)  exists,  and  is  equal  to 
N~»J  ffi 

f(x,  y)  d  (x,  y). 

J  (G) 


598  The  Lebesgue  integral  [CH.  vu 

Since  FN  (£,  77)  |  «7  1  is  monotone  as  JV  is  increased,  it  follows,  by  applying  the 
theorem  of  §  399,  that 


f 

J 


-       f(x,y}d(x,y). 

(G) 


The  transformation  obtained  in  §  373  has  now  been  extended  to  the  case  in 
which  f  (x,  y)  is  any  function  that  is  summable  in  G. 

The  extension,  given  in  §  374,  to  the  case  in  which,  in  a  closed  set  of  points 
of  content  zero,  the  Jacobian  either  vanishes  or  is  indefinite,  is  applicable  when 
f(x,  y)  is  a  summable  function.  Moreover  the  case  in  which  one  of  the  domains 
G,  G  is  unbounded,  or  in  which  both  are  unbounded,  can  be  considered,  as 
in  §  375. 

442.  A  more  general  treatment  of  the  transformation  of  a  double,  or  of  a 
multiple,  integral  has  been  given  by  W.  H.  Young,  in  which  it  is  not  assumed 
that  the  Jacobian  is  necessarily  of  one  sign,  or  that  the  correspondence  of  the 
points  (x,  y),  (£,  77)  is  necessarily  a  (1,  1)  correspondence.  He  has  established 
the  following  theorem  : 

Let  x  =  x(%,  T)),  y  =  y(%>  7?)  be  functions  of  (£,  77)  possessing  the  property 
of  having  all  their  partial  derivatives  with  respect  to  (£,  77)  bounded  for  all 
values  of  (£,  77)  in  the  fundamental  rectangle  (a.l,  &;  or2,  /32);  and  let  A  be  the 
area  of  the  curve  in  the  (x,  y}-plane  which  is  the  image  of  the  perimeter  of  this 
fundamental  rectangle.  Then 

d  (ac,  y) 


where  ^  ,  =-  ,  xC,  ~  represent  any  of  the  partial  derivatives  of  x  and  y  with 
respect  to  £  and  77. 

More  generally,  the  same  is  true,  if  the  partial  derivatives  are  not  bounded, 
provided  only  the  following  conditions  are  satisfied: 

(1)  x  (f  ,  77)  is  an  integral  with  respect  to  £,  and  an  integral  with  respect  to  77  ; 

(2)  y  (£,  77)  is  an  integral  with  respect  to  %,  and  an  integral  with  respect  to  77; 

fjfi  HIT 

(3)  ^-  and  ~-  are,  except  for  a  set  of  values  of  77,  of  measure  zero,  less  than 


summable  functions  of  77  alone,  say  ^(77)  and 

(4)  the  same  condition  as  (3)  is  satisfied,  ivith  £  and  77  interchanged,  or 

I*  {}  IT  I    f7  U 

more  generally,  when     ^  /i  (77)  rf  (f,  77),   I  ^MM  d  (f,  77)  e^a's^  as  absolutely 

J  Vq  J    U£ 

convergent  integrals. 

For  the  proof  of  this  theorem  reference  may  be  made  to  a  memoir*  by 
W.  H.  Young. 

*  See  Proc.  Land.  Math.  Soc.  (2),  vol.  xvm,  p.  339,  "  On  a  formula  for  an  area." 
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For  the  transformation  of  the  integral  f(a>,  y),  he  has  given  the  following 
theorem : 

Let  x  =  x(£,  77),  y  =  y(j-,  77),  where  the  functions  are  continuous  with  respect 
to  (£,  77),  in  a  fundamental  cell.  Let  it  be  assumed  that  any  cell  in  ilw  (£,  77 )- 
space  has  for  its  image  in  the  (x,  y)-space,  a  portion  of  that  space  whose 
boundary  divides  it  into  two  distinct  portions,  an  exterior  and  an  interior.  Fur- 
ther let  it  be  assumed  that  the  area  of  a  portion  of  the  (x,  y)-space  is  given  by 

Cn  \  ST    77  ) 

a /g    \  d  (£,  77),  an  absolutely  convergent  integral,  which  may  be  positive,  negative, 

or  zero.    Then  I  f(x,  y)  d  {x,  y),  taken  over  the  portion  of  the  (x,  y)-space  which 
corresponds  to  a  cell  in  the  (£,  ij)-space,  is  equal  to 


/. 


taken  over  the  cell. 


For  the  proof  of  this  theorem  reference*  may  be  made  to  a  memoir  by 
W.  H.  Young. 


HARNACK  S  DEFINITION  OF  AN  INTEGRAL. 

443.  A  definition  of -the  integral  of  a  function  f(x),  unbounded  in  the 
linear  integral  for  which  it  is  defined,  was  given  f  by  Harnack,  which  in  its 
original  form  depended  upon  the  employment  of  Riemann's  integrals  in  sub- 
intervals  of  (a,  6)  in  which  such  integrals  exist.  The  definition  given  by 
Harnack  can  however,  without  essential  change  of  form,  be  extended  to  the 
case  in  which  Lebesgue  integrals  take  the  place  of  Riemann  integrals. 

The  set  of  points  of  infinite  discontinuity  of  a  function  f(x)  defined  for 
the  finite  interval  (a,  6)  form  a  closed  set  G. 

It  will  be  assumed  that  this  closed  set  G  is  of  content  zero.  Let  G  be 
enclosed  within  the  intervals  S1}  82,...8n,  of  a  finite  set  A,  such  that  each 
interval  of  A  contains  at  least  one  point  of  G.  The  complement  G  (A)  consists 
of  a  finite  set  of  intervals,  free  at  their  ends,  and  in  their  interiors,  from  points 
of  G. 

Let  it  be  assumed  that  f(x)  is  integrable  (L)  in  each  of  the  intervals  of 

r 
C  (A),  and  thus  that  f(x)dx  exists  as  an  Z-integral.    The  integral  of 

f(x),  in  (a,  b),  when  it  exists,  is  defined  as  follows : 

*  Proc.  Royal  Soc.,  vol.  xcvi,  p.  82. 

t  Math.  Annalen,  vol.  xxiv,  p.  220.  See  also  Jordan,  Cours  d'Analyse,  vol.  n,  p.  50,  where 
a  similar  definition  is  given,  except  that  the  condition  that  the  set  of  points  of  infinite  dis- 
continuity should  have  content  zero  is  omitted. 
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Let  it  be  assumed  that  a  number  I  exists  such  that,  corresponding  to  each 
arbitrarily  chosen  positive  number  e,  a  positive  number  £e  can  be  determined  so  that 

I  -  f(x)  dx   <e 

J  C  (A)  J 

for  every  finite  set  A  of  intervals  which  satisfy  the  conditions  given  above,  and 
which  is  such  that  m  (A)  <  £e.    The  number  I  is  then  taken  to  define  the  value  of 


I   f(x)  dx. 

J  a 


It  is  convenient  to  define  a  function  /A  (x)  which  is  zero  in  all  points  of  the 
intervals  A,  and  is  equal  in  value  to  f(x),  at  all  interior  points  of  C  (A). 

We  have  then,  in  accordance  with  the  above  definition, 

rb  rb 

I    f(x)  dx  =    lim     I    /A  (x)  dx, 

J  a  m(&)~QJa 

the  convergence  being  uniform  with  respect  to  all  sets  A. 

The  necessary  and  sufficient  condition  for  the  existence  of  the  integral 
rb 

I    f(x}  dx,  as  a  finite  number,  is  that,  e  being  assigned,  £e  can  be  so  deter- 
mined that 


I  fb  [b 

/A  (x)  dx  —      /A-  (x)  dx 

\J  a  J  a 


<2e 


for  every  pair  of  finite  sets  A,  A'  of  intervals,  such  that  each  interval  of  either 
set  encloses  within  it  at  least  one  point  of  0,  provided  m  (A)  <  £e,  m  (A')  <  fc ; 
and  that  this  condition  is  satisfied  whatever  value  e  may  have. 

It  is  easily  seen  that  the  above  definition  is  equivalent  to  that  of  Cauchy 
(§  352),  in  case  the  set  O  consists  of  a  finite  number  of  points. 

It  will  be  shewn  that  the  above  definition  is  more  general  than  that  of 
Lebesgue,  in  that  it  defines  integrals  which  do  not  necessarily  exist  in 
accordance  with  Lebesgue's  definition.  It  is  clearly  less  general  than 
Leb  esue's  definition  in  that  it  applies  only  to  the  case  in  which  the  set  of 
points  of  infinite  discontinuity  of  the  integrand  is  of  content  zero. 

As  m  (A)  converges  to  zero,  the  number  n,  of  intervals  of  A,  will  increase 
indefinitely.  If  any  sequence  A1}  A2,  A3,  ...  corresponding  to  a  sequence 
€1,  ea,  e3,  ...,  of  values  of  e,  which  converges  to  zero,  be  taken,  a  sequence 

Ani,  A?l2,  Ana,  ... 

may  be  chosen  out  of  the  given  sequence  in  such  a  way  that  Arir  contains 
Anr+1,  for  all  values  of  r.  For  let  C(Ani)  consist  of,  say,  rni  intervals  (a,  /3) 
each  contained  in  some  interval  (a,  /3)  contiguous  to  G.  The  number  nz  can 
be  so  chosen  that  it  is  the  smallest  integer  (>MI)  such  that  w(A7l2)  is  less 
than  the  least  of  all  the  2rx  numbers  a  —  a,  (3  —  /3.  Clearly  then  (a,  /3)  is 
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within  an  interval  complementary  to  the  set  A^.  The  numbers  n3,  nt,  ... 
may  then  be  chosen  successively  by  the  same  procedure.  The  sequence 
Ani>  AMj,  ...  is  therefore  such  that  each  set  contains  the  next. 

rb  rb 

When  both  I    f(x)  dx  and  I    \f(x)  \  dx  exist,  in  accordance  with  the  above 

J  a  J  a 

Cb 

definition,  I  f(x)  dx  is  said  to  be  absolutely  convergent.     Those  integrals 

J  a 

which  exist  in  accordance  with  the  above  definition,  and  are  not  absolutely 
convergent,  will  be  considered  in  Chapter  VIII. 

444.    It  will  be  shewn  that : 

rb 
If  I   f(x)  dx  exists,  and  is  absolutely  convergent,  the  integral  also  exists  as 

J  a 

an  L-integral,  and  conversely,  that,  if  it  exists  as  an  L-integral,  it  also  exists  as 
an  absolutely  convergent  integral  in  accordance  with  the  above  definition. 

First,  let  f(x)  ^  0,  in  the  interval  (a,  b).    Taking  the  sequence  Ax,  A2,  A3,  ... 
so  that  each  set  contains  the  next,  we  see  that  the  values  of 

•b 

/A  (x)  dx, 


i: 


for  r  =  1,  2,  3, ... ,  form  a  monotone  non-diminishing  sequence  of  numbers,  and 
thus  they  either  increase  indefinitely,  or  they  converge  to  a  limit  A.  Assume 
that  the  latter  is  the  case. 

Let  A/,  A2',  A3',  ...  be  any  other  sequence  of  finite  sets  of  intervals  which 
satisfy  the  conditions  in  the  definition,  but  not  necessarily  such  that  A/ 
contains  A'g+1,  for  all  values  of  s.    We  have 
rb  rb 

/A '(*)<&-     /A.  (#)  efo  <  Pro  (Ar)  ; 

•la  J  a 

where  P  is  the  upper  boundary  of  f(x)  in  the  intervals  of  C  (A/) ;  for  the 
above  difference  is  less  than  the  integral  of  f(x)  over  those  intervals  that  are 
in  Ar  and  also  in  C  (A/).  If  e  be  arbitrarily  chosen,  and  s  be  fixed,  for  all 
sufficiently  large  values  of  r  we  have,  on  the  assumption  of  the  existence  of 

rb 

the  limit  A,  of      /A  (x)  dx, 

J  a 

rb  rb 

/A;  O)  dee  -     /A  (x)  dx<e; 

J  a  J  a 

rb 

and  therefore  I    /Ag>  (x}  dx  ^  A  +  e  ; 

J  a 

this  holds  for  each  value  of  s.  Similarly,  we  see  that,  for  a  fixed  value  of  r, 
and  for  all  sufficiently  large  values  of  s, 
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and  by  choosing  the  fixed  value  of  r  so  great  that 

rb 
/A,  (#)  dx>A-  £e, 

J  a 

rb 
we  have  fA>(x)dx>A-  e, 

J  a 

for  all  sufficiently  large  values  of  s.    As 

"b 


r 

•/  a 


/ 


lies  in  the  interval  (^.  -  e,  A  +  e)  for  all  sufficiently  large  values  of  s,  and 
since  e  is  arbitrary,  it  follows  that 

/A/  (a?)  dx 


converges  to  A.    Moreover,  since 


f  /Ar  (x)  dx  -  f  /Ag,  (a)  dx  <  P'm  (A/), 

•  «  J  a 

where  P'  is  the  upper  boundary  of  f(x)  in  C  (  Ar)  ;  and  also  since 

A",  0)  <&  <  ^'™  (A'V), 


where  {A"6v|  is  any  other  sequence  of  intervals  enclosing  G ;  from  these  in- 
equalities, we  see  that,  if  m  (A/),  m  (A'V)  are  both  less  than  |e/P',  the 
integrals 

rb  "b 

/A/  \x)  ax>        /A'V  \x)    x 

.•a  -a 

are  both  >  A  —  e,  and  they  have  been  shewn  to  be  ^  A  +  e  ;  it  follows  that  the 
integrals  differ  from  one  another  by  less  than  2e.  Hence  the  condition  for  the 

rb 
existence  of  the  integral  I    f(x)  dx  is  satisfied,  on  the  assumption  that  the 

J  a 

integrals  of  /A^  (x}  converge  to  a  finite  number  as  r  ~  oo  . 
Thus  it  has  been  shewn  that : 
Iff(x)  be  never  negative,  it  is  necessary  and  sufficient  for  the  existence  of 

.  fb 

the    integral        f(x)dx   in   accordance   with   Hamack's   definition   that,   if 

J  a 

AJ,  A2,  ...  be  a  sequence  of  finite  sets  of  intervals  each  containing  the  next,  such 

that  each  interval  of  each  set  contains  within  it  a  point  of  G,  and  such  that 

rb 
m  ( Ar)  ~  0,  then  I    /A  (#)  dx  should  be  less  than  a  fixed  finite  number,  for  all 

J  a 

the  sets  Ar. 
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rb 
Let  it  now  be  assumed  that  /   f(x)  das  exists  as  an  Z-integral  ;  we  have 

J  a 
then  rb  rb  f 

f^(x)dx  =     f(x)dx-\      f(x)dx\ 

J  a  J  a  J  (A) 

and  since  (§  392)  I      f(x)  dx  converges  to  zero,  uniformly  as  m  (A)  ~  0,  we 
J(A) 

rb  rb 

see  that  I    f^  (x)  dx  converges  to  I    f(x)  dx  ;    and  thus  the  integral  also 

.'  a  J  a 

exists  in  accordance  with  Harnack's  definition.  It  appears  moreover  that  in 
this  case  of  a  non  -negative  function  the  condition  that  each  interval  of  A 
should  contain  a  point  of  G  is  unnecessary. 

It  has  thus  been  proved  that,  if  f(x)  be  non-negative,  and  have  an  L- 
integral,  it  has  also  an  integral  in  accordance  with  Harnack's  definition,  and 
the  two  have  the  same  value. 

Next,  let/(#)  have  both  positive  and  negative  values,  in  (a,  b),  and  assume 
that  it  is  summable  in  the  interval. 

If  f(x)  =f+(x)—f~(x),f+(x)  and/~(#)  are  summable,  and  therefore  they 
have  not  only  Z-integrals  but  also  Harnack  integrals.  It  is  clear  that 

rb  rb  rb 

/A  («)  dx  =       /A+0)  dx  -        /A~(«)  dx. 

J  a  J  a  J  a 

Now,  as  m(A)  converges  to  zero,  the  two  integrals  on  the  right-hand 
side  converge  to 


as  has  been  shewn  above,  although  A  is.  such  that  an  interval  of  it  does  not 
necessarily  contain  a  point  both  of  Oi  and  of  G.2,  the  sets  of  points  of  infinite 
discontinuity  of  /+(#)  and  of  /"(«). 

rb 

It  follows  that  I    f(x)dx  exists  in  accordance  with  Harnack's  definition, 

J  a 

and  is  equal  to  the  Z-integral. 

rb 

It  follows  immediately  that  I     f(x)  \  dx  exists  as  a  Harnack  integral,  and 

J  a 

is  equal  to  the  sum  of  the  integrals  off+(a;)  &ud  f~(x). 

Conversely,  let  us  assume  that 

rb  rb 

I    f(x)  dx  and   I     f(x)  dx 

J  a  J  a 

exist  as  Harnack  integrals. 

The  points  of  infinite  discontinuity  of  the  two  functions  /(#),  |/(#)|  ar® 
the  same,  hence  the  Harnack  integrals  of  the  two  functions  are  the  limits  of 
Z-integrals  taken  over  sets  of  intervals  that  are  the  same  for  the  two 
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functions.  It  follows  that  \f(x)  |  +/(#),  |/(#)  \  —  f(x)  have  Harnack  integrals, 
the  sum  and  difference  of  those  of  \f(oc}\  and  f(x)\  or  the  two  functions 
/+(#),  f~(ai)  have  Harnack  integrals. 

It  is  then  sufficient  to  shew  that  the  existence  of  the  Harnack  integral  of 
a  non-negative  function  involves  the  existence  of  the  i-integral  of  the  same 
function. 

Let  it  therefore  be  assumed  that  f(x)  ^  0,  and  that  the  Harnack  integral 
of/(#)  over  (a,  b)  exists.  The  points  of  infinite  discontinuity  of  f(x)  can  be 
enclosed  in  a  finite  set  A,  of  intervals,  where  m  (A)  is  so  small  that 


f(x)  dx 
J  C  (A)  J 


is  less  than  the  Harnack  integral 


\bf(x}dx 

J  a 


by  less  than  an  arbitrarily  chosen  positive  number  £  Let  N  be  a  positive 
number  not  less  than  the  upper  boundary  of /(#)  in  0(A),  and  let  /V(#)  be 
the  function  corresponding  to  f(x)  employed  in  de  la  Vallee  Poussin's  defini- 
tion (§  387).  Let  another  set  of  intervals  A',  all  interior  to  intervals  of  A, 
enclose  all  the  points  of  infinite  discontinuity  of  f(x).  The  integral  of  f(x} 
over  O(A')  lies  between  the  integral  over  C  (A)  and  Harnack's  integral,  and 
therefore  differs  from  the  last  by  less  than  £ 

It  follows  that  I  f(x)  dx  <  £ 

J(A-A') 

and  therefore  I  /N  (#)  dx  <  £ 

J  (A  -  A') 

From  this  we  deduce  that 


(A)" 

and  since  this  holds  for  an  arbitrarily  small  value  of  m  (A'),  N  being  fixed, 
we  have  r 

TIT  i /¥>  \  fi  np   ^~~    i 

/  -A'  V  "^  /  ^^  ^—   S* 

J(A)-7 

It  now  follows  that 


and  since  £  is  arbitrarily  small,  N  being  sufficiently  increased,  it  follows  that 


rb 

fN(x}dx 

J  a 


has  a  definite  limit,  as  N~  oo  ,  and  that  this  limit  is  Harnack's  integral 

dx. 


f* 

J  a 
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It  has  thus  been  shewn  that  the  existence  of  Harnack's  integral  implies 
that  of  the  integral  as  defined  by  de  la  Vallee  Poussin,  the  integrals  having 
the  same  value.  By  §  387,  it  follows  that  the  Z-integral  exists,  and  is  the  same 
as  the  Harnack  integral. 

Accordingly,  an  absolutely  convergent  integral  which  exists  according  to 
either  Lebesgue's  definition,  or  that  of  Harnack,  exists  also  according  to  the 
other  definition,  and  has  the  same  value  for  the  two  definitions.  It  has  more- 
over been  shewn  that  the  condition  that  each  interval  of  A  must  contain  at 
least  one  point  of  G,  the  set  of  points  of  infinite  discontinuity  of  the  function, 
is  unnecessary  when  the  integral  is  absolutely  convergent. 

THE   LEBESGUE-ST1ELTJES   INTEGRAL. 

445.  Let  f(x)  be  a  measurable  function,  defined  for  the  linear  interval 
(a,  b),  and  let  <f>  (x)  be  a  bounded  monotone  non-diminishing  function  defined 
in  the  same  interval.  Denoting  <j>(x)  by  £,  we  consider,  as  in  §  252,  the 
functional  image  E^\  on  the  ^-segment,  of  a  set  of  points  E^  on  the  #-seg- 
ment,  such  that  to  a  point  x',  at  which  <f>  (x')  is  discontinuous,  there  corre- 
sponds the  whole  interval  (<£  (x'  —  0),  <f>  (x  +  0))  of  points  on  the  ^-segment. 
It  was  remarked  in  §  252  that,  if  E^  is  measurable  on  the  ^-segment,  it  is 
not  necessarily  the  case  that  E®  is  measurable,  but  that,  if  E^  is  measurable 
{B),  then  E^  is  measurable  (B). 

Let  F(%)  be  the  function  defined,  at  every  point  of  the  ^-segment,  by  the 
condition  F(^)=f(x),  where  x  is  the  point  that  corresponds  to  f;  if  x'  is  a 
point  of  discontinuity  of  <j>  (x),  the  interval  (<f>  (x'  —  0),  <£  (x'  +  0))  is  an  interval 
of  invariability  of  F(%),  in  which  F(%)  =  <f>  (x'). 

f3 
Whenever    the  L-integral  I    F  (£)  dg  exists,  where  a  =  <f>  (a),  $  =  <f>  (b),  its 

J  a 

value  may  be  said  to  define  that  of  the  Lebesgue-Stieltjes  integral,  or  LS  -integral 
rb 
f(x)d(j)(x),  of  the  measurable  function  f(x),  with  respect  to  the  monotone 

J  a 

function  <£  (x). 

rb  rb 

In  case  I    F(%)  d%  exists  as  an  ^-integral,  the  integral      f  '  (x}  d(f>  (x)  may 

J  a  J  a 

be  an  .RS-integral  (see  §  378).  It  has  been  shewn,  in  §  377,  that  the  necessary 
and  sufficient  condition  that  the  integral  may  be  an  J^-integral  is  that  the 
variation  of  (f)  (x)  over  the  set  of  points  of  discontinuity  off(x)  should  be  zero. 

If  the  function  f(x)  is  measurable  (B),  the  function  ^(£)is  measurable  (B) 


on  the  ^-segment,  and  Lebesgue's  definition  is  applicable  to  define 

In  case  <£  (x)  is  of  bounded  variation,  and  is  therefore  expressible  as  the 
difference  of  two  monotone  non-diminishing  functions  <£i  (x),  <f>2  (x),  the  LS- 
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integral  of  f(x)  with  respect  to  <£  (x)  may  be  defined  to  be  the  excess  of  the 
.L$-integral  of/(#)  witji  respect  to  <f>i(x)  over  its  jL$-integral  with  respect  to 
<£2  (x),  whenever  these  latter  integrals  exist. 

Referring  to  the  definition,  in  §  252,  of  the  variation  of  the  monotone 
function  </>(#)  over  a  set  of  points  E  in  the  ^-segment,  and  denoting  by 
en  that  set  of  points  at  which  cn^f(x)<  cn+1,  we  see  that  the  .L$-mtegral 

rb  °° 

f(x)d<f>(x)  is  defined  as  the  limit*  of  2  cnV(-en)(j)(x);  the  set  of  numbers 

J  a  -» 

{cn}  being  such  that  cn+1  —  cn  is  less  than  a  fixed  number  e,  for  all  positive  and 
negative  values  of  n,  and  the  limit  being  taken  as  e  ~  0;  it  being  assumed 
that  the  variation  F(e'i)  ${x)  exists  for  all  the  sets  en,  so  that  F(%)  is  measur- 
able in  (ex,  /3). 

The  definition  of  an  Z$-integral,  over  the  ^-segment,  as  an  Z-integral  over 
the  ^-segment,  may  be  employed  to  extend  the  properties  of  Z-integrals  to 
the  case  of  Z$-integrals. 

We  have,  for  example,  the  extension  of  the  property  of  ^-integrals,  that, 
if  {fn(x}\  is  a  sequence  of  summable  functions  that  converges  to  f(x),  and 
such  that  \fn(x)  \  is  bounded  for  all  values  of  n  and  x,  then 

rb  rb 

I  f  (x)  dx  =  lim      /„  (x)  dx. 

J  a  n~  x>  J  a 

The  corresponding  property  of  the  Z$-integral  is  that 


f(x)d(f>(x)=  lim      fn(x)d<j>(x), 

n~<x>  J  a 

where  the  bounded  functions  {/„  (x)}  are  such  that  their  Z$-integrals  with 
respect  to  <£  (x}  exist. 

446.  It  has  been  pointed  out  by  Hildebrandt  (loc.  cit.)  that  the  definition 
of  an  integral  due  to  W.  H.  Young,  referred  to  in  §  389,  may  be  so  extended 
as  to  give  rise  to  a  definition  of  integration  of  a  summable  function  with 
respect  to  a  monotone  function,  more  general  than  the  definition  given  above. 

Let  the  interval  (a,  6)  be  divided  into  a  finite,  or  enumerably  infinite,  set 
of  parts  el}  e2>  ...  en,  ...  over  each  of  which  the  function  <£ (x)  is  measurable, 
and  such  that,  over  each  of  the  sets  en,  the  function  f(x)  has  finite  upper 
and  lower  boundaries  Un,  Ln.  Consider  the  sums 

8,  =  ^Un  7W  0  (X\      8,  =  'S.Ln  VM  </>  (a-); 

then  the  lower  boundary  of  Slt  for  all  such  modes  of  division  of  (a,  6)  into 

7b 

the  sum  of  sets,  is  defined  to  be  the  upper  integral      f(x}d(j)(x),  of  f(x) 

•J  a 

*  See  Hildebrandt,  "  On  integrals  related  to,  and  extensions  of,  Lebesgue  integrals,"  Bull.  Amer. 
Hath.  Soc.  (2),  vol.  xxiv,  p.  191. 
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with  respect  to  <j>  (x),  over  the  interval  (a,  6).     Similarly,  the  lower  integral 
rb 
I  f(x)d<^)(x)  is  defined  to  be  the  upper  boundary  of  $2,  for  all  such  modes  of 

J  a 

division  of  the  interval  (a,  b).    When  the  upper  and  lower  integrals  have  the 

rb 

same  value,  that  value  may  be  .said   to  define  the  integral  /  f  (x)  d<f>  (x), 

J  a 

off  (x)  with  respect  to  <f>  (x).  In  accordance  with  this  definition  any  sum- 
mable  function f(x)  will  have*  an  integral  with  respect  to  <£(#);  the  division 
of  (a,  6)  into  parts  being  restricted  to  be  a  division  into  sets  that  are  all 
measurable  (B).  The  definition  of  §  445  is  such  that  all  functions  that  are 
measurable  (B)  have  integrals,  finite,  or  infinite,  with  respect  to  the  mono- 
tone function  <£  (x). 

Lebesgue  has  given  f  two  modes  by  which  a  Stieltjes  integral  can  be 
reduced  to  an  Z-integral.  A  transformation  of  an  Z-integral  into  a  Stieltjes 
integral  has  been  given  J  by  Van  Vleck.  If  e(y)  denotes  the  measure  of  the 

set  of  points  for  which  L  =/(#)  <  y,  where  L  is  the  lower  boundary  of  f(x)  in 

rb  rU 

(a,b),  the  Z-integral     /(#)  dx  is  equal  to  the  Stieltjes  integral     y  de  (y),  where 

J  a  J  L        • 

?7is  the  upper  boundary  of  f(x)  in  (a,  b).  This  includes  the  case  in  which 
one,  or  both,  of  the  numbers  U,  L  are  infinite.  It  has  been  shewn  by  Bliss  § 
that  an  Z-integral  is  reducible  to  an  J?-integral. 

rU  rU 

For,  I    y  de  (y)  =  U  (b  —  a)  —  I    e(y)dy\   and  the  last  integral  is  an  R- 

J  L  J  L 

integral,  since  e  (y}  is  a  monotone  function  of  y. 

rx 

447.    From  the  theorem  of  §  440,  we  see  that,  if  <f>  (x)  =     <f>  (a?)  dx,  where 

J  a 


<E>  (x)  is  a  monotone  function  of  x,  in  an  interval  (a,  b),  then  /   /  (x)  (f>  (x)  dx  is 

J  a 

re 

equal  to  I    F '(£)  dg,  provided  F(^)  is  summable  in  the  interval  (a,  /3)  that 

J  a 


corresponds  to  (a, 6),  when  the  transformation  f  =  a  +  I    (f)(cc)dx  is  employed,  and 

J  a 

F (£)  =  f  (x).    It  then  follows  from  the  definition  in  §  445  that 


f 

J  a 


This  may  be  extended  to  the  case  in  which  <l>  (x)  is  the  indefinite  integral  of 

*  See  Hildebrandt,  loc.cit.,  p.  191. 
f  Comptes  Rendus,  Paris,  vol.  CL,  p.  86. 
I  Trans.  Amer.  Math.  Soc.,  vol.  xviu,  p.  326. 
§  Bull.  Amer.  Math.  Soc.,  vol.  xxiv,  p.  1. 
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any  summable  function  <f>  (x).    For  </>  (x)  may  be  expressed  as  fa  (x)  —  fa  (x), 
where  fa  (#),  fa(x)  are  non-negative  summable  functions;  and  thus 

Fx  rx 

<£(#)  =      fa  (x)  dx—  I    fax  (dx) 

J  a  J  a 

=  ^>1(x)-^2(x). 
In  accordance  with  the  definition  in  §  445,  we  now  have 

(*/<*)  <#>  (*)  ^  =  r  /(*>  d*i  («)  -  f  /(«>  <**«  (*>> 

•/a  ^  a  J  a 

rx  rx 

or  f(x)(f>(x)dx  =     f(x)d3>(x\ 

J  a  J  a 

where  <I>  («)  denotes  the  indefinite  integral  of  the  summable  function  0  (x). 
Let/  (a;)  be  monotone  and  bounded  in  the  interval  (a,  6),  then 

/*/(«)  d*(«) 

•/  a 

is  an  ^/S-integral,  since  the  variation  of  3>  (a?)  over  the  set  of  points  of  discon- 
tinuity of  f(x)  is  zero,  because  3>  (x)  is  an  indefinite  integral. 

We  then  have  (see  §  376) 

O)  d®  (x)  = 

and  thus  the  theorem  for  integration  by  parts  may  be  written  in  the  form 
f  /  (x)  (/>  (x)  dx  =  [/(a)1*  (a;)]  6  -  f  '  ^  (x)  df(x). 

J  a  |_  _ja       ^  a 

Moreover,  since  f(x)  is  monotone, 


where  /u,  is  some  point  in  the  interval  (a,  b). 

Therefore       f  /(*)  0  (a)  <te  =/(6)  ^>  (6)  -  ^>  (/*)  {/(6)  -/(«)} 

J  « 

=  /(a)  [^  (*)  ^  +/(6)  f  ^  ^  ^ 

J  a  J  ^ 

A  proof  of  the  second  mean  value  theorem*  has  thus  been  obtained  by 
employing  the  method  of  integration  by  parts,  as  applied  to  the  J?$-integral. 
The  more  general  form  of  the  theorem,  and  Bonnet's  form,  can  be  deduced,  as 

in  §  422. 

*  See  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  280. 
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448.  If  f(xw,  #(2))>  <f>(oc(1),  #<2))  are  both  functions  of  bounded  variation, 
in  accordance  with  the  definition  of  §  254,  it  can  be  shewn  that  the  following 
formula*  for  integration  by  parts  holds  good.  The  integrals  exist  in  accord- 
ance with  the  definition  given  in  §  381. 

(6  <»,&«») 


r 
= 
L 


&«> 


•(&<«,  6») 
+ 


where 


-I 

/O(1),  tf  (2))  ^  (a?w,  a;'2)) 

>)^  (&(1),  «(2))  -/(«(1),  « 

r  -| 

/(««,«»)*(««,«») 

L  J 


denotes  /(&<",  «(2>)^  (&(1),  «(2))  -/(«(1),  «0))^  O(1),  «(2)). 

|(6a),  6  ») 
and 

denotes 

,  a'2>)  -/(a(1>,  6<2>)  0  (a'1', 


From  this  formula,  the  second  me&n  value  theorem  for  functions  of  two 
variables  (see  §  425)  can  be  deduced. 

The  method  has  been  developed  by  W.  H.  Young  for  the  case  of  functions 
of  any  number  of  variables. 

BELLINGER'S  INTEGRALS. 

449.  In  connection  with  the  theory  of  the  reduction  of  a  quadratic  form 
involving  an  infinite  number  of  variables  to  a  canonical  quadratic  form,  certain 
limits  were  introduced  by  Hellingerf  ,  for  the  representation  of  which  he  used 
the  notation  of  integration.  A  limit,  of  this  species,  may  accordingly  be 
spoken  of  as  a  Hellinger  integral.  It  was  shewn  by  Hahn  that  every  Bellinger 
integral  can  be  expressed  as  an  i-integral. 

Let  g  (y)  be  a  continuous  non-decreasing  monotone  function,  defined  for 
the  linear  interval  (a,  /3);  and  let  f(y)  be  a  summable  function,  defined  for 
the  same  interval,  and  such  that  it  is  constant  in  any  interval,  contained  in 
(a,  /3),  in  which  g  (y)  is  constant. 

Let  a  net  be  fitted  on  to  the  interval  (a,  /3);  and  denote  its  meshes  by 

(y<»  yi\  (yi>  2/2),  ...  (ym-i,  ym),  where  y0  =  «,  ym  =  &> 

*  See  W.  H.  Youn^,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  281. 

t  See  the  dissertation  "Die  orthogonal  invarianten  quadratischen  Formen  von  unendlich 
vielen  Veranderlichen,"  Gottingen,  1907;  also  a  memoir  in  Crelle's  Journal,  vol.  cxxxvi. 

H.  39 
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__  _ 

If  the  sum  2         ,\        /       >.     Aas  a  finite  upper  boundary,  when  all  pos- 

- 

[d  f(y)]z 


sible  nets  are  taken  into  account,  that  upper  boundary  is  denoted  by 

• 

awcZ  w  called  a  Hellinger  integral.   In  the  above  sum,  any  term  in  which  the 
denominator,  and  consequently  the  numerator,  vanishes,  is  omitted. 

Let  the  new  variable  x  be  defined  for  the  interval  (a,  b),  where  a  =  g  (a), 
b  =  g  (/3),  by  the  definition  x  =  g  (y). 

The  sum  employed  in  the  definition  of  the  Hellinger  integral  becomes 

r-m  \F(X  \  —  F(x  _  Y2 
2   J  j  ,  where  xr  corresponds  to  yr,  and  F(x),  or  F{g(y)},  is 

r  =  1  ^r       Xr—  i 

identical  with/"(y). 

If  fi(y),  fz(y)  are  two  summable  functions,  defined  in  (a,  /3),  CM?C?  eacA  is 
constant  in  any  interval  in  which  g  (y)  is  constant,  then  if  the  sum 

r?n  l/i  (yr)  -/!  (yr-0)  {/.  (yr)  -/.  (yr-i)l 
r^i  9  (yr)  -  g  (yr-3 

has  a  definite  limit,  the  same  for  all  systems  of  nets  fitted  on  to  (a,  /3),  this 
limit  is  denoted  by  I  f  /  -f        ,  *w^  *'*  «^o  called  a  Hellinger  integral. 

The  sum  here  employed  is  equivalent  to 

(ay)  -  F2  (^r_1)} 


where  ^  {^r  (y)}  =/x  (y),  and  .P2  {g(y)}  =f,(y). 

450.    The  following  theorem  will  be  established: 

If  $  (x)  denote  a  function,  bounded,  or  unbounded,  in  (a,  b),  such  that  {</>  (#)}2 
is  summable  in  the  interval,  then 

!(/"  )  2      1    f  f  )  2  Iff  )  2 

5-  \         (f>  (x)  dx  I  +  JT  ]         <j>  (.x)dx[  +...  +  ;-  U       (f)  (x)  dx\  , 

6l    U  (Si)  )  °2    U(«J  J  0OT    (J(6m)  j 

where  8l,  S^,  ...  Bm,  are  the  intervals  of  a  net  fitted  on  to  (a,  b),  converge?  to 
rb 
I    {0  (x)}2  doc,  for  any  system  of  nets.   If  {<f>  (#)}a  is  not  summable,  the  above  sum 

J  a 

diverges,  for  a  system  of  nets. 

In  case  <j>(x)  is  integrable  (R),  the  truth  of  the  theorem  is  obvious;  for 
I       (j)  (x)  dx  is  equal  to  Sf  multiplied  by  a  number  lying  between  the  upper  and 

J  (Sr) 

lower  boundaries  of  0  (x)  in  8r;  and  thus  the  expression  reduces  to  the  Riemann 

rb 

sum  employed  in  the  definition  of     {0  (a?)}8  dx.    Thus  the  theorem  holds,  in 

J  a 

particular,  for  a  continuous  function  0  (x). 
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Next,  let  <j>(x)  be  any  function  that  is  non-negative,  summable,  and  bounded, 
in(a,6).  Acontinuous  function/(#),also  non-negative,  can  be  so  determined  that 

f  I  {*(*))'-{/<*)!'  I  <**<«. 

J  a 

rb 
and  consequently  that  I   {<f>  (x)  —  /(#)}2  dx  <  e,  (see  §  433);  where  e  is  an  arbi- 

J  a 

trarily  prescribed  positive  number. 

r-m   1     (  r  )2 

The  sum  2    -R-  i  /      <f>(x)dx\   may  be  expressed  by  $j  +  $2  +  $s>  where 

r  =  l    °r  (J  (Sr)  ) 

r~m    Iff  ")  2  r  =  m    ~\ 

'- 


(Sr)  r-l   f>r         (Sr) 

r-m 


and  £3  =  2  2  {(/>(«)-/(»}  da?        f(x)dx. 

r  =  l    Or  J  ($r)  J  (Sr) 

By  using  the  Schwarzian  inequality  (§  396),  we  have 

{  f    l^(*)-/(«)}  ^|2  ^  «r  f    {^(*)-/W^; 

U  (W  J  »  (*r) 

and  hence  we  find  that  Sj_<  e.  Also,  since  f(x)  is  continuous,  we  have,  for  all 
nets  of  sufficiently  large  order,  in  any  system  of  nets, 

f  b  [f(x)Y  dx>8z>  f  {f(x}}*  dx  -  e. 

J  a  J  a 

We  also  have 

S3 

and  thus  S3  1  <  2e?7/(6  —  a);  where  U  denotes  the  upper  boundary  of  <£(#); 
since  the  function  f(x)  may  be  so  chosen  that  its  upper  boundary  does  not 
exceed  that  of  0  (x). 

Since  e  is  arbitrary,  8l  and  $8  are  arbitrarily  small;  and  $2  differs  from 
rb 
I   {(£>(x)}2  dx  by  less  than  2e.    It  now  follows  that 

.'  a 

r  =  m    Iff  ")2 

S    j\         4>(x)dx\ 

r  =  l    (>r   (J  (Sr)  J 

f6 

converges  to  I    {<^>  (x)}2  dx,  as  the  summation  is  taken  successively  over  the 

J  a 

nets  of  any  system  of  nets. 

Next,  let  <j>(x)  be  unbounded,  but  everywhere  ^  0;  and  let  ^NS^)  have 
the  same  value  as  $(#),  at  a  point  where  <£  (x)  £  Ns,  and  have  the  value  Ng> 
at  a  point  where  </>  (&•)  >  Ns. 

r=mifr  )2 

Denoting  by  amg  the  number   2    -^  \\      ^Ai(*)<^f  5  as  s  nas  ^ne  values 

r  =  l    °r  (J  (Sr) 

in  an  increasing  sequence  such  that  Ns  ~  oo  ,  and  m  increases  indefinitely  as 
we  proceed  through  the  nets  of  a  given  system,  the  numbers  ains  form  a  mono- 

39—2 
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tone  double  sequence.   For  a  sub-division  of  any  net  increases  the  value  of  am8> 

(f)    +  W  )2        T)  2        ft  2 

as  is  easily  seen  from  the  inequality  ^ — £^-  S  *r-  +  ^r- .     If  either  of  the 
repeated  limits  lim  lim  ams,  lim  lim  ams  exists,  then  also  the  other  exists, 

and  the  two  have  the  same  value  (see  §  388). 

rb  rb 

Now  lim  ams=  I    {<f>xs(x)}2dx,  and  Km  lim  ams=\   {<j>(x)}2dx', 

r 
hence  lim  lim  ams  =  I   [<f>  (x)}2  dx, 

and  thus  the  limit,  as  we  proceed  through  a  system  of  nets,  of 

r—m  1    (   /"  )  2  rb 

£    jr  s  I       <f>  (x)  dx  [  ,  is  I    l<f>  (x)}2  dx. 

r-l  Or  (J  (Sr)  )  Ja 

Lastly,  let  (f>(x)  be  any  summable  function  whose  square  is  summable,  and 
which  may  have  both  positive  and  negative  values.  It  may  be  expressed  as 
the  difference  of  two  summable  functions  fa  (x),  fa  (x),  each  of  which  is  =  0, 
for  all  values  of  x. 


Since  2^-      I       {^>i  (x)  +  <£2  («)}2  dx\    converges  to       {<f>l(ac) 

Or  \_J  (Sr)  J  Ja 

rb  rb 

that  is  to      {fa  (x)}2  dx  +  I   {<£2(#)}2e&e;  and  also 

J  a  J  a 

1    (  f  )a  rb 

S  -K-  •{  I      (^)l(x)dx\   converges  to       {fa  (x)}2  dx, 

Or  (J  (Sr)  )  J  a 

i  (  r  )2  r& 

and  2  £-  \\       <f)z(x)dxy  converges  to  I    {02  (x}}2  dx, 

Or    (J  (Sr)  )  J  a 

-A  i  r  r 

it  follows  that  ^  -r-  I      </>j  (x)  dx        <j>2  (x)  dx  converges  to  zero. 

Or  J  (Sr)  J  (Sr) 

It  now  appears  that 

i  (  r  r6 

2  5-  •II     {rf>!  (x)  —  d>2  (^)}2  dx   converges  to   I   {^>i  (x)  —  fa  (x)}2  dx  ; 

Or  (J  Sr  Ja 

and  thus  the  theorem  is  established  generally  for  every  function  </>  (x)  whose 
square  is  summable. 

The  following  theorem  which  includes  the  theorem  established  above  may 
now  be  stated: 

If  fa  (x),  fa(x)  be  any  functions  whose  squares  are  summable  in  the  interval 
(a,  b),  the  sum 


fa(x)dx\      fa(x)dx  converges  to       fa(x)fa(x)dx, 

)  J  (Sr)  Ja 

as  the  summation  is  taken  successively  over  the  nets  of  any  system  of  nets  fitted 
on  to  (a,  b). 
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To  prove  this  theorem,  we  observe  that  fa  (x),  fa  (as)  may  be  replaced  by 
fa+  (x)  —  fa-  (x),  fa+  (as)  —  far  (x),  respectively,  where  the  four  functions  in 
these  expressions  are  all  non-negative.  As  the  theorem  holds  for  each  of  the 

integrals 

rb  rb 

fa+  (x)  fa+  (x)  dx,        fa+(x)fa~(x)dx, 

J  a  J  a 

rb  rb 

fa-  (x)  fa+  (x)  dx,          fa-  (x)  fa~  (x)  dx, 

J  a  J  a 

it  clearly  holds  also  for 

f  {fa+  (x)  —fa-  (x)}  [fa+  (x)  -  fa-  (x)}  dx, 

J  a 

rb 
that  is  for  I   fa  (x)  fa  (x)  dx. 

J  a 

451.  Let  x  =  g  (y),  a  =  g  (a),  b  =  y  (/3),  where  g  (y)  is  a  continuous,  non- 
diminishing  function  of  y,  in  the  interval  (a,  /3);  also  \etf(y)  =  F  {g  (y)},  where 

F(x)=T(f>(x)dx. 

•  a 

The  sum  employed  in  the  theorem  of  §  450  then  becomes 


where  the  net  (y0,  2/1),  (yi,  y.2),  ...  (ym-\,  ym)  corresponds  to  the  net 


It  will  now  be  shewn  that  : 

The  necessary  and  sufficient  condition*  that!      ^    /  -T    should  exist  is  that, 

if  x  =  g(y),  the  function  F(x)~f(y)  should  be  the  indefinite  integral  of  a 
function  (f>(x)  whose  square  is  summable  in  the  interval  (a,  b),  of  x. 

(P[df(y)]2      fb 
Moreover  I     -jf\   —  I   [<f>(x)}2dx;  the  Helling  er  integral  being  thus  ex- 

pressed as  an  L-integral. 

To  establish  the  sufficiency  of  the  condition,   let  it  be    assumed   that 
F(x)  =  I   <f>  (x)  dx,  where  {<£  (x)}2  is  summable. 

J  a 

/XT 
0  (x)  dx,  and  thence  we  see  that 
fr-l 


*  This  theorem  was  given  by  Hahn,  see  Monatshefte  fiir  Math.  u.  Physik,  vol.  xxm,  p.  172. 
Another  proof  was  given  by  Hobson,  Proc.  Land.  Math.  Soc.  (2),  vol.  xvm,  p.  258. 
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and  therefore  T  {FM-F(x*-$*  <,  [  *  {$  (a.)j.  ^ 

r=l  «#r       #V—  1  -'« 

for  any  net  fitted  on  to  (a.  b\    It  follows  that  the  limit  of          /  \  (    exists. 

J«    dg(y} 

Next,  let  the  existence  of  the  Hellinger  integral  be  assumed.    It  is  evident, 
from  the  definition,  that  I       ,    .  {    exists,  for  any  value  of  y  in  the  interval 

L  dg(y) 

(a,  /3),  and  that  it  is  a  monotone  increasing  function  of  y,  say  ^r  (y). 

We  have  then,  in  any  interval  of  y,  contained  in  (a,  /3),          , 
and  thus 


j  F  (av)  -  F(x^  |  ^  [(af  -  aw)  {%  (av)  -  x  (av_,)}],  where  ^(y)  =  x  («)• 
From  this,  we  have 


V  BY-y  \  —  F  (r  \  i  <  51  [( r  —  r  ^  i-v  <V  ^  —  -v  <V  1 
-v  I  -P  V*V/  *  \xr-\)  !  =  *  lA-*'1  *T— V  1%  \*»V  ~~  X  \xr—\) 
r=l  r=l 

*[(&-a)fc(&)-x(a)}]*; 

and  thus  the  function  F(x)is  of  bounded  variation  in  (a,  6);  from  which  it 
follows  that  F'  (x)  exists  for  almost  all  the  values  of  x.  In  order  to  shew 
that  F  (x)  is  the  indefinite  integral  of  F'  (x),  it  is  sufficient  to  shew  that  the 
variation  of  F(x)  over  any  set  of  points  of  measure  zero  vanishes  (see  §  407). 
Such  a  set  can  be  enclosed  in  the  intervals  (8)  of  a  non- overlapping  set  of 
which  the  measure  is  arbitrarily  small.  The  variation  of  F(x]  over  this  set 

of  intervals  is,  as  above,  ^  [SS  {%  (b)  —  y  (a)}P>  where  28  is  arbitrarily  small; 
hence  the  variation  is  zero.  Denoting  F'  (x),  at  every  point  at  which  it  exists, 
by  $  (x),  we  may  suppose  <£>  (x)  to  have  the  value  zero  at  all  points  at  which 

F'  (x)  does  not  exist.    The  function  (/>  (x)   being  summable  in  (a,  b),  the 

rb 
theorem  of  §  450  is  applicable,  and  it  follows  that  I  {<£(a.')}2c?a;exists,  and  is  the 

J  a 
r=m  \P  (x\  —  F  (x  ._  )12 

limit  of  the  sum  £  - —  — ,  for  any  system  of  nets  fitted  on  to  (a,b). 

"Thus  we  have 


and  accordingly  the  theorem  has  been  established. 

It  is  sufficient  for  the  existence  of  the  Bellinger  integral  , 

-!  a         dg  (y) 

that  the  functions  F1  (x),  F2  (x)  should  be  the  indefinite  integrals  of  two 
functions  fa  (x),  <f)2(x)  whose  squares  are  summable  in  (a,  b)',  where  Fl(x), 
F2  (x)  are  the  functions  which  are  equivalent  to  /i  (y),  f2  (y),  and  x  =  g  (y}. 
In  accordance  with  the  second  theorem  of  §  449,  we  then  see  that 
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452.  The  theorems  of  §  450  can  be  generalized  so  as  to  relate  to  a  function 
(j>  (x),  such  that  {<£(#)}1+;i  is  summable  in  the  interval  (a,  b),  where  A,  is  a  positive 
number.  The  proofs  of  the  theorems  in  the  extended  form  are  essentially 
similar  to  those  of  the  original  theorems.  Thus  we  obtain  *  the  theorems : 

If  </>  (as)  be  a  function  that  is  non-negative  in  the  interval  (a,  b),  and  such 
that  {(f>  (#)}1+A  is  summable  in  (a,  b),  where  \  denotes  some  positive  number, 
then  the  sum 

r=m    Iff  )  !+A 

2    j-7  ]  I       $>(x)dx\ 

r=l    Or     (J  (Sr)  ) 

rb 
converges  to      {<$>  (x)}1+*dx,  as  the  sum  is  taken  successively  over  the  nets  of  any 

J  a 

system  of  nets  fitted  on  to  (a,  b).    In  case  X  is  a  positive  integer,  <j>  (x)  may  be 
any  summable  function,  not  necessarily  of  one  sign  in  (a,  b). 

Bellinger's  definition  may  be  extended  to  apply  to  the  integral 


defined  as  the  limit,  when  it  exists,  of  the  sum 


In  this  case: 


rp  r^f  (w)]1+A 

The  necessary  and  sufficient  condition  that  I    —  r  ,    ;    -,.    should  exist  is  that 

Ja   [dg(y)Y 


the  function  F(x)  (=f(y))  should  be  the  indefinite  integral  of  a  function  $(x) 
such  that  [<£(#)]  m  is  summable  in  (a,  b).    Moreover 

+ 

Other  generalizations  of  the  Bellinger  integral  have  been  made  by  Radon  f 
and  by  E.  H.  Moore  J. 

*  See  Hobson,  loc.  cit.,  p.  263.    See  also  F.  Eiesz,  Math.  Annalen,  vol.  LXIX,  p.  462. 
t  See  the  memoir  "Absolut  additive  Mengenfunktionen,"  Wiener  Sitzunysbcr.  vol. 
p.  1351. 

J  See  Hildebrandt,  Bull.  Amer.  Math.  Soc.  (2),  vol.  xxiv,  p.  198. 


CHAPTER  VIII 

NON-ABSOLUTELY  CONVERGENT  INTEGRALS 

453.  THE  definition  of  an  integral  by  Harnack,  given  in  §  443,  has  been 
shewn  (§  444)  to  be  equivalent  to  that  of  Lebesgue,  in  the  case  of  absolutely 
convergent  integrals,  that  is,  when  the  absolute  value  of  the  function  has  an 
integral  in  accordance  with  Harnack's  definition.  We  proceed  to  consider 
the  integral  of  a  function  f(x)  over  a  linear  interval,  in  accordance  with 
Harnack's  definition,  when  that  integral  is  non-absolutely  convergent,  that  is, 
when  \f(x}  \  is  not  integrable  over  (a,  b)  in  accordance  with  Harnack's  defi- 
nition. A  point  of  (a,  b)  which  is  such  that  f(x}  is  not  summable  in  any 
interval  which  contains  the  point  within  it,  or  at  an  end-point,  may  be  called 
a  point  of  non-summability,  or  a  Harnack  point,  for  the  function  /(«).  It  is 
clear  that  the  points  of  non-summability  of  f(x)  form  a  closed  set  H,  for 
if  P  be  a  limiting  point  of  H,  any  interval  that  contains  P  contains  points  of 
the  set,  and  consequently  f(x)  is  not  summable  in  the  interval.  The  points 
of  H  are  points  of  infinite  discontinuity  of  f(x\  but  H  is  not  necessarily 
identical  with  the  set  G,  employed  in  §  443,  of  all  the  points  of  infinite  dis- 
continuity of  f(x).  In  any  interval  interior  to  a  contiguous  interval  of  H  the 
function  f(x)  is  summable,  though  it  is  not  necessarily  bounded.  The  defi- 
nition of  §  443  will  be  modified,  by  employing  the  set  H  instead  of  G;  so 
that  the  L -integrals  employed  in  the  definition  are  not  necessarily  i-integrals 
of  a  bounded  function,  those  points  of  infinite  discontinuity  of  f(x}  which 
are  not  points  of  non-summability  being  interior  to  the  contiguous  intervals 
of  H.  Thus  the  definition  takes  the  following  form  : 

If  there  be  a  non-dense  closed  set  H,  of  content  zero,  of  points  of  non-summa- 
bility off(x)  in  (a,  b),  and  H  be  enclosed  within  intervals  of  a  finite  set  A,  such 
that  each  interval  of  A  contains  at  least  one  point  of  H,  then  if  the  L-integral 

I       f(x)dx  exists,  and  is  such  that,  corresponding  to  an  arbitrarily  chosen 

J  C(A) 

number  e,  a  number  £e)  which  converges  with  e  to  zero,  exists  such  that 
\       f(x)dx-\         f(x)dx 

•'  C  (A)  •  C  (A') 

for  all  suchpairs  of  sets  of  intervals  A,  A',  provided  m  (A)  <  £e,  m  (A')  <  ££,  then 
the  limit  of  I        f(x)dx,  as  m(A)~0,  defines  a  number  which  is  denoted  by 

J  C(A)' 

rb 
f(x)  dx,  and  is  called  the  Harnack- Lebesgue,  or  H  L-integral,  off(x),  in  (a,  b). 

J  a 

The  HL -integrals  of  functions  contain  as  a  sub-class  the  Harnack-Riemann, 
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or  #.R-integrals,  in  which  the  integral  of  the  function  over  any  interval 
interior  to  a  contiguous  interval  of  H  is  an  .^-integral. 

It  will  appear  that  the  condition  that  each  interval  of  A  must  contain  a 
point  of  H  is  an  indispensable  condition.  It  has  been  shewn  (§  444)  that,  in 
the  case  of  absolutely  convergent  integrals,  the  corresponding  condition  re- 
lating to  G  is  unnecessary.  When  this  condition  is  a  necessary  one,  the 
integral  has  been  termed*  by  E.  H.  Moore  a  narrow  integral.  In  the  contrary 
case  the  integral  is  called  a  broad  integral ;  and  it  will  appear  that  a  broad 
integral  is  necessarily  absolutely  convergent,  so  that  the  set  H  does  not  exist. 

It  will  now  be  shewn  that,  if  f(x)  have  an  .ffZ-integral  in  (a,  b),  it  has 
an  .HX-integral  in  any  interval  (a,  b')  contained  in  (a,  b). 

Taking  two  sets  of  intervals  A,  A',  as  in  the  definition  given  above,  we 
have,  for  every  value  of  e, 

rb  rb 

/A  (x)  dx  —      /A'  (x)  dx  <  e, 


provided  m  (A)<  £,  m(A')<  £,;  where  f^(x)  =f(x),  in  the  intervals  C(A),  and 
/A  (x)  =  0,  in  A;  with  a  corresponding  definition  for/A'  (x).  Assuming  that  this 
condition  is  satisfied  for  every  value  of  e,  it  will  be  shewn  that 

rb'  rb' 

/A  0)  dx  -      /A,  (as)  dx    <  e, 

J  a'  J  a' 

provided  A,  A'  satisfy  the  more  stringent  conditions  m  (A)  <  £  £, ,  m  (A7)  <  ^-£. 
Let  it  be  assumed  that,  if  possible,  A  and  A'  can  be  determined  so  as  to  satisfy 
these  last  conditions,  and  so  that 


rb'  rb' 

I    /A  (x)  dx  —  I    /A'  (x)  dx 

J  a'  J  a' 


^  6. 


It  will  then  be  shewn  that  finite  sets  of  intervals  A,  A'  can  be  determined 
which  satisfy  the  condition  that  each  interval  of  either  set  contains  at  least 
one  point  of  G,  and  such  that 


/A  (x)  dx  —  I   /A'  (x)dx 

%  J  a 

and  since  this  is  contrary  to  the  hypothesis  made  above,  the  impossibility 
of  the  assumption  will  have  been  demonstrated. 

To  define  A,  A'  we  take  each  interval  of  A,  in  (a',  &'),  as  an  interval  of  A, 
and  each  interval  of  A',  in  (of,  b'),  as  an  interval  of  A'.  Further,  we  take  for 
the  parts  of  A,  A'  in  (a,  a)  and  (b',  b),  the  set  of  those  intervals  which  are 
common  to  the  parts  of  A  and  A'  that  lie  in  (a,  a)  and  (b',  b).  In  case  a  is 
contained  in  intervals  (a,  ft),  (a,  ft')  of  A,  and  of  A',  respectively,  we  take 
(«',  ft),  (a,  ft')  as  intervals  of  A  and  A'  respectively,  where  «'  >  a.  A  similar 
specification  will  refer  to  b'. 
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It  is  now  clear  that/A  (x)  =/A  (x\  and/A-  (x)  -fa  (x),  when  *  is  in  (a',  6'); 
and  that,  in  (a,  a')  or  in  (6',  b\  we  have/A«)  -fa  (x).    It  follows  that 

Cb  rb  rV  rb' 

/A  (as)  dx  -     fa  (x)  dx  =  \   fa  (x)  dx  -      fa  (x)  dx, 

•J  a  J  a  J  a'  .'  «' 


and  hence  that 


rb  rb 

fa  (x)  dx  —  \    fa  (x}  dx 

J  a  J  a 


Moreover  it  is  clear,  from  the  mode  of  construction  of  A,  A',  that 


The  impossibility  in  question  has  therefore  been  demonstrated. 

Since,  for  every  pair  of  numbers  a,  b',  such  that  a  ^  a  <  b'^  b,  corresponding 
to  an  arbitrarily  chosen  e,  the  number  ^£  can  be  so  chosen  that 

fb'  r  b' 

J  a'  .'  a' 

for  every  pair  of  sets  A,  A'  that  enclose  the  set  H  narrowly,  and  such  that 

rv 
m  (A)  <  !£,  m  (A')  <  |£ ,  it  follows  that  I   f(x)  dx  exists. 

J  a1' 

Moreover,  since  £  is  independent  of  a'  and  b',  we  have  established  the 
following  theorem  : 

rb  rb' 

If  I    f(x)  dx  exists  as  an  HL-integral,  then  I     f(x)dx  also  exists,  where 

J  a  J  a1' 

a  ^  a  <b'  =  b ;  and  the  convergence  of  this  integral  is  uniform  with  respect  to 
a  and  b'. 

The  last  part  of  the  theorem  expresses  the  fact  that 

rb'  rb' 

f(x)  dx  —  I    /A  (x)  dx  <  e, 

J  a'  J  a' 

provided  m  (A)  <  £e,  for  every  value  of  a'  and  &',  the  number  £e  depending  on  e. 

454.    It  will  be  shewn  that : 

For  the  HL-integral,  the  theorem 

rb  re  rb 

f(x)dx=\   f(x}dx+\    f(x)dx 

J  a  ^  a  J  c 

is  valid. 

This  follows  from  the  corresponding  theorem  for  the  Z-integral  of  f±  (x). 
For  it  appears  that,  employing  the  last  theorem,  the  expressions  on  the  two 
sides  of  the  equation  differ  from  one  another  by  less  than  2e ;  and  since  e  is 
arbitrary,  their  equality  is  established. 

Since  the  existence  of  the  JSX-integral  of  f(x)  in  any  sub-interval  (a',  b'), 
of  (a,  b),  has  been  shewn  to  be  a  necessary  consequence  of  the  existence  of 
the  //Z-integral  over  (a,  b),  it  is  clear  that  the  integral  of  f(x)  taken  over 
any  finite  set  of  non-overlapping  intervals,  contained  in  (a,  b),  also  exists ; 
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being  the  sura  of  the  integrals  taken  over  the  separate  intervals.  However, 
if  a  non-finite  set  of  non-overlapping  intervals  be  taken  in  (a,  b),  it  is  not  in 
general  true  that  the  sum  of  the  integrals  taken  over  these  intervals  con- 
verges to  a  definite  number,  unless  the  integral  of  f(x)  is  absolutely  convergent, 
which  case  has  been  treated  in  connection  with  the  Z-integrals.  It  will  in 
fact  be  shewn,  by  means  of  an  example,  that  the  property  in  question,  that 
f(x)  is  integrable  over  a  non-finite  set  of  intervals  in  (a,  b),  does  not  apper- 
tain to  non-absolutely  convergent  integrals,  and  must  be  regarded  as  peculiar 
to  absolutely  convergent  integrals.  This  does  not  however  seem  to  be  a 
sufficient  reason  for  refraining  from  applying  the  term  "  integral "  to  non- 
absolutely  convergent  integrals,  as  defined  above. 

455.     The    following   property   of    an    indefinite    #Z-integral    will    be 
established : 


The  HL-integral  I    f(x)dx  is  a  continuous  function  of  the  upper  limit  x. 

J  a 

Denoting  the  indefinite  integral  by  F(x),  we  have,  in  accordance  with  the 

addition  theorem  proved  in  §  454, 

rx+h 
F(x  +  h)-E(x}=\       f(x)dx. 

J  x 

Employing  the  function  /A  (x),  which  vanishes  at  all  points  of  the  intervals 
of  the  set  A  enclosing  the  set  of  points  H,  we  have 

I  rx+h  rx+h 

f(x)  dx-          /A  (x)  dx   <  e, 

I  -'   X  J    X 

provided  m  (A)  <  £,  for  every  point  x,  and  for  all  values  of  h,  such  that  x  +  h 

rx 
is  in  (a,  b).    The  integral  I    f^(x)dx  being  a  continuous  function  of  x,  the 

J  a 
rx+h 
numerical  value  of         /A  (as)  dx  in  the  interval  (0,  h^,  of  h,  is  less  than  e,  if 


'x+h 


A!  be  properly  chosen.    Hence  the  numerical  value  of          f(x)  dx  is  less 

J    X 

than  2e,  if  h  is  in  the  interval  (0,  AJ.    Since  e  is  arbitrary,  this  shews  that  F(x) 

is  a  continuous  function  of  x,  on  the  right,  in  the  interval  (a,  b).     Since 

rx 
F  (x)  —  F  (x  —  h)=  I       f(x)  dx,  it  can  be  shewn,  in  a  similar  manner,  that 

J  x—h 

F(x)  is  continuous  at  x,  on  the  left. 

rx 
In  an  interval  of  the  set  C  (A),  I    f(x)dx  differs  by  a  constant  from  an 

J  a 

[x 

Z-integral,   and   therefore        f(x)dx   possesses,  almost  everywhere   in  the 

-    a 

interval,  a  differential  coefficient  equal  to  f(x).    Since  the  measure  of  C(A) 
converges  to  b  —  a,  when  A  belongs  to  a  sequence  |An],  such  that  m(An) 
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converges  to  zero,  we  have  the  following  theorem,  which  is  the  extension  to 
the  ZTZ-integral,  of  the  property  of  the  //-integral  proved  in  §  405. 

~x 

The  indefinite  HL-integral      f(x)dx  has,  almost  everywhere  in  (a,  6),  a 

.   a 

differential  coefficient  of  which  the  value  isf(x). 

THE  ZTL-INTEGRAL  OVER  A  FINITE  SET  OF  INTERVALS. 

456.  An  extension  of  the  theorem  that  the  integral  of  f(x)  over  any 
interval  contained  in  (a,  b)  necessarily  exists,  if  the  .HX-integral  over  (a,  6) 
exists,  will  now  be  made. 

Let  D  denote  a  finite  set  of  intervals,  each  one  of  which  is  contained  in 
an  interval  complementary  to  the  set  H,  and  no  two  of  which  are  contained 
in  any  one  such  interval. 

Using  the  notation  of  §  453,  as  regards  A(1),  A(2),  such  that  m(A(1))  <  £S€> 
ra(A(2))  <  £8e,  it  will  be  shewn  that 

I      /A<D  (x)  dx  —  \      /A(2>  (x)  dx    <  €. 

J  (D)  J  (Dl 

For,  assuming  that  the  expression  on  the  left-hand  side  is  ^  e,  we  can 
define  A(3),  A(4>  as  follows  : 

The  parts  of  A(3),  A(4)  that  are  interior  to  C  (D),  which  is  a  finite  set  of 
intervals,  we  take  to  consist,  in  each  case,  of  those  intervals  that  are  common 
to  parts  of  A(1)  and  A(2)  interior  to  C  (D}.  If  intervals  (a,  ft),  (a,  ft'),  of  A(1)  and 
A(2),  contain  an  end-point  av,  of  an  interval  of  D,  we  take  (a',  ft),  (a',  ft'),  when 
a'  >  a,  or  (a,  ft),  (a.  ft),  if  a' <  a,  as  intervals  of  A(3)  and  A(4)  respectively.  No 
interval  of  A(l)  or  A(2)  is  interior  to  an  interval  of  D. 

We  have  now  m(A(3))  <  £8e,  m(A(4))  <  £8, ;  and 

/A(3)  (x)  =/A<D  (X),  /A<4)  (x)  =_/k<2)  (X), 

where  x  is  in  D\  also,  in  C(D),  we  have  f^?)  (x)  =  f&w  (x} ;  it  then  follows  that 

rb  rb 

/A(s)  (x)  dx  -  I    /A(4)  (x)  dx  j  ^  e, 

J  a  J  a 

which  is  inconsistent  with  the  conditions  that  m(A(3))<  ^S6,  m(A(4))<  |-Se. 
Hence  I     /A<D  (x)  dx  —       f^m  (x)  dx  <  e. 

\J(D)  J(D) 

\  r  r 

It  follows  that  |  I       f(x)dx—\      f±(x)dx    <  e,  provided  ra(  A)  <  ^Se.    It 

I  J  (D)  J  (D) 

has  thus  been  established  that : 

If  D  denote  a  finite  set  of  intervals,  each  of  which  is  in  one  of  the  comple- 
mentary intervals  of  H,  and  no  two  of  which  are  in  one  and  the  same  such 


455-457]        The  HL-integral  over  a  finite  set  of  intervals        621 

complementary  interval,  then,  if  e  is  prescribed,  there  exists  a  number  $8f,  such 
that 

\      f(x)  dx  —  I     /4  (x)  da;   <  e 

J  (D)  J  (D) 

for  every  such  set  D,  and  every  set  A  enclosing  G  narrowly,  such  that 


This  is  a  particular  case  of  a  somewhat  more  general  theorem  given  by 
E.  H.  Moore,  which  allows  more  latitude  as  regards  the  set  of  intervals  D. 

Since  the  function  f&,  (x)  is  summable,  we  see  that,  in  accordance  with 
the  property  of  summable  functions  established  in  §  392, 


J( 


(x)  dx   <  e, 


provided  m  (D\  is  less  than  some  number  Se'  which  converges  to  zero  with  e. 
Also,  provided  D  satisfies  the  condition  of  the  above  theorem,  we  have 


It  now  follows  that 


j        f(x)  dx  -        /A  (x)  dx 

I  •  (D)  -'  (D) 

J  (D 


<  6. 


}  dx 


<2e. 


We  have  now  the  theorem  that : 


If  D  be  a  set  of  intervals,  finite  in  number,  each  of  which  is  in  an  interval 
complementary  to  the  set  H,  of  points  of  non-summability,  and  no  two  of  which 
are  in  the  same  such  interval,  then 


f(x)  dx 
J(D) 


<  6 


for  all  such  sets  D,  provided  m  (D)  <  St" ;  where  e  is  arbitrary,  and  £e"  depends 
upon  e,  and  converges  to  zero  as  e  does  so. 


THE  CONDITIONS  FOR  THE  EXISTENCE  OF  AN  ^TZ-INTEGRAL. 

457.  The  following  theorem,  due  to  E.  H.  Moore*,  contains  the  necessary 
and  sufficient  conditions  for  the  existence  of  the  HL-integral  of  a  function 
/(#),  defined  in  the  linear  interval  (a,  6),  in  which  H  is  the  set  of  points  of 
non-summability  of  f{x).  The  content  of  H  is  assumed  to  be  zero  : 

The  complementary  intervals  of  H  being  denoted  by  (av,  bv),  the  necessary 

rb 

and  sufficient  conditions  for  the  existence  of  I    f(%)  dx  are, 

J  a 

/bv 
f(x)dx  shall  exist,  each  such  integral  being 
<»„ 

*  Trans.  Amer.  Math.  Soc.,  vol.  n,  p.  324. 
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defined  as  the  limit  of  f(x)  dx,  when,  e,  e  converge  independently  to  the 

limit  zero,  and, 

(2)    that  a)l  + a)2  + ...  +  cov  shall  converge  to  a  definite  number,  as  v  is 

rx 
indefinitely  increased ;  where  &>„  denotes  the  fluctuation  of        f  (x)  dx  in  the 

J  av 

interval  (av)  &„). 

Moreover,  when  the  conditions  (1)  and  (2)  are  satisfied,  the  sum  2  I     f(x)dx 

r=lJ  av 
f 

is  convergent,  and  its  limit,  as  v+*  oo  ,  is  I    f(x)  dx. 

J  a 

rb 
To  shew  that  the  conditions  are  necessary,  we  assume  that  I    f(x)dx 

J  a 

A 

exists;  it  then  follows  from  the  theorem  of  §  453,  that  I    f(x)dx  exists.    If 

£„',  £„"  be  the  points  of  (a,,  bv)  at  which  I    f(x)  dx  attains  its  maximum  and 

f' 

minimum  values  we  have  o)v=  I      f(x)dx. 

J  fv" 

00 

The  number  p  may  be  chosen  so  large  that     2    (bv  —  av}  <  S".    For  some 

V  =  |U.  +  1 

values  of  v  we  may  have  ^/  >  %v",  and  for  others  £„'  <  £v" ;  one,  or  both,  of 
these  two  sets  of  values  of  v  will  be  an  infinite  set.    If  £&>„  is  not  convergent, 

a  number  /*'(>  u  +  1)  can  be  found  such  that     %    wv  >  2e.    One  at  least  of 


the  two  sets  of  those  values  of  v  between  /A  -f-  1  and  //  for  which  ^/  >  £„", 
and  for  which  f  /  <  f  „",  must  be  such  that  the  sum  of  the  corresponding 
values  of  wv  is  >  e,  and  the  sum  of  the  corresponding  intervals  j  £„"  —  £„'  |  is 
<  £/'•  We  have  thus  a  finite  set  of  intervals  D,  whose  sum  is  <  Se",  all  con- 
tained in  complementary  intervals  of  0,  no  two  of  which  are  in  the  same  such 

interval,  and   such   that  f(x}  dx    >  e,  which    is    contrary   to    the    last 

\->(D) 

theorem  of  §  456. 

It  follows  that  2&>v  must  converge. 

Further,  let  («„,  $„)  be  interior  to  (av,  bv}  and  such  that 


We  may  choose  m  so  that  S  (&„  —  av)  >  b  —  a  —  77,  where  77  is  positive,  and 

f=i 
arbitrarily  chosen.    Then 

v=m  /  1\  1 

2  (/8V  —  £*„)  >  (6  —  a  —  77)    1  —  T    >  6  —  a  —  ij  —  T  (b  —  a). 


T 

» 

If  we  take  A  to  be  the  set  of  intervals  complementary  to  («„,  /?„),  for 
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v  =  1,  2,  3,  ...  m,  we  have  m  (A)  <  t]  +  r  (b  -  a)  <  Bt  if  77  and  A;  are  properly 
chosen.    It  then  follows  that 

\b  f(x}dx-^\  "f(x)dx   <e. 

•/  a  v=l  J  a,, 

„=»!  r&v  v  =  m  r'V 

Also  A;  may  be  so  chosen  that  2   |     f(x}dx  differs  from  21    f(x)dx\)y 


less  than  e  ;  we  have  then 


|f*/(«)(to-Tr/(«)<to 

I  ./  a  v=l  •/  a.. 


<  2e. 


oo      /<x  fb 

It  now  follows  that  21    /(#)  dx  converges  to  the  value  of     f(x)dx. 

f=l-'  a  J  a 


S  &>„  <  ^e ;  then  if  («„,  /S,)  denote  any  interval  contained  in  (av,  bv),  we  have 
K'+I 


To  prove  that  the  conditions  (1)  and  (2)  are  sufficient;  let  r/  be  such  that 

;note  any  inten 

2          f(x)  dx 

v=v'+l'  av 

Let  the  finite  set  A  enclose  H  narrowly ;  the  complementary  set  C  (A)  is 
a  finite  set  of  intervals  interior  to  a  finite  set  of  the  intervals  C  (H)  comple- 
mentary to  H.  Assume  that  these  intervals  C  (H)  are  arranged  in  descending 
order  of  magnitude.  Those  of  them  that  are  of  length  >  m  (A)  must  each 
contain  an  interval  of  C  (A),  and  others  may  also  contain  intervals  of  G  (A). 
Let  the  first  s  of  the  intervals  (av,  &„)  be  each  of  length  >m(A).  The 

00  "  "v 

convergence  of  2        /(#)  dx  follows  from  that  of  2&>,,.    We  have  then 


v=s 

^  2 


/A  (x)  dx  -   2   I    f(x)  dx 

fOv  r/3,, 

I     f(x)  dx  —  I     f(x)  dx 

J  a,,  J  a,. 


+   2 


+  2 

n'  >  s 


f(x}dx 
where  n'  (>  s)  has  a  finite  set  of  values,  and  (av,  /&„)  are  the  intervals  of  0(A). 

00 

We  may  suppose  s  to  be  so  chosen  that  2  «„  <  e,  and  thus  that 


and  also  so  that    2 


n'  >  s 


f(x)  dx 


f(x}  dx 


<  e.    The  number  s  having  been  so  chosen, 


we  may  suppose  A  so  chosen  that  m  (A)  is  less  than  each  of  the  first  s  inter- 
vals of  C  (H).    Also  m(A)  may  be  so  chosen,  still  smaller  if  necessary,  so  that 


V=S 

2 


f(x)dx-\    f(x)dx 

av  J  a., 


<  €. 
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I   rb  oo     rbv 

We  now  have  |  I    /A  (x)dx  —  S   I    f(x)  dx 

i  J  a  v=lJ  «„ 


<  3e,  provided  m  (A)  is  less 


than  some  number  depending  on  e ;  and  since  this  holds  for  every  value  of  e, 

/b  oo      rbv 

f±(x)dx  must  converge  to  the  value  of  2   I     f(x}  dx,  as 
a  v=\J  av 

rb 
m(A)~0;  and  thus  the  existence  of  I    f(x)dx  has  been  established. 

J  a 

rb 

It  has  been  here  established  that,  if  I    f(x)  dx  exists  as  an  .STL-integral, 

J  a 
rl>v 

it  is  necessary  that  2       f(x)  dx  should  converge,  where  («„,  &„)  denotes  a 

J  av 

contiguous  interval  of  the  set  H  of  points  of  non-sum mability  off(x),  it  being 

r&"  f/3 

assumed  that  m  (H)  =  0,  and  I     f(x)  dx  being  determined  as  lim  I    f(x)  dx, 

J  av  a.~av  •>  »• 

F-&V 

where  (a,  yS)  is  interior  to  («„,  &„),  but  it  is  also  necessary  to  assume  that,  «„  de- 


noting the  fluctuation  of       f(®)  dx  in  (av,  bv),  the  series  of  positive  terms  2<ov 

Jav 
should  be  convergent. 

rb 
458.    A   more   general  definition  of   I    f(x)  dx  has   been  suggested  by 

J  a 

Lebesgue*,  as  a  development  of  a  definition  given  by  Jordan.    This  is  obtained 
by  substituting  for  the  condition  that  2&>n  should  converge,  the  less  stringent 


condition  that 


f(x)  dx 


should  be  convergent.   It  is  clear  that  an  HL- 


integral  is  also  an  integral  in  accordance  with  this  definition  of  Lebesgue,  but 
the  converse  is  not  necessarily  the  case.  It  can  be  seen  that  the  substitution 
of  Lebesgue's  definition  for  that  of  Harnack  involves  the  introduction  of  a 
limitation  in  the  selection  of  the  sets  of  intervals  A  that  include  the  points 
ofH. 

This  limitation  must  be"  such  that,  in  the  proof  of  sufficiency  in  E.  H.  Moore's 


/•Pi- 
theorem,  the  convergence  of  2       I     /(#)  dx 

v=I    i    J av 


can  be  inferred  from  that  of 


f(x)  dx 


In  case  the  sum  of  the  integrals  over  the  intervals  (a,,,  &„)  were  not  absolutely 
convergent,  the  sum  of  the  integrals  themselves,  if  convergent,  would  have  a 
sum  which  would  depend  upon  the  order  in  which  the  intervals  are  taken,  and 
thus  would  not  afford  a  suitable  definition  of  the  integral  over  (a,  b). 

*  See  his  "Eemarques  sur  les  theories  de  la  mesure  et  de  1'integration,"  Annales  de  Vecole 
normale,  (3),  vol.  xxxv,  p.  204. 
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459.  A  method  will  now  be  given  of  constructing  a  function  f(x)  which 
is  continuous  at  every  point  of  the  interval  (a,  6),  except  at  the  point  b,  at 
which  the  function  has  an  infinite  discontinuity  of  such  a  character  that 

b 

f(x)  dx  converges  non-absolutely. 


Let  a  sequence  of  intervals  (al,  b^,  (a2,  b2),  ...  (an,  bn),  ...  be  defined  in  the 
interval  (a,  b),  such  that  no  two  of  them  overlap,  and  that  b  is  the  limiting 
point  of  each  of  the  sequences  (a1}  a2,  ...  an,  ...),  (blt  b2,  ...  bn,  ...).  Let 
Ui  +  u2  +  ...  +  un+  ...  denote  a  non-absolutely  convergent  arithmetic  series;  that 

is,  the  series  is  to  be  convergent,  but  not  the  series  |  u^   +  \u2  +  . . .  +   un  \  + 

In  (an,  bn),  let  f(x)  be  defined  so  as  to  be  continuous,  and  everywhere  of  the 
same  sign,  and  let  f(x)  be  zero  at  an  and  bn.   Further,  let  f(x)  be  so  chosen, 

in  (an,bn\  that  I    f(x)dx=un.    At  all  points  of  (a,  b),  external  to  all  the 

J  an 

intervals  (an,  bn),  \etf(x)  =  0. 

The  function  f  (x),  so  defined  in  (a,  b),  is  continuous,  except  at  the  point  b. 

In  (an,  bn),  the  function  f(x)  has  a  maximum  value  greater  than 
I  un  \/(bn  —  ttn),  and  therefore  f(x)  has  indefinitely  great  positive  and  negative 
values  in  every  neighbourhood  of  the  point  b. 

For,  if  there  existed  a  positive  number  k,  such  that  j  un  \/(bn  —  an)  <  k,  for  all 

r=n  r=n 

values  of  n,  we  should  have    2     un  <  k   2   (br  —  ar),  and  thus  the  series 

r=l  r=l 

oo 

2   \un\  would  be  convergent,  which  is  not  the  case. 
We  have  now 

r  x  r=n 

f(x)dx=   2   un  +  0un+1, 

J  a  r=l 

if  x  lies  in  the  interval  (bn,  bn+1),  where  0  ^  6  ^  1. 

rb  rx 

Also  the  integral  I    f(x)  dx  is  defined  as  lim  I    f(x)  dx ;  and  its  value  is 

J a  a~6  J a 

r=n 

therefore  lim    2  un,  which,  by  hypothesis,  has  a  definite  value. 

rb 
It   is   further    clear    that    I    \f(x)   dx   is   not   finite,   since    the   series 

J   a 

\  ui  \  +  |  ui  \  +  •  •  •  +  |  un  j  +  . . .  is  not  convergent. 

This  result  may  be  employed  to  illustrate  the  fact  that  the  non-absolutely 
convergent  integral  is  not  necessarily  the  limit  of  the  sum  of  the  integrals 
taken  through  a  set  of  intervals  which,  in  the  limit,  converges  to  the  whole 
interval  of  integration;  and  thus  that  such  an  integral  is  not  a  broad  integral. 

Let   the  integral  of  f(x)  be  taken  over  the   intervals  (a,bnt),  (aPl,bPl), 
(ap.2 ,  6^2)  . . .  (aPr,  bPr),  where  pl ,  p2,  ...pr  are  increasing  integers,  all  greater  than 
H.  40 
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m,  and  such  that  ^^Pl,up^,  ...  uPr  are  all  of  the  same  sign.   It  is  clear  that  m 

I'bm  rb 

may  be  so  chosen  that  I    f(x)  dx  is  arbitrarily  near  in  value  to  I    f(x)  dx ; 

•'a  J  a 

then,  for  such  a  value  of  m,  the  integers  ply  p.2,  ...  pr  may  be  so  chosen  that 
uPi  +  UP2  +  ...  +  uPr  is  as  large  as  we  please,  since  the  series  Swn  does  not  con- 
verge absolutely.  As  m  is  increased  indefinitely,  the  set  of  intervals  (a,  bm), 
(aPl,  bPl),  ...  (aPr,  bPr)  converges  to  the  whole  interval  (a,  b),  the  complementary 
part  of  (a,  b)  diminishing  indefinitely,  and  yet  the  sum  of  the  integrals  off(a;) 
taken  over  the  intervals  of  the  set  does  not  converge. 

460.  The  construction  here  given  of  an  /TL-integral,  with  a  single  point 
of  infinite  discontinuity,  may  be  employed  to  illustrate  the  fact  that,  in  the 
definition  of  such  an  integral  given  in  §  453,  the  condition  is  indispensable  that 
the  set  A  of  intervals  must  be  such  that  there  is  at  least  one  point  of  infinite 
discontinuity  in  each  interval  of  A.  This  fact  differentiates  the  non-absolutely 
convergent  ZTX-integral  from  an  absolutely  convergent  integral,  in  which 

f  f  ^ 

\       f  (x)  c&c  converges  to       f(x)dx,  for  every  set  of  intervals  A,  when  m(A)«vO. 

JC  (A)  •« 

It  is  not  in  fact  true  that,  in  defining  the  HL -integral,  the  set  of  points 
alya2,  ...  an,  ...  l>i,l>2,  ...  bn,  ...  b,  which  is  of  content  zero,  may  be  excluded  by 
enclosing  these  points  in  a  set  of  intervals  of  arbitrarily  small  sum. 

For  we  may  include  all  -the  2m  points  alt  a2,  ...  a^  6n  b2,  ...  bm  which  occur 
in  (a,  bm)  in  a  finite  set  of  intervals,  so  that  when  these  are  excluded  from 

the  interval  (a,  bm),  the  integral  I      f(x)  dx  is  altered  by  an  arbitrarily  small 

J  a 
amount.    Again,  we  may  shorten  each  of  the  intervals  (aPl,  bPl),  (aPa,  bPz), ... 

(aPr,  bPr)  at  each  end,  so  that  the  sum  of  the  integrals  off(x)  taken  over  these 
intervals  is  diminished  by  an  arbitrarily  small  amount.  All  the  points  al}a2, ... 
&!,  &2,  ...  are  now  included  in  intervals  of  a  finite  set,  such  that  the  integral 
of  f(x)  over  the  complementary  intervals  is  arbitrarily  great.  These  com- 
plementary intervals  consist  of  those  intervals  which  have  been  obtained  by 
shortening  the  intervals  (aPl,  bPl),  (ap2,bPa),  ...  (apr,bPr),  and  of  the  parts  of 
(a,  bm)  which  remain  when  the  points  alyb1}  a2,  b2,  ...  am,bm  have  been  included 
in  a  suitable  set  of  intervals. 

rb 

Let  </>  (a?)  be  a  function  for  which  I    $  (x)  dx  exists  as  an  /ZX-integral, 

J  a 

with  the  points  aly  a2, ...  an,  ...  b1}b2,  ...  bn,  ...  b,  as  the  set  of  points  of  infinite 

discontinuity,  in  the  neighbourhoods  of  which  <£  (x)  is  not  summable.   Also  the 

rb 
integral        f(x)  dx  exists,  as  constructed  above,  with  its  single  point  of  infinite 

J  a 

[b 

discontinuity  at  b.    It  appears  however  that  I     {/(#)  +  <£  (x)}dx  does  not  exist 

J  a 

as  an  JHX-integral.  The  set  H,  of  points  of  infinite  discontinuity  of  the  function 
f(x)  +  <f>  (x),  consists  of  the  points  ax,  b1}  a2,  &2»  •••  «n»  bn,  •  •  • ,  and  of  the  point  b. 
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We  may  now  define  a  set  A,  of  intervals  enclosing  the  points  of  H,  to  consist 
of  the  finite  set  defined  above  for  the  case  of  the  integral  of /(#).  For  a  sequence 

of  such   sets   A,  I       f(x)dx   increases   indefinitely,   as   ra(A)~0,  whereas 

J  C'(A) 

|       (f>  (x)  dx  has  a  finite  limit;  thus  I       {/(#)  +  </>  (#)}  dx  increases  indefinitely 

rb 

as  m  (A)  ~  0,  and  therefore  I    {•/(«)  +  <£  (x)}  ^  ^oes  no^  exist- 

./a 

461.    The  following  property*  of  an  .HX-integral  will  now  be  established: 

rb 

If       <f)  (x)  dx  is   an  HL-integral,   and  f(x)  be  any  bounded   monotone 

J  a 

fb 

function,  I   f(x)  <f>  (x)  dx  exists  as  an  HL-integral.    More  generally  f  (x)  may 

J  a 

be  any  function  of  bounded  variation  in  (a,  b). 

Let  A,  A'  be  two  sets  of  intervals  enclosing  narrowly  the  points  H  of  non- 
summability  of  <f>  (x),  so  that 

rx  rx 

<f>±  (x)  dx  —  I   <f>±>  (x)  dx  <  e, 

J  a  J  a 

where  m  (A)  <  £e,  m  (A')  <  £e,  for  all  points  x  in  (a,  b).    Let 


we  may  then  apply  the  second  mean  value  theorem  to  the  functions  F(x), 
f(x).    Thus 


where  f  is  some  point  in  (a,  b).    We  have  therefore 

rb  rb 

I   f  (*')  $A  (x)  dx  —  I  f(x)  (f)±'  (x)  dx 

J  a  J  a 

Denoting  the    expression    on  the  right-hand  side  by  e',  we  see   that,   e 

being  arbitrary,  and  £e  depending  only  on  e,  the  condition  is  satisfied  for  the 

rb 
existence  of     f(x)<f>  (x)  dx.    Whenf(x)  is  any  function  of  bounded  variation, 


it  is  the  difference  of  two  monotone  functions /i  (x),f2(x);  and  thus  the  two 

rb  rb 

HL -integrals  I   fl(x)(^>(x)dx,      /2  (x)  (f)  (x)  dx  exist.     Since  the  set  of  points 

J  a  J  a 

of  non-summability  is  the  same  for  the  two  integrands,  it  follows  that 

b  rb 

[fi  (x)  —f2  (x)}  (fr  (x)  dx,  or   I   f  (x)  (f)  (x)  dx , 


exists  as  an  JSTL-integral. 

*  See  Hobson,  Proc.  Loud.  Math.  Soc.  (2),  vol.  vn,  p.  22. 

40—2 
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THE   SECOND   MEAN   VALUE   THEOREM   FOR   AN   jHX-INTEGRAL. 


462.    The  second  mean  value  theorem 

•b 


rb  rx  rb 

f(x)  <j)(x)dx  =  A      <f>  (as)  dx  +  B  \   $  (x)  dx, 

a  J  a  J  x 

as  given  in  §  422,  holds  good,  when  <f>  (x)  has  an  HL-integral  in  (a,  b). 

ft  /"XA  f  * 

For  I  /(a?) <f>± (x) dx=A        <$>±(x)dx  +  B\     <f>± (x) dx, 

J  a  J  a  J  XA 

where  %&  is  some  point  in  (a,  b),  dependent  on  A. 

Now          /     <£A  (x)dx  —       <£  (x)  dx,          <£A  (x) dx—  \     <f> (a?)  dx, 

Ja  Ja  JXA  •  XA 

are  both  numerically  less  than  an  arbitrarily  chosen  positive  number  e,  pro- 
vided m  (A)  is  sufficiently  small;  this  follows  from  the  uniform  convergence  of 

rx 

<j>  (x)  dx\ 

'  a 

rb  rb 

Also  I  f(x)  ^>A  (x)  dx  differs  from  /  f($)<f>  (x)  dx  by  less  than  e,  if  m  (A) 

J  a  J  a 

be  sufficiently  small,    Hence  we  have 

Cb  /"XA  fb 

I  f  (x)  cf>  (x)  dx  =  A  I     <f>(x)dx  +  Bl     <f>(x}dx  +  i>), 

J  a  J  a  J  XA 

where   77    is  arbitrarily  small.    From  the  continuity  of 

/•XA  f6 

$  (x)  dx,          (j)  (x)  dx, 

J  a  J  x 

we  see  by  reasoning  similar  to  that  in  §  423,  that 

rb  rx  f& 

I  f(x)  (f)  (x)  dx  =  A  I    (f)  (x)  dx  +  B\    </>  (x)  dx, 

J  a  J  a  J  x 

Bonnet's  form  of  the  mean  value  theorem  may  be  deduced  as  in  §  423. 

INTEGRATION   BY   PARTS   FOR   THE   HARNACK-LEBESGUE   INTEGRAL. 

463.   The  formula  for  integration  by  parts,  given  in  §  420,  may  be  extended* 

P 
to  apply  to  the  case  in  which  one  of  the  integrals,  I    U  (x)  dx,  is  an  jETZ -integral. 


Let  un  (x)  —  I    Un  (x)  dx,  where  Un  (x)  —  U(x)  at  every  point  not  belonging 

J  a 

to  a  finite  set  of  intervals  A,  enclosing  the  points  H,  of  non-summability  of 
U  (x) ;  and  Un  (x)  =  0,  in  A. 

*  W.  H.  Young,  Proc.  Lond.  Math.  Soc.,  ser.  2,  vol.  ix,  p.  432. 
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Since  un  (x)  is  an  Z-integral,  we  have 

dun  j        fb        dv  , 


As  rj  ~  oo  ,  so  that  m  (A)  ~  0,  we  have  [wnv]a  ~  [uv]a. 

dv 
dx 


Also 


cb    dv  7      r6    dv  7         rb 

I    u  -j-  dx  —  I    un  -r-  dx    <  e  I 

J  a       CIX  J  a          dx  J  a 


dx, 


because,  for  sufficiently  large  values  of  n,  we  have    u  (x)  —  un  (x)  \  <  e,  whatever 
value  a?  may  have,  in  accordance  with  the  theorem  of  §453.    It  follows  that 

/b       fa  rb       fa 

un  -r  dx  =  \    u  ~r-  dx, 
l      dx          Ja      dx 

rb      ^u 
and  hence,  utilizing  the  above  equation,  we  see  that  Km  I    v  -^  dx  exists 

n~oo  J  a         && 

rb   dv 

and  is  equal  to  [uv]a  —I    u  -^  dx. 

J  a        aX 

Thus  the  equation 
f  V(x)  \  !X  U(x}dx\  dx=T  U(x)  dx  \b  V(x)dx  -  fV(«)  {  f  V(x}dx\  dx 

J  a  (J  a  j  J  a  J  a  J  a  [J  ) 

holds  good,  when  one  of  the  two  indefinite  integrals 

\XU(x)dx,     \XV(x)dx, 

J  a  J  a 

exists  only  as  an  .ffi-integral,  the  other  being  an  i-integral. 


THE    DENJOY   INTEGRAL. 

464.  In  the  definition  of  the  .HX-integral,  even  if  it  be  modified  in  the 
manner  proposed  by  Lebesgue  (§  458),  it  is  assumed  that  the  set  H,  of  points 
of  non-summability  of  the  function,  has  content  zero.  A  definition  has  been 
introduced*  by  Denjoy,  which  is  applicable  when  the  set  H  is  not  so  restricted, 
but  is  capable  of  being  any  non-dense  closed  set.  The  process  of  passing 
from  a  given  function  f(x),  defined  in  the  linear  interval  (a,  b),  to  a  function 
F  (x),  which  has  a  relation  tof(x)  similar  to  that  of  the  indefinite  Z-integral 
of  a  summable  function  to  the  function  itself,  has  been  termed  by  Denjoy 
totalization  off(x);  it  will  however  be  here  spoken  of  as  integration  (D\  and 
the  function  F  (x)  will  be  spoken  of  as  the  indefinite  D-integral  of  f(x), 

whenever  it  exists.    Thus,  in  any  interval  (a,  ft),  of  (a,  6),  the  D-integral  of 

r/3 
f(x)  will  be  denoted  by  I  f(x)dx,  or  by  F(@)-F(a),  or  also  by  F(«,  /3). 

J  a 

*  Comptes  Rendus,  Paris,  vol.  CLIV,  p.  859  and  p.  1075.  See  also  Lusin,  i6td.,vol.CLv,  p.474.  See 
also  a  thesis  by  Nalli,  Esposizione  e  confronto  critico  delle  diverse  definizioni  di  una  funzione 
limitata  o  no,  Palermo,  1914. 
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The  definition  of  the  Denjoy  Integral,  or  .D-integral,  in  the  most  general 
case,  can  only  be  given  by  indicating  the  steps  of  a  gradual  process  by  which, 
commencing  with  the  employment  of  .//-integrals,  a  function  is  obtained  which 
satisfies  certain  postulations  which  are  of  the  nature  of  definitions.  These 
may  be  stated  as  follows: 

Having  given  a  measurable  function /(#),  defined  for  the  interval  (a,  6); 

(1)  In  any  interval  (a,  ft),  contained  in  (a,  b),  in  which  f  (x)  is  integrable 

re 
(L),  F(a,  ft)  is  taken  to  be  the  L-integral     f(%)  dx. 

J  a 

(2)  For  a  finite  set  of  intervals  (a1}  a2),  (a2,  «3),  ...  (aw_l3  an),  each  one  of 
which  abuts  on  the  next,  and  for  which  V  (a1}  a2),  F(a2,  a3),  ...  F  (%_!,  an)  have 

r  =  n-\ 

been  defined,  F(a1}  an)  is  defined  as    S     V(ar,  ur+l). 

r  =  l 

(3)  If  (a,  ft)  be  any  interval  in  (a,  b),  and  F(a',  ft)  have  been  defined  for 
every  interval  (a',  ft')  interior  to  (a,  ft),  then  F(a,  ft)  is  taken  to  be  the  limit  of 
F(a',  ft'),  when  a'~a,  ft' **  ft,  independently  of  orte  another,  it  being  assumed 
that  this  limit  exists. 

(4)  Let  P  be  a  perfect  set  of  points  in  an  interval  (a,  ft)  contained  in  (a,  b),  and 
assume  that  f  (as)  is  sumrnable  over  the  set  P,  and  suppose  moreover  that  F(a',  ft') 
has  been  defined  for  every  interval  (a',  ft'),  of  (a,  ft),  which  contains  no  point  of 

f  ft  as  an  interior  point.  Let  (an,  ftn),  where  n  =  1,  2,  3, ...,  denote  the  intervals 
of  (a,  ft)  that  are  contiguous  to  P,  and  let  Wn  (a,  ft)  denote  the  upper  limit  of 
|F(a',  ft')\,for  all  intervals  (a',  ft')  contained  in  (a,,,,  j3n).  Let  it  be  assumed  that 

CO 

the  series    2  W  (ctn,  ftn)  is  convergent.    Then  V(a,  ft)  is  defined  by 

n-l 

F(a,  ft)=  I  F(an,  ftn)  +  f     f(x)dx. 

n-l  J  (P) 

It  is  clear  that  W  (an,  ftn)  is  the  fluctuation  of  F(an,  x)  in  the  interval 

(«n>  ftn)- 

It  will  be  shewn  that  the  D-integral  of  f(x\  in  (a,  b),  exists,  or  in  accord- 
ance with  the  expression  employed  by  Denjoy,  that/(#)  is  totalizable  in  (a,  b), 
provided  f(x)  satisfies  the  following  conditions: 

I.  For  every  perfect  set  P,  in  (a,  b),  the  set  of  those  points  of  P  which  are 
points  of  non-summability  of  f(x),  with  respect  to  P,  is  non-dense  in  P. 

In  particular,  if  P  consists  of  all  the  points  of  (a,  b),  the  set  H,  of  points  of 
non-summability  of  f(x),  with  respect  to  the  interval  (a,  b),  is  a  non-dense  set. 

II.  If  (a,  ft)  be  any  interval  in  (a,  b),  and  F(a',  ft')  have  been  calculated 
for  every  interval  (a,  ft'),  interior  to  (a,  ft),  then  F(a',  ft')  tends  to  a  definite 
limit,  as  a',  ft'  converge  independently  to  a,  ft  respectively. 
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When  this  condition  is  satisfied  V  (a,  fi)  has  been  defined,  in  (3)  above, 
to  be  the  value  of  this  limit;  thus 


rft  r/s' 

I  f(x)dx  —     lim     I     f(x)dx. 

J  a  0/^*0,  iS'^/i  J  o! 


III.  For  every  perfect  set  P,  if  V(an,fin)  have  been  calculated  for  every 
interval  («„,  fin),  contiguous  to  P,  the  set  of  points  of  P  which  are  not  points  of 

oo 

convergence  of  2  W  (an ,  fin)  is  non-dense  in  P. 

n=\ 

00 

The  series   2  W(an,/3n)  is  said  to  be  convergent  in  an  interval  (a',  fi') 

M  =  l 

if  the  series  consisting  of  those  terms  for  which  (a^,  fin)  is  interior  to  (a',  ft') 
is  convergent.  The  series  is  said  to  be  convergent  at  a  point  p  if  p  is  interior 
to  some  interval  (a',  fi')  in  which  the  series  is  convergent. 

If  every  point  of  a  perfect  set  P  is  such  as  to  be  a  point  of  convergence  of 
2  W  (an,  fin),  then  the  series  2  W(an,  fin),  taken  for  all  the  intervals  con- 

n=\  n=l 

tiguous  to  P,  is  convergent. 

For,  let  a  system  of  nets  be  fitted  on  to  the  interval  (a,  b),  in  which  P 
is  contained;  then  if  2  W (a.n>  fin)  is  divergent,  there  must  be  at  least  one 

M=l 

mesh  d-i,  of  the  net  Dl,  in  which    2  W  is  divergent.    Of  the  nets  of  D2,  that 

71=1 

are  contained  in  dlt  there  must  be  one  at  least  in  which    2  W  is  divergent; 

M=l 

let  dz  be  that  net,  or  that  of  lowest  rank  if  there  is  more  than  one.  Proceeding 
in  this  manner,  we  obtain  a  sequence  of  meshes  d1}d2,  ...,  defining  a  point  Q, 
in  all  of  them,  such  that,  in  each  of  these  meshes,  the  series  2  W  is  divergent. 

n=l 

This  point  Q  is  one  of  divergence  of  the  series,  and  it  must  clearly  belong  to 
the  set  P,  which  by  hypothesis  contains  no  such  point.    Therefore  the  series 
2  W(on,  fin),  taken  for  all  the  intervals  contiguous  to  P,  is  convergent. 

n  =  l 

It  is  clear  that  those  points  of  a  perfect  set  P  which  are  points  of  divergence 
of  ^W(an,  fin)  form  a  closed  set.  For  in  any  neighbourhood  of  a  limiting  point 
of  the  set  there  are  points  of  the  set ;  from  which  the  result  follows. 

465.  It  will  be  shewn  that,  when  the  conditions  given  in  §  464  are  satisfied, 
the  /)-integral  of  f(x)  can  be  calculated  by  means  of  an  enumerable  set  of 
Z-integrals,  of  passages  to  the  limit,  and  of  summations  iri  a  certain  order. 

Every  point  of  (a,  b)  which  does  not  belong  to  H,  the  set  of  points  of  non- 

summability  oif(x)  with  respect  to  the  interval  (a,  b),  is  interior  to  a  contiguous 

interval  of  H.   Since  f(x)  has  an  Z-integral  in  any  interval  (a,  fi'),  interior  to 

(a,  fi),  one  of  the  intervals  contiguous  to  H,  V  (a,  fi)  can  be  determined  as 

lim     V(a,  fi'),  in  accordance  with  the  condition  II,  of  §  464.   Thus  the 
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jD-integral  is  determined  for  each  interval  contiguous  to  H.  Now  H  is  the 
sum  of  a  perfect  set  P1}  the  nucleus  of  H,  and  an  enumerable  set  Ml  ;  we  pro- 
ceed to  calculate  F(a.  ft)  in  the  intervals  (or,  ft)  that  are  contiguous  to  P1. 

The  part  of  M1  in  such  an  interval  (a,  ft)  is  reducible,  and  therefore  has  a 
derivative  of  order  7,  some  number  of  the  first,  or  of  the  second,  class,  which 
contains  no  points,  whereas  the  derivatives  of  lower  order  than  7  all  exist.  If 
an  interval  («',  ft'),  in  (a,  ft),  contains  no  point  of  M1}  the  D-integral  in  (a',  ft') 
is  an  D-integral  ;  and  by  condition  II  the  integral  can  be  calculated  for  (a,  ft'), 
when  a,  ft'  are  points  of  J/u  whereas  no  interior  points  belong  to  M^.  If  (a,  ft') 
contains  a  finite  set  of  points  of  M1}  the  D-integral,  for  (a'  ft'),  can  then  be 
calculated  by  means  of  the  definition  (2)  of  §464.  Next,  let  (a1;  ft.±)  be  an 
interval  contiguous  to  the  first  derivative  MI,  of  Ml  ;  then  in  an  interval 
(«/,•  fti)  interior  to  (al,  ft^,  there  is  only  a  finite  set  of  points  of  M^,  hence,  as 
before,  the  D-integral  in  (^  ,  &)  can  be  calculated.  The  integrals  in  all  the 
intervals  contiguous  to  MI  having  thus  been  determined,  by  proceeding  as 
before,  the  integrals  in  all  the  intervals  contiguous  to  M-i'  may  be  determined. 
Generally,  let  7  be  a  finite  number,  or  a  transfinite  number  of  the  first  species. 
As  before,  if  the  D-integral  has  been  calculated  for  every  interval  contiguous 
to  MJ-i~^,  it  can  be  calculated  for  every  interval  contiguous  to  M^.  If  7  be 
a  transfinite  number  of  the  second  species,  let  us  suppose  that  the  .D-integral 
is  known  for  every  interval  contiguous  to  M-^"1,  for  all  values  of  7'  <  7.  In 
any  interval  (a/,  fty)  which  is  interior  to  an  interval  (ay,  fty)  contiguous  to 
MI(I\  from  and  after  some  value  of  7',  all  the  sets  M-fi"1  have  no  points  in 
(dy,  fty')',  hence  the  .D-integral  can  be  calculated  in  (ay,  fty'),  and  consequently, 
by  employing  the  definition  (3),  it  can  be  determined  in  (ay,fty).  It  has  thus 
been  shewn  that  the  D-integral  can  be  calculated  for  every  interval  (a,  ft), 
contiguous  to  M^,  where  7  is  any  number  of  the  first,  or  of  the  second,  class. 
Since  M^^O,  for  some  number  7,  it  follows  that  the  .D-integral  can  be 
calculated  for  any  interval  contiguous  to  P,,  by  means  of  an  enumerable 
set  of  additions,  and  of  passages  to  the  limit,  in  which  the  D-integrals  over 
intervals  in  which  f(x)  is  summable,  are  employed. 

Let  us  suppose  this  process  to  have  been  carried  out  for  each  interval  con- 
tiguous to  Pj,  the  nucleus  of  H.  In  case/(#)  is  summable  relative  to  P1}  and 
if  every  point  of  Pl  is  a  point  at  which  the  series  2,W(an,  ftn)  is  convergent, 

we  have 

rb 


rb  r  »     r/ 

I    f(x)  dx  =        f(x)  dx  +    S 

J  a  J(Pi)  n  =  l  Ja 

Ul 


When  P!  does  not  satisfy  these  conditions  we  proceed  to  analyse  the  set  Px. 
It  should  be  observed  that  the  D-integral  is  determined  over  any  interval 
(a,  ft)  such  that  p1}  the  part  of  P!  contained  within  it,  satisfies  the  conditions 
that  f(oc)  is  summable  over  plt  and  that  each  point  of  pl  is  a  point  at  which 
the  series  2  W  (an>  ftn)  is  convergent. 
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In  accordance  with  the  conditions  I  and  III,  assumed  to  be  satisfied,  the 
set  £T2,  of  points  of  Pl  which  are  either  points  of  non-summability  of  f(x)  with 
respect  to  Pl}  or  are  points  of  non -convergence  of  2W(an,  /3n),  is  non-dense 
in  Pl ;  moreover  the  set  Hz  is  closed.  Let  H2  =  P2  +  M2,  where  P2  is  a  perfect 
set,  the  nucleus  of  H2,  and  M2  is  an  enumerable  set.  It  will  be  shewn  that 
the  .D-integral  can  be  calculated,  first,  in  every  interval  having  no  interior 
points  that  belong  to  H2,  and  secondly,  for  every  interval  contiguous  to  P2. 

Consider  an  interval  8,  which  contains  within  it,  and  at  its  ends,  no  point 
of  Hz.  We  need  only  consider  the  case  when  8  contains,  within  it,  points  of 
P!.  The  interval  8  can  be  decomposedjnto  a  portion  plt  of  Pl}  and  a  set  of 
intervals,  all  contiguous  to  pl}  except  possibly  two  of  them  which  are  semi- 
contiguous  to  plt  The  function  f(x)  is  surnmable  in  plf  and  it  has  therefore 
an  Z-integral  over  plt  The  series  consisting  of  those  terms  of  STP(an,  /3n) 
that  refer  to  intervals  contained  in  8  is  convergent.  The  D-integral  over  8  is 
therefore  the  sum  of  the  _D-integrals  in  the  intervals  contiguous  and  semi- 
contiguous  to  plf  together  with  the  jC-integral  over  the  set  pl}  in  accordance 
with  definition  (4).  By  passing  to  the  limit,  the  _D-integral  is  obtained  over 
any  interval  which  has  no  point  of  Hz  in  its  interior,  but  of  which  one  extremity, 
or  both  extremities,  belong  to  H2.  \  By  means  of  an  enumerable  set  of  passages 
to  the  limit,  the  D-integral  is  thus  determined  over  any  interval  in  which  H2 
is  reducible,  as  in  the  former  case^  Thus  the  D-integral  is  determined  for  any 
interval  contiguous  to  P2. 

From  P2,  we  proceed,  as  before,  to  define  R3,  which  is  composed  of  a 
perfect  set  P3,  the  nucleus  of  H3,  and  an  enumerable  set  M3.  This  set  H3 
consists  of  the  points  of  non-summability  of  f(x)  with  respect  to  P2,  and  of 
points  of  divergence  of  2  W. 

In  general,  we  obtain  a  set  Hy  =  Py  +  My,  where  7  is  a  number  of  the  first, 
or  of  the  second,  class,  and  where  it  is  assumed  that  Hy  and  Py>  are  known 
for  all  numbers  7',  less  than  7.  In  case  7  is  of  the  first  species,  we  suppose 
the  D-integral  of  f(x)  to  have  been  already  determined  in  every  interval  con- 
tiguous to  P[y-i}\  the  process  of  determining  the  D-integral  over  every  interval 
contiguous  to  Py  is  the  same  as  in  the  special  case  7  =  2,  which  has  been 
considered  above.  If  7  is  a  number  of  the  second  species,  Py  is  the  set  of  points 
common  to  all  the  sets  Py,  where  7'  has  every  value  <  7.  If  none  of  the  sets 
Py  is  devoid  of  points,  the  set  Hy  certainly  exists,  but  Py  does  not  exist  in 
case  Hy  is  enumerable.  The  set  PY,  when  it  exists,  is  non-dense  in  each  of  the 
sets  Py. 

It  will  be  shewn  that  the  .D-integral  can  be  determined  over  every  interval 
contiguous  to  PY,  it  being  assumed  that  the  corresponding  determination  has 
been  made  for  all  the  sets  Py,  where  7'  <  7. 

First,  let  y  be  a  number  of  the  first  species;  and  let  8  be  an  interval  which 
contains,  as  interior  point,  or  as  an  end-point,  no  point  of  Hy.  If  P[y-^  be  the 
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portion  of  P[y_i]  in  8,  f(x)  is  summable  in  p[y-i\ ,  and  those  of  the  D-integrals 
which  are  taken  over  intervals  contiguous  to  p[y-^  form  an  absolutely  convergent 
series.  We  calculate  the  Z-integral  of  f(x)  over  p[y-i\ ,  and  add  to  it  the  sum 
of  the  D-integrals  over  the  intervals  that  are  contiguous  or  semi-contiguous 
to  p[y-i] ',  the  result  is  the  D-integral  over  8.  This  D-integral  is  now  determined 
over  every  interval  which  contains  within  it,  or  at  its  ends,  no  point  of  Hy. 
By  an  enumerable  set  of  passages  to  the  limit  the  D-integrals  are  now  de- 
termined in  every  interval  in  which  Hy  is  reducible,  that  is,  in  every  interval 
of  which  no  interior  point  belongs  to  Py . 

Next,  let  7  be  a  number  of  the  second  species,  and  let  8  have  the  same 
meaning  as  before.  The  sets  Pyj(y  <  7)  have  no  points  in  8,  and  therefore  the 
D-integral  can  be  calculated  over  8.  By  a  passage  to  the  limit,  the  D-integral 
can  be  determined  over  any  interval  which  has  points  of  Hy  only  at  one,  or 
both,  of  the  end-points.  Hence,  by  an  enumerable  set  of  additions  and  passages 
to  the  limit,  the  D-integral  is  calculated  over  every  interval  contiguous 

toPy. 

All  the  integrals  are  obtained  by  an  enumerable  set  of  operations  of  the 
types  laid  down  at  the  beginning  of  the  discussion.  Since  Pv  is  non-dense  in 
all  the  sets  Py,  where  7'  <j,  there  exists  a  number  7,  for  which  Py  does  not 
exist  (see  §  82).  Hence,  by  an  enumerable  set  of  operations,  the  D-integral 
of  f(x)  over  the  interval  (a,  b)  can  be  calculated,  since,  in  the  interval  (a,  b), 
there  are  no  points  of  Py. 

It  is  clear  that  the  D-integral  over  (a,  x)  can  be  determined  in  the  same 
manner,  where  x  is  any  point  in  (a,  b). 

466.  In  the  construction  of  the  D-integral,  given  in  §  465,  the  conditions 
I  and  III  have  been  employed  only  in  relation  to  the  particular  perfect  sets 
P1;  P2, ...,  whereas  the  function /(#),  when  integrable  (D)  in  (a,  b),  has  been 
required  to  be  such  that  these  conditions  are  satisfied  in  relation  to  every 
perfect  set  P.  It  will  however  be  shewn  that  the  D-integral  constructed 
by  the  method  indicated  in  §  465  actually  satisfies  the  conditions  I  and  III,  of 
§  464,  in  relation  to  any  perfect  set  P. 

In  the  first  place  it  may  be  observed  that,  if  f(x)  be  summable  in  a  sub- 
interval,  and  Q  be  any  closed  set  in  that  sub-interval,  there  are  no  points  of 
Q  which  are  points  of  non-summability  with  respect  to  Q,  and  also  no  points 

at  which  the  series,  of  which  the  terms  are  the  fluctuations  of  I    f(x)dx  in 

J  a 

the  intervals  contiguous  to  Q,  is  not  convergent.  If  P  be  any  perfect  set  in 
(a,  b),  let  K  be  the  set  of  those  points  of  P  such  that,  at  any  one  of  them, 
either /(a?)  is  not  summable  with  respect  to  P,  or  is  not  a  point  of  convergence 
'••  of  STTn,  taken  for  the  intervals  contiguous  to  P.  Unless  K  is  non-dense  in 
P,  there  is  some  interval  in  which  K  and  P  coincide.  /Let  P(1)  be  such  a 
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portion  of  P ;  then  P(1)  must  be  contained  in  H.  For,  otherwise,  a  part  of 
P(1)  would  be  contained  in  an  interval  interior  to  an  interval  contiguous  to 
H ;  and  this  is  not  possible,  since  f(x}  is  summable  in  such  an  interval. 
Since  the  perfect  set  .P1'  is  contained  in  H,  it  must  be  contained  in  Plt 
Similarly,  if  a  portion  of  P(1)  were  contained  in  an  interval  contiguous  to  H2, 
since  P(1)  is  contained  in  Pj ,  that  portion  of  P(1)  would  be  a  part  of  Pl  over 
which  f(x)  is  summable,  and  such  that  every  point  would  be  one  of  con- 
vergence of  S  W  over  the  intervals  contiguous  to  the  part ;  and  this  is  not 
possible.  Hence  P(l)  is  contained  in  Hz,  and  therefore  in  P2.  Proceeding  in 
this  manner,  it  can  be  shewn  that  P(1)  is  contained  in  all  the  sets  P3,  P4, ...  Pa. 
But  Pa,  for  some  value  of  a,  contains  no  points,  and  therefore  the  set  P(1) 
cannot  exist.  It  has  thus  been  shewn  that  K  is  non-dense  in  P.  Therefore 
the  conditions  I  and  III  are  satisfied  for  any  perfect  set  P. 

In  the  construction  given  in  §465,  the  conditions  in  (3)  and  II  have  only  been 
employed  when  the  limiting  interval  (a,  /S)  is  a  contiguous  interval  of  one  of  the 
sets  H,  H1}  FT2,  ...  Hy  .  But,  as  the  condition  is  to  be  applicable  to  any  interval 
(a,  /3),  it  is  necessary  to  shew  that  the  function  V,  constructed  by  the  process 
given  in  §  465,  actually  satisfies  this  condition  for  every  interval  (a,  /?),  before 
the  existence  of  the  D-integral  can  be  regarded  as  completely  established. 

P 
The  condition  amounts  to  a  postulation  that  the  integral      f  (as)  das,  or  F  (x), 


shall  be  continuous  in  the  whole  interval  (a,  b).  For  the  continuity  of  F  (x) 
follows  from  the  definition  (3),  and  the  corresponding  assumption  II.  If 
ac1,  os2,  ...  xn, ...  be  a  sequence  of  points  interior  to  (a,  x),  and  converging  to  x, 
Km  V(a,  asn)  =  V  (a,  x);  and  thus  F  (x)  is  continuous  on  the  left.  Similarly, 

by  considering  V  (as,  b),  it  may  be  shewn  to  be  continuous  on  the  right,  and 
therefore  V  (a,  x},  which,  by  definition  (1),  is  V  (a,  b)  —  V  (x,  b),  is  continuous 
on  the  right.  Thus  the  Z)-integral,  V  (a,  x),  when  it  exists,  must  be  con- 
tinuous in  the  interval  (a,  b). 

To  prove  that  the  function  constructed  in  §  465  satisfies  this  condition, 
we  observe,  in  the  first  place,  that  V  (a,  x)  is  certainly  continuous  at  any 
point  that  does  not  belong  to  H.  For  such  a  point  is  interior  to  an  interval 
in  which  f  (x)  is  summable;  hence  V  (a,  as  +  h)  —  V  (a,  as)  is  the  Z-integral 

fx+h 

I       f(x)  dx,  which  converges  to  zero,  as  h  ~  0.    Next,  let  G  be  a  perfect  set 

of  points,  contained  in  an  interval  (a,  ft),  and  such  that  f(ao)  is  summable 
over  G,  and  also  such  that  every  point  of  G  is  a  point  of  convergence  of  the 
'  series  2)  W,  when  the  summation  is  taken  for  the  intervals,  contained  in  (a,  ft), 
which  are  contiguous  to  G.    We  have  then, 


Pf  f/3"  ~\x+h 

=\        f(x)dx  +  ^\    f(x)dx\      , 

U  (G)  Jan  Jx 
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r  r  ~\x+h 

where  (an,  /3n)  is  contiguous  to  G.    As  h  -  0,  f(x)dx        converges  to 

\J(Q)  _U 

zero,  on  account  of  a  known  property  of  the  .L-integral.    In  case  x  and  x  +  h 

r     rfa  ~\x+h 

are  both  points  of  G,    X       f(x)dx        also  converges  to  zero,  as  h  does  so. 

L     J  a-n  Jx 

But  if  x  +  h  is  not  a  point  of  G,  it  is  within  some  interval  (am,  /3m)  contiguous 

/•/3m  fx+h  i 

to  G,  and  we  have  to  consider  a  part  I      f(x)dx.  We  have          f  (®) dss\*tWmt 

J X+h  J am  [ 

and  therefore,  in  virtue  of  the  hypothesis  made  above,  this  part  makes  no 

H  r   /•£»          ~\x+k 

difference  in  the  convergence  of    S  I     /  (x)  dx         to  zero,  when  convergence 

\_      J  an  _U 

takes  place  on  account  of  the  assumption  that  x  is  a  point  of  convergence  of 
2  W.  It  has  thus  been  shewn  that  F(a,  x  +  h)  —  V  (a,  x)  converges  to  zero, 
as  h  converges  to  zero  through  any  sequence  of  values ;  therefore  V  (a,  x)  is 
continuous  in  (a,  /3). 

If  G  be  identified  successively  with  the  perfect  sets  in  the  intervals  con- 
tiguous to  P1}  P2,  ...,  the  perfect  sets  employed  in  §  465,  in  the  construction 
of  V  (a,  x),  we  see  that  it  is  sufficient  for  the  continuity  of  that  integral 
that  the  condition  Km  Wn  =  0  should  be  satisfied  for  all  the  perfect  sets 

W~ao 

contained   in   the    intervals    contiguous  to   Pa,   P2, —     The    continuity  of 
f(x)dx  in  the  whole  interval  (a,  6)  is  thus  established. 


THE  FUNDAMENTAL  THEOREM  OF  THE  INTEGRAL  CALCULUS  FOR  THE 
DENJOY  INTEGRAL. 

467.  With  a  view  to  proving  the  fundamental  theorem  that  the  indefinite 
Denjoy  integral  has  a  finite  differential  coefficient,  equal  to  the  integrand, 
almost  everywhere  in  the  interval  of  integration,  a  property  of  certain  types 
of  sets  of  intervals  will  be  investigated. 

It  has  been  shewn,  in  §  71,  that  a  set  A  of  open  intervals  all  contained 
in  the  given  interval  (a,  b),  can  be  replaced  by  a  non-overlapping  set  of  open 
intervals  A,  the  two  sets  A,  A  containing  the  same  open  set  of  points. 

The  set  of  open  intervals  A  is  said  to  be  a*  restricted  set  of  intervals,  if 
no  interval  of  A  can  be  removed  without  altering  the  equivalent  set  A. 
This  is  the  same  as  the  condition  that  each  interval  of  A  contains  a  point 
that  is  not  in  any  of  the  other  intervals  of  A. 

It  is  easily  seen  that,  if  three  open  intervals  S1}  S2,  S3  are  such  that  a  point 
P  belongs  to  all  of  them,  one  at  least  of  the  intervals  is  such  that  every 
point  of  it  belongs  to  one  of  the  other  two.  Accordingly,  that  interval  may 
be  removed  without  affecting  the  equivalent  set  of  points. 

*  See  Denjoy,  Journal  de  Math.  (7),  vol.  i,  p.  223,  where  the  theory  here  given  is  developed. 
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It  follows  at  once  that,  if  A  be  a  finite  set  of  intervals,  and  is  a  restricted 
set,  any  point  P  is  contained  in  at  most  two  of  the  intervals  of  A.  For  such  a 
set  of  intervals,  the  sum  2w  (8),  of  the  measures  of  all  the  intervals,  cannot 
exceed  2»i(A),  or  twice  the  measure  of  the  equivalent  set  of  open  intervals. 

A  set  of  open  intervals  A  is  said  to  be  complete,  if  it  satisfies  the  condition 
that,  when  (a1}  /3j),  («2,  &), ...  (an,  ftn),  ...  is  any  sequence  of  intervals  all 
belonging  to  A,  and  such  that  a1}  «2, ...  «n,  ...  converges  to  a  point  aw,  and 
PI}  ft. 2,  ...  ftn,  ...  converges  to  a  point  /3W,  then  (au,  /3W)  is  an  interval  of  A. 
In  case  (aw,  ftu)  is  not  necessarily  an  interval  of  A,  but  is  contained  in  such 
an  interval  (a,  ft),  the  set  is  said  to  be  semi-complete. 

If  the  set  A  is  complete  (or  semi-complete),  a  part  A1?  of  A,  exists  which 
is  equivalent  to  the  same  non-overlapping  set  A,  as  A,  and  such  that  it  is  a 
restricted  set.  For,  considering  the  closed  set  C  (A),  the  complement  of  A, 
let  (a,  ft)  be  one  of  its  contiguous  intervals.  Any  point  P,  of  the  open 
interval  (a,  ft),  is  interior  to  an  interval  B,  of  A. 

If  there  exist  intervals  (a,  a'),  (ft,  ft)  belonging  to  A,  the  closed  interval 
(«',  ft)  can  be  covered  by  a  finite  set  of  intervals  of  A,  in  accordance  with  the 
Heine-Borel  theorem.  This  finite  set,  together  with  (a,  a!)  and  (ft',  ft),  forms 
a  part  of  A  which  covers  the  open  interval  (a,  ft),  and  this  set  can  then  be 
reduced  to  a  restricted  set,  by  suppressing,  if  necessary,  some  of  the  intervals. 

If  no  such  intervals  as  (a,  a'),  (ft,  ft)  exist,  we  choose  a  set  of  points 
(...  P-n,  .  ..P_!,  P0,  PI,  ...  Pn,  ...),  contained  in  (a,  ft),  such  that  a  is  the  limit- 
ing point  of  the  sequence  (. . .  P-n,  . . .  P_i,  P0),  and  that  ft  is  the  limiting  point 
of  the  sequence  (P0,  PI,  ...  Pn, ...)',  any  point  Pm+1  being  taken  to  be  on  the 
right  of  Pm ,  whether  m  be  positive  or  negative. 

If  the  points  ...  P_n, ...  P_x,  P0  are  contained  respectively  in  intervals 
...  S-n, ...  S_i,  80,  of  A,  the  measure  of  S_n  must  converge  to  zero,  as  n~<x> , 
for  otherwise  their  right-hand  end-points  would  have  a  limiting  point  a"(>«) 
and  (a,  a")  would  be  an  interval  of  A,  if  the  set  A  is  restricted,  or  would  be 
contained  in  an  interval  of  A,  in  case  the  set  is  semi-restricted;  and  this  is 
contrary  to  the  hypothesis  that  no  interval  of  A  has  its  left-hand  end-point 
at  a.  A  similar  statement  holds  good  as  regards  ft. 

Every  point  of  the  closed  interval  PmPm+]  ,where  m  is  positive  or  negative, 
is  a  point  of  a  finite  set  ATO,  of  intervals  of  A.  Let  S  denote  the  enumerable 
set  of  intervals  obtained  by  adding  the  sets  Am,  for  all  values  of  m.  It  will 
be  shewn  that,  if  (A,  B)  is  any  interval,  such  that  A  >  a,  B  <  ft,  the  interval 
(A,  B)  has  points  in  common  with  only  a  finite  part  of  the  set  2.  Let  S  (A,  B) 
denote  the  set  of  those  intervals  of  ^  which  contain  at  least  one  point  of 
(A,B).  Let  the  intervals  of  2  be  denoted  by  2,,  22,  ...  2t,...,  then,  if  S  (A,  B) 
is  not  a  finite  set,  it  contains  intervals  S  with  indices  which  increase  indefi- 
nitely. There  must  then  be  in  S  (A,  B)  intervals  of  Am,  for  values  of  m  that 
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increase  indefinitely,  and  are  all  positive,  or  else  are  all  negative.  In  the 
former  case  their  right-hand  end-points  will  converge  to  /3,  and  in.  the  latter 
case  their  left-hand  end-points  will  converge  to  a.  But  it  has  been  shewn 
that  the  measure  of  such  an  interval,  in  either  case,  converges  to  zero,  and 
thus  an  infinite  set  of  such  intervals  cannot  all  contain  points  of  (A,B).  It 
has  thus  been  shewn  that  the  set  2  (A,B)  is  a  finite  component  of  the  set  2. 

The  case  in  which  one  of  the  intervals  (a,  a'),  (/3',  /9)  exists,  but  not  the 
other,  can  be  reduced  to  the  preceding  cases. 

The  set  2  can  be  reduced  to  a  restricted  set  2,  by  suppressing  some  of  the 
intervals,  without  altering  the  set  of  interior  points.  To  prove  this,  let  tj  be 
the  smallest  value  of  i  such  that  2tl  contains  a  point  that  does  not  belong  to 
2,1+fc,  for  k  =1,2,3,.. ..  If  P!  is  any  point  within  (a,  /3),  it  is  interior  to  only  a 
finite  number  of  the  intervals  2t;  let  N!  be  the  greatest  index  such  that  Pj 
is  interior  to  2^5  we  see  then  that  ^  ^  -Ara.  The  interval  2tl  having  thus 
been  determined,  we  proceed  to  determine  t2  as  the  smallest  index  ( >  tj)  such 
that  it  contains  a  point  that  does  not  belong  either  to  2t2+fc ,  for  k  =  1 ,  2, 3, . . . , 
or  to  2tl.  If  we  take  a  point  P2,  not  in  2tl,  there  is  a  greatest  number  N2 
such  that  Pz  is  a  point  of  2^2;  thus  i2  ^  N3.  In  general  if  2tl,  2,2> ...  2,  have 
been  determined  2tp+1  is  determined  by  the  condition  that  ip+l  is  the  smallest 
integer  (>  ip)  such  that  2tp+1  contains  a  point  that  does  not  belong  to  2tpfl+^, 
for  A;  =  1,  2,  3,...,  and  that  does  not  belong  to  any  of  the  intervals  2tl,  2la,...  2t  . 
If  Pp+i  is  a  point  that  does  not  belong  to  any  of  the  intervals  2tl,  2l2, ...  2t  ; 
and  if  Np+l  is  the  greatest  of  the  indices  ip+1,  ip+z, ...  such  that  iyp+1  contains 
Pp+l,  we  have  then  ip+1  £  Np+1.  The  set  of  intervals  {2,  }  is  a  restricted  set, 
and  it  can  be  shewn  to  cover  the  whole  of  the  open  interval  (a,  /3).  For,  let 
Q  be  any  point  of  that  interval,  then  there  exists  a  finite  set  2Vl,  2Y2,  ...  2yOT, 
of  intervals  of  2,  all  of  which  contain  Q.  If  none  of  the  indices  yl}  ry?,...'ym-i 
belong  to  the  set  {iq},  it  will  be  shewn  that  %n  must  belong  to  that  set,  and 
thus  that  2Vra  is  one  of  the  intervals  of  the  restricted  set  2.  The  interval  2V 

•  Tfl  iffl 

contains  a  point  that  does  not  belong  to  any  of  the  intervals  2tl,  2,2,  . . .  2,  ,  nor 
to 2,  +fc,for  h—  1,  2,  3,. . . ;  and  therefore,  for  a  properly  chosen  value  of  p,  it  must 
have  the  least  index  for  which  these  conditions  are  satisfied,  and  thus  fym=ip+-L. 

If  we  consider  all  the  intervals  (a,  /9)  contiguous  to  G  (A),  we  obtain  a 
restricted  set  of  intervals  that  has  the  same  interior  points  as  the  given  set  A. 
Moreover,  a  restricted  set  of  intervals  which  covers  the  open  interval  (a,  b), 
has  only  a  finite  component  such  that  each  interval  of  that  component  con- 
tains a  point  of  a  given  interval  (A,  B),  interior  to  (a,  b).  It  has  been  shewn 
that  the  measure  of  (A,  B)  is  not  less  than  half  the  sum  of  the  lengths  of 
the  intervals  of  this  finite  set.  By  considering  a  sequence  of  intervals  (A,  B) 
converging  to  (a,  b),  we  see  that  the  sum  of  the  lengths  of  the  intervals  of 
a  restricted  set  which  covers  the  open  interval  (a,  b)  cannot  exceed  twice  the 
length  b  —  a. 
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468.  The  properties  of  a  restricted  set  of  intervals,  covering  an  open 
segment,  will  now  be  applied  to  prove  the  following  theorem: 

Let  P  be  a  perfect  set,  contained  in  the  linear  interval  (a,  b),  and  the  intervals 
«!,  u2i...  un,  ...be  contiguous  to  P,  and  let  %n(#)  be  defined  in  each  interval  un, 
or  (an,  bn),  and  such  that  %n  (an)  =  0.  Let  wn  (un)  denote  the  upper  boundary  of 

|  %n  (x)  |  in  (an,  bn),  and  let  it  be  assumed  that  the  series  2  w  (un)  is  convergent. 

n-i 

X 

Let  the  function  F  (%)='£  %n  (bn)  +  Xm  (#)  consist  of  those  terms  for  which  bn  £  x, 

a 

and,  in  case  x  is  interior  to  one  of  the  intervals,  (am,  bm),  of  the  term  %m  (x\for 
the  portion  (am,  x}  of  the  interval;  then  F(x)is  such  that  it  has  a  differential 
coefficient  equal  to  zero,  at  almost  all  points  of  P. 

Let  the  intervals  ply  p2>  .  .  .  px+i  be  complementary  to  the  set  of  N  intervals 

MI,  MS,  ...MJV. 

Let  /8  denote  any  segment  of  which  the  ends  are  points  of  P,  and  let  a  ($) 
denote  the  sum  of  %7l(6m)  |,  for  all  those  intervals  (an,  bn)  that  are  in  /3. 

Let  A,  N  be  two  positive  numbers,  and  let  S(N,  A)  be  an  interval,  such 
that  its  end-points  belong  to  P,  and  that  it  is  contained  in  one  of  the  intervals 
p1}pa,...pN+1,an.d  is  also  such  that  a  \$>(N,A}\  is  not  less  than^.-1m  {&(N,A}}. 

Let  A  (N,  A)  be  the  set  of  all  such  intervals  B(N,  A);  it  will  be  shewn 
that  this  set  of  intervals  is  a  complete  set.  If  (a.p,  fip)  be  an  interval  of  a 
sequence,  all  the  intervals  of  which  belong  to  &(N,A),  and  such  that  ap,  /3P 
converge  to  a,  /3  respectively,  as  p~oo  ;  it  will  be  seen  that  (a,  /3)  is  an  interval 
of  the  set  A  (N,A).  In  the  first  place,  since  ap,  @p  are  points  of  P,  for  every 
value  of  p,  it  follows  that  a,  /3  are  points  of  P.  Moreover,  from  and  after  some 
fixed  value  of  p,  all  the  intervals  (ap,^p)  must  be  in  one  and  the  same  interval 
pm;  hence  (a,  /8)  is  also  in  that  interval.  The  smallest  index  n,  of  an  interval  un 
contained  in  (a,  ap),  or  in  (/3,  /3P),  must  increase  indefinitely,  as  p  does  so,  thus 
a  (ap,  a),  a  (/3,  /3P)  converge  to  zero,  as  p  ~  oo  ;  hence 


=  lim  £--—  a(ap,  &)  £  ^  . 

P  p~<x>   Pp  —  Hp  -il 

It  has  thus  been  shewn  that  (a,  /3)  satisfies  all  the  necessary  conditions  that 
it  may  belong  to  A  (N,  A  ).  The  set  A  (N,  A)  being  a  complete  set  of  intervals, 
it  follows  that  it  can  be  replaced  by  a  restricted  set  of  intervals  &(N,  A), 
containing  the  same  interior  points.  The  points  interior  to  at  least  one 
interval  of  A  (N,  A),  that  is,  to  at  least  one  interval  of  &(N,A),  form  a  non- 
overlapping  set  i  (N,  A)  of  open  intervals.  Every  point  of  one  such  interval  i 
is  interior  to  at  least  one,  and  to  not  more  than  two,  intervals  of  &(N,A). 
Thus  the  end-points  of  the  intervals  of  A  (TV,  A),  that  are  interior  to  i,  divide 
it  into  segments  alternately  in  one,  and  in  two,  of  the  intervals  of  A  (i\r,  A); 
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let  A!  (i)  denote  the  first,  and  A2  (t)  the  second,  set  of  these  segments.  We 
have  then 

m  (t)  =  m  (A!  (t)}  +  m  (A^  (t)}, 

and  also  cr(t)  =  tr  (A^t)}  +  <r  {A2(t)j. 

Every  interval  8  ( JV,  A )  having  points  in  common  with  i,  and  consequently 
contained  in  t,  is  decomposed  into  one  interval  of  Aj  (/)  and  two  of  A2  (t)',  hence 
we  have 

crjA(0}  =  o-{A1(0}  +  2er{Aa(0}. 

Now  a  {A  (t)}  ^  A-*m  {A  (0}  ^  ^-'m  (i), 

and  <r  I  A!  (t)}  +  2o-  { A2  (t)}  <  2<r  (*) ; 

therefore  m(t)  <  2Ao-(t).    This  inequality  holds  for  each  interval  covered  by 

A(iV,  A),  and  these  intervals  t  are  interior  to  the  segments  plt  p2,  ...  px+1. 

The  intervals  un,  interior  to  an  interval  i,  all  have  indices  greater  than  W, 

00 

hence,  denoting  by  t$  the  sum  2  |  %n  (bn}  \ ,  the  total    measure  of  all    the 

N+l 

intervals  i,  covered  by  the  intervals  A  (N,  A),  is  <  2Aty.  This  measure 
diminishes  indefinitely  as  N  is  increased  indefinitely. 

Let  each  interval  un  be  increased  at  each  end  by  adding  on  a  segment  of 
length  Awn;  and  denote  the  interval  so  increased  by  un' (A). 

If  we  exclude  from  (a,  b)  all  the  segments  u1}  u2, ...  UN,  and  the  segments 
&(N,  A),  and  also  the  segments  u'N+p(A),  where  p  =  1,  2,  3,  ...,  the  measure 
of  these  excluded  segments  is  less  than 

N  00 

%Un  +  2AtN+    2    Wn'Ol), 
1  .ZV+1 

00  00 

or  than  "Zun  +  4<AsN,  where  s^=   2  wn. 

1  2V+1 


The  measure  of  P  being  6  —  a  —  2  wn,  the  measure  of  the  points  excluded 

i 

is  at  most  6  —  a  —  m  (P)  +  4iAsN-  If  N  is  so  large  that  4>AsN<m  (P),  the 
interval  (a,  6)  contains  points  that  are  not  excluded.  These  points  form  a 
set  E(N,A)  of  which  the  measure  is  at  least  m(P)  —  4iAs^',  and  this  set  is 
contained  in  P,  since  it  does  not  contain  any  points  of  un,  for  any  value  of  n. 
Moreover  E  (N,  A)  is  contained  in  E  (N  +  I,  A),  as  is  seen  by  observing  that 
every  interval  of  A  (N  +  1,  A)  is  an  interval  of  A  (N,  A).  If  E  (A)  denote  the 
outer  limiting  set  of  the  sequence  of  sets  {A  (.AT,  A)},  as  N  is  increased  indefi- 
nitely, we  see  that  E(A)  is  contained  in  P,  and  has  the  same  measure  as  P, 
since  4>As^  ~  0,  as  N ~  oo  . 

Let  £  be  any  point  of  E(A')\  it  then  belongs  to  E(N,  A}  for  a  certain 
value  of  N,  and  for  all  greater  values;  it  is  moreover  a  point  of  P  which  is  a 
limiting  point  of  P,  on  both  sides.  The  point  £  is  interior  to  a  segment  pL 
belonging  to  the  finite  set  p1}  p2,  .../o^v+i;  a  sequence  of  values  of  a;  converging 
to  £  may  be  taken  to  be  interior  to  pt. 
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If  all  the  points  x,  of  the  sequence,  are  points  of  P,  we  have 

X 

F  (x)  —  F  (q)    ^2     Vn  (bn)    ==  ""  (b »  ^0>    ^      ""  •""*  b> 

and  |  ^(a;)- ^(Dl^o- (a?,  |),  if«<f 

Since  £  is  in  the  set  E  (N,  A),  we  see  that  a-  (£,  #),  or  er  (#,  ^),  is  <  A'1  x-£\ ; 
1 


If  a;  be  a  point  of  pt  that  does  not  belong  to  P,  and  is  thus  interior  to  um, 
for  some  value  of  m,  greater  than  N  +  1,  we  have  \x  —  %\>  Awm,  since  a;  is 
not  in  the  closed  interval  u'm(A). 

If  #>  £,  we  have  F  (x)  -F(£)  =  %m  (x)  +  F  (am)  -F(%);  hence 


If  a;  <  £,  we  have 


—  # 


It  is  now  seen  that,  for  any  point  x,  in  pl} 


;  hence  none 
' 


of  the  four  derivatives  of  F(x\  at  |f,  can  exceed  3/A.  If  we  assign  to  A,  the 
values  in  an  increasing  sequence  that  diverges,  we  see  that  each  of  the 
sets  E  {A}  is  contained  in  the  preceding  set;  for  E(N,  A')  is  contained  in 
E(N,A),  if  A'  >  A.  Each  of  the  sets  E  (A)  has  the  measure  m(P),  and 
therefore  the  set  E  of  points  common  to  all  the  sets  E(A)  has  the  measure 
m(P).  At  a  point  £,  of  E,  the  four  derivatives  of  F(£)  are  all  less  than  3/A, 
for  all  the  values  of  A  in  the  sequence.  Therefore  F(%)  has  a  differential 
coefficient,  equal  to  zero,  at  every  point  of  E,  that  is,  at  almost  every  point 
of  P.  The  theorem  has  thus  been  established. 

469.  From  the  theorem  of  §  468,  the  following  theorem  can  be  deduced 
at  once: 

If  P  be  a  perfect  set,  and  {(an,  bn)}  be  the  set  of  contiguous  intervals,  then,  if 
the  function  V(a,x)  be  summable  over  P,  and  if  £TF(an,  bn)  be  convergent, 
the  function  V(a,x)  has  a  finite  differential  coefficient,  at  almost  all  points  of  P. 

The  term  W  (a,  /3)  denotes,  as  in  §464,  the  upper  limit  of  V(a,  /?')  ,  for 
all  intervals  (a,  /3')  contained  in  (a,  /3).  In  this  case 

rx  rln 

Xn  O)  =          f(x)  dor,     X?l  (&„)  =   /       /<>•)  dx. 


H. 
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4    Moreover  we  have 

w(an,  bn)  £  W(an,bn)^2w(an,  6n); 

00  00 

and  hence,  if  S   Wn(an,  /3n)  is  convergent,  so  also  is  S  wn(an,l3n)>  and  the 

w=l  n=l 

converse  also  holds  good. 
We  have 

V  (a,  x)  =  f  */»  ^  +  2  FB  (on,  £„)  +  F  (aw, <c), 

•/  a  a 

where  the  summation  is  taken  for  all  the  intervals  (an,  /37l)  interior  to  (a,  a;), 
and  the  term  V(am,x)  exists  if  x  is  interior  to  (am,  8m),  and  the  function 
f  (x)  is  equal  to  /(#)  at  every  point  of  P,  but  is  elsewhere  zero.  Since  f(x) 
is  summable  in  (a,  b),  it  has,  almost  everywhere,  a  differential  coefficient,  equal 

* 
to  /(#);  and  by  the  theorem  of  §  468,  the  function  S  Fn(«n,  /3n)  +  F(om,  #) 

a 

has  a  differential  coefficient  equal  to  zero,  at  almost  all  points  of  P.  It  follows 
that  the  function  V(a,x)  has  a  differential  coefficient,  equal  to/(#),  at  almost 
all  points  of  P. 

470.  The  theorem  of  §  469  may  now  be  employed  to  establish  the  follow- 
ing property  of  the  D-integral: 

The  indefinite  D-integral  F  (x},  of  a  f  unction  f  (x},  in  the  interval  (a,  b), 
has  almost  everywhere  in  the  interval  a  differential  coefficient,  equal  to  f(x). 

This  is  the  extension  to  the  D-integral,  of  the  property  of  the  Z-integral, 
given  in  §  405. 

With  the  notation  of  §  465,  it  is  clear  that,  in  any  interval  contiguous  to 
H,  F  (x)  has  almost  everywhere  a  differential  coefficient,  equal  to  f(x).  For 
f(x)  has  an  Z-integral  in  every  interval  («',  /3'),  interior  to  an  interval  (a,  ft}, 
contiguous  to  H\  and  therefore  (§405)  F(x}  has  a  differential  coefficient, 
equal  to  f(x),  almost  everywhere  in  (a',  /3'),  and  therefore  almost  everywhere 
in  (a,  /3).  It  can  now  be  shewn  that  F  (x)  has  this  property  in  any  interval 
contiguous  to  Pl.  For  it  holds  in  any  interval  contiguous  to  MI,  and  there- 
fore also  in  any  interval  contiguous  to  M^".  Generally,  it  can  be  shewn  that 
F  (x)  has  the  property  in  every  interval  contiguous  to  any  derivative  M^, 
of  MT.;  and  thence  it  is  shewn  that  F  (x)  has  the  property  in  any  interval  con- 
tiguous to  Pj. 

It  will  next  be  shewn  that,  in  any  interval  contiguous  to  P2,  the  function 
F  (x)  has  almost  everywhere  a  differential  coefficient  equal  to  /(#).  For,  con- 
sider an  interval  B,  which  contains  no  points  of  H2,  but  contains  a  portion  p1} 
of  Pj.  The  property  holds  in  each  interval  contained  in  8,  contiguous  to^j, 
and  by  the  theorem  of  §  469,  it  holds  for  the  set  p^\  and  consequently  it 
holds  for  8.  It  can  now  be  shewn,  as  in  the  case  of  P1}  that  the  theorem 
holds  for  every  interval  contiguous  to  P2.  Proceeding  in  this  manner,  as  in 
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§  465,  we  see  by  induction,  that,  in  virtue  of  the  theorem  of  §  469,  the  theorem 
holds  for  any  interval  contiguous  to  Py,  where  7  is  a  number  of  the  first, 
or  of  the  second,  class.  Since,  for  some  value  7,  of  7,  the  set  P-  is  non- 
existent, it  follows  that  the  function  F(x}  has  a  differential  coefficient,  equal 
),  almost  everywhere  in  (a,  b). 


471.  The  importance  of  the  D-integral  in  the  general  theory  of  integra- 
tion depends  largely  upon  the  property  of  the  integral  expressed  in  the 
following  theorem  : 

If  F  (x)  have,  at  each  point  of  the  interval  (a,  b),  a  finite  differential  coeffi- 
cient f(x),  the  function  f  (x)  is  integrable  (D),  in  the  interval  (a,  b),  and 


F  f(x)dx  =  F(x)-F(a). 

J  a 


This  theorem  expresses  the  fact  that  the  operations  of  differentiation  and 
integration  are  completely  reversible  in  the  case  of  any  function  which  has, 
everywhere  in  the  interval  in  which  it  is  defined,  a  finite  differential  coefficient, 
whether  that  differential  coefficient  is  bounded  or  unbounded  in  the  interval. 
The  property  (B),  in  the  statement  in  §  343,  of  the  fundamental  theorem  of 
the  differential  calculus,  is  covered,  in  a  wide  class  of  cases,  by  the  above 
theorem.  It  will  be  observed  that  the  corresponding  theorem  for  the  X-integral 
is  subject  to  the  condition  that  F(x)  must  be  of  bounded  variation,  as  other- 
wise f(x)  will  not  be  summable  (see  §  405),  whereas,  in  the  above  theorem, 
the  function  F  (x)  is  subject  to  no  condition  beyond  that  of  possessing  every- 
where a  finite  differential  coefficient. 

In  order  to  prove  the  theorem,  it  is  necessary  to  shew  that  the  function 
f(x)  satisfies,  in  relation  to  F  (x},  the  conditions  I,  II,  and  III,  of  §  464. 

(I)  To  shew  that  the  set  of  points  of  non-summability  of  f(x),  with 
respect  to  any  perfect  set  P,  contained  in  (a,  b),  is  non-dense  in  P,  we  observe 
(see  §401)  that/(#)  is  the  limit  of  a  sequence  {^«(#)}  of  continuous  functions. 
For,  if  -tyn(x}  denote  {F (x  -f  hn)  —  F(x)}/hn,  where  \hn]  is  a  sequence  of 
numbers  which  converges  to  zero,  and  is  independent  of  x,  these  functions 
•\lrn  (x)  are  all  pontinuous  functions  of  x. 

The  function  tyn  (x)  may  be  expressed  by  ^  (x,  y\  a  function  of  the  two 
variables  x,  y,  where  y  =  l/?i.  The  function  ^  (x,  y}  is  in  the  first  instance 
defined  only  for  values  of  y,  of  the  form  1/n,  but  it  may  be  extended  to  the 
case  in  which  y  has  all  values  in  the  interval  0  <  y  ^  1,  by  such  a  rule  as  that, 

/I        1 

when  y  is  in  the  interval    — — - , 

\n  +  I 

1 


%  te  y)  =  x  (x>  - ) 
\    "/ 


y  (  /      i  \      /    i 


1  _      1_  C  \  '  n+  I)      *\  '  n, 
n      ?i  +  l 

41—2 
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The  function  ^  (x,  y),  so  defined  for  the  plane  set  of  points  a  &  x  ^  b, 
0  <  y  ^  1,  is  continuous  with  respect  to  y,  for  each  value  of  x.  In  accordance 
with  the  theorem  of  §  323,  there  must  be,  in  every  interval  on  the  line  y  =  0, 
points  at  which  ^  (x,  y)  is  continuous  with  respect  to  (x,  y),  and  therefore 
with  respect  to  x.  Therefore  f(x),  which  is  lim  ^  {x,  y\  is  point- wise  dis- 

2/~0 

continuous ;  and  this  is  also  the  case  if  only  those  values  of  x  are  considered 
which  belong  to  a  perfect  set  P.  It  follows  that  the  points  of  infinite  dis- 
continuity of  f(x),  with  respect  to  P,  form  a  set  that  is  non-dense  in  P. 
This  set  contains  the  set  of  points  of  non-summability  of  f(x}  with  respect 
to  P.  Therefore  f(x)  satisfies  the  condition  I,  of  §  464. 

(II)  If  (a,  /3)  is  an  interval  in  which  f(x)  is  summable,  we  have,  in 

r/3 
accordance  with  the  theorem  of  §  406,  F(P)  —  F(cL)  =  I    f(x)dx.    Let  it  be 

J  a 

now  assumed  that  (a,  /?)  is  an  interval  such  that  the  relation 

F(P)-F(«)=\*  f(x)dx 

J  a.' 

has  been  shewn  to  hold,  at  some  stage  of  the  process  of  construction  of  the 
D-integral  of  f(x},  for  all  intervals  (of,  /3')  interior  to  (a,  (3).  If  a'  ~  a,  /3'  ~  /:?, 
F(a)  converges  to  F(a),  and  F(/3')  converges  to  F(/3),  on  account  of  the 

f3 
continuity  of  the   function   F(x}.     It   follows   that,       f(x)dx,  defined  as 

J  a 
[? 

lim       /    f(x}dx,  is  equal  to  F(@)  —  F(a).    Therefore  the  condition  II,  of 

<>•  ~  ai  ft  ~  ft  J    a 

§  464,  is  satisfied. 

(III)  Let  it  be  assumed  that,  in  each  interval  (a,  /3)  contiguous  to  a  perfect 
set  P,  the  D-integral  of  f(x]  has  been  shewn  to  exist,  and  to  be  equal  to 
F  (/3)  —  F  (a).    The  following  theorem  will  be  proved : 

If  the  continuous  function  F  (x}  have,  in  the  perfect  set  P,  a  finite  dif- 
ferential coefficient  at  every  point,  then  the  set  of  those  points  of  P  which  are 
points  of  divergence  of  the  series  STFn,  where  Wn  is  the  fluctuation  of  F  (x)  in 
the  interval  (an,  @n)  contiguous  to  P,  is  non-dense  in  P. 

In  case  the  set  of  points  of  divergence  of  the  series  %Wn  is  not  non- 
dense  in  P,  there  must  be  an  interval  containing  a  portion  of  P,  in  which 
the  set  is  everywhere  dense  in  that  portion  of  P.  Let  it  therefore  be  assumed 
that  the  set  of  points  of  divergence  of  %Wn  is  everywhere  dense  in  P.  It 
will  then  be  proved  that  there  exist  points  of  P  aiTwhich  one  of  the  derivatives 
of  F  (x)  is  infinite ;  and  this  is  inconsistent  with  the  assumption  that  F  (x} 
has  a  finite  differential  coefficient  at  every  point  of  P.  The  theorem  will 
then  have  been  established. 

In  every  interval  containing  a  point  of  P,  the  series  S  Wn  is  divergent 
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it  then  follows  that  in  this  interval  there  is  an  infinite  number  of  values  of 
n  for  which  Wn/(j3n  —  «n)  exceeds  a  fixed  positive  number  k  ;  for  if 

Wn<k(8n-«n), 

00 

for  all  values  of  n  with  the  exception  of  a  finite  number,  the  sum  2TFn,  for 

m 

some  fixed  value  of  m,  would  be  ^  &2  (@n  —  «n),  and  thus  the  series  would  be 
convergent. 

It  £,  77  be  the  points  of  («„,  /3n)  at  which  F  (x)  has  its  upper  and  lower 
limits,  we  have 

F(&-F  fo)  =  {F  (|)  -  F  (an)}  +  {F  («w)  -  F  (T,)}  ; 

and  since  both  the  terms  on  the  right-hand  side  are  =  0,  one  of  them  is 
greater  than  %  {F(^)  —  F  (?))}.    There  exists  therefore  a  point  yn,  in  (an,  /8n), 

such  that    -- ^^  is  greater  than  ^  — 1 —  •    It  thus  appears  that 

7n  ~  «n  2         &n  ~  *n 

there  exists  a  set  of  points  c1}  c2,  ...  cm,  ...  all  belonging  to  P,  and  dense  in 
P,  such  that  there  is  a  point  ym  on  the  right  of  each  point  cm,  for  which 

>  X,  where   X   is   an   arbitrarily   chosen   positive    number. 


7m  —  Cm 

From  the  continuity  of  F(x)  at  cm,  an  interval  <rm  containing  cm,  of  length 
not  exceeding  em,  and  not  containing  the  point  ym,  can  be  so  determined 

F(ym)-F(cm)\ 


that 


lies  between  the  two  numbers 


»V         f 

im       ^m  I 

for  all  points  x  in  crm.  The  number  em  is  taken  to  belong  to  a  sequence  {em}, 
of  decreasing  numbers,  which  converges  to  zero.  Any  point  £,  contained  in 
an  infinite  number  of  the  intervals  <rm,  belongs  to  P.  For  the  length  of  crm, 
not  exceeding  em,  which  contains  the  point  cm,  of  P,  the  point  £  is  a  limiting 
point  of  P,  and  therefore  belongs  to  the  set.  Also  ym  —  £  does  not  exceed 
therefore  ym  converges  to  £,  as  m  increases  indefinitely; 


it  follows  that 


,  as  m  ~  oo  ,  has  its  lower  limit  >  A, ;  and  thus, 


at  the  point  £,  there  is  a  derivative  numerically  >  X. 

It  has  now  been  shewn  that  all  the  points  £,  interior  to  an  infinite  number 
of  the  intervals  cm,  form  a  residual  set  L  relative  to  the  perfect  set  P.  At  every 
such  point  £  there  is  a  derivative  numerically  greater  than  k. 

Let  X  have  successively  the  values  in  an  increasing  sequence  {Xn}  which 
diverges,  and  let  [Ln]  be  the  corresponding  sequence  of  sets  L,  residuals  with 
respect  to  P.  A  point  that  does  not  belong  to  any  of  the  sets  Ln  belongs  to 
the  sets,  of  the  first  category  relative  to  P,  that  are  complementary  to  the 
sets  Ln,  and  all  these  complementary  sets  form  an  enumerable  sequence  of 
sets,  non-dense  in  P;  therefore  the  points  that  belong  to  all  the  sets  [Ln] 
form  a  residual  set,  relatively  to  P.  At  a  point  of  this  residual  set,  there  is  a 
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derivative  that  is  numerically  greater  than  \n,  whatever  value  n  may  have, 
and  thus  there  is  an  infinite  derivative.  It  has  now  been  shewn  that  the 
assumption  made  is  inconsistent  with  the  existence  of  a  finite  differential 
coefficient  at  each  point  of  P.  Thus  it  has  been  shewn  that  the  condition  III, 
of  §  464,  is  satisfied. 

It  has  been  shewn  that  the  conditions  I,  II,  III,  of  §  464,  are  all  satisfied 
by  /  (x),  and  it  only  remains  to  be  proved  that  the  condition  (4)  of  §  464  is 
satisfied.  We  shall  prove  the  following  theorem  : 

If  F  (x)  have  a  finite  differential  coefficient  f  (x\  at  every  point  of  a  perfect 

00 

set  P,  which  is  summable  over  P,  and  the  series  2  W(an,bn)  is  convergent, 

n=l 

where  W  (an,  bn)  denotes  the  fluctuation  of  F  (x)  in  the  interval  (an,  bn)  con- 
tiguous to  P,  then 


F(b)-F(a)=        f(x}dx+       {F(bn)-F(an)}> 
J  (P)  »-i 

where  (a,  6)  is  the  interval  in  which  P  is  contained. 
In  the  theorem  of  §  468,  let 

Xn(x)  =  F(x}-F(an\ 
then  the  function 

g  (x)  =  F(a)  +  i  [F(bn)  -  F(an)}  +  {F  (x}  -  F  (am)} 

a 

has,  at  almost  all  points  of  P,  the  differential  coefficient  zero.  The  summation 
in  the  second  term  on  the  right-hand  side  is  taken  for  all  the  intervals  (an,  bn), 
interior  to  (a,  x},  and  x  is  taken  to  be  in  the  interval  (am,  bm). 

Let  F(x)  =  g(x)  +  h(x),  then  h(x}  has  a  differential  coefficient,  equal  to 
f(x),  at  almost  all  points  of  P. 

Thus,  if/(#)  is  summable  over  P,  we  have 


f(x}dx, 

where  f(%)=f(x)  a^  all  points  of  P,  and  is  elsewhere  equal  to  zero. 
It  now  follows  that 

F(b)-F(a)=f    f(x}dx+  1  {F(bn)-F(an}}. 

J  (P)  n  =  l 

It  has  thus  been  shewn  that  all  the  conditions  necessary  for  the  existence 

of  the  D-integral  in  (a,  b)  are  satisfied,  and  that  the  integral  is  equal  to 

rx 

F  (6)  —  F  (a).   It  is  clear  that,  in  the  interval  (a,  x},  the  jD-integral  I    /  (x)  dx 

J  a 

exists,  and  is  equal  to  F  (x)  —  F  (a). 
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PROPERTIES  OF  THE  DENJOY  INTEGRAL. 

472.  A  D-integral  that  is  not  also  an  Z-integral  cannot  be  an  absolutely 
convergent  integral,  that  is,  the  integral  of  the  absolute  value  of  the  function 
cannot  exist.  For  if  a  and  ft  are  two  points  of  non-summability  of  the  function 
f(x),  and  the  Z-integral  of  f(x)  exists  in  every  interval  (a',  ft')  interior  to 

Fff 

(a,  ft),  the  limit  of      |/(#)  |  dx,  as  «'  ~  a,  ft'  ~  ft,  is  infinite. 

J  a' 

Iffw  (x),f(2)  (x)  be  two  functions,  both  integrable  (D),  in  the  interval  (a,  b), 
then  their  sum  is  also  integrable  (D),  in  the  interval,  and 

f  {/(1)  (*)  +/(2)  0*0}  dx  =  f /«  (*)  dx+  fV(2)  (*)dx. 

J  a  J  a  J  a 


Let  JjT(1),  H®  be  the  sets  of  points  of  non-summability  of  the  two  functions 
/(1)  (x),  /(2>  (x),  respectively.  The  points  of  non-summability  of /(1)  (x)  +/(2)  (x) 
form  the  closed  set  M(HW,  H(2)),  which  may  be  denoted  by  H ;  and  this  is  the 
sum  of  a  perfect  set  Plt  and  an  enumerable  set  Ml.  It  has  been  shewn  in 
§  466,  that  the  set  of  points  of  non-summability  of  either  of  the  functions 
/(1>  (^)>/(2>  (x)>  assumed  to  be  integrable  (D),  with  respect  to  any  perfect  set 
whatever,  contained  in  (a,  b),  is  non-dense  in  that  set,  and  that  the  same  is  true 
as  regards  the  points  of  non-convergence  of  the  sum  of  the  fluctuations  of 
their  integrals  in  the  intervals  contiguous  to  the  perfect  set.  We  may  there- 
fore, in  the  process  of  construction  of  each  of  the  integrals 

[  /w  (x)  dx,    I  /(2)  0)  dx, 

J  a  J  a 

employ  the  set  H,  instead  of  HU,  or  JT<2>.  We  shall  have  Pl  =  M(P1w,P1w), 
and  in  general  Pa  =  M(Pa,(l\  Pa(2)).  Moreover  if,  at  any  stage  of  the  process,  it 
has  been  shewn  that  the  theorem  holds  for  every  interval  («',  j3')  interior  to 
the  interval  (a,  /3),  we  see  that,  since  the  integrals  of  /w  (x),  /(2)  (x)  in  (a',  /3') 
converge  to  their  integrals  in  (a,  /3),  as  a  ~  a,  /3f  ~  /3,  the  theorem  also  holds 
for  the  interval  (a,  /3).  The  same  theorem  therefore  holds  for  the  integrals 
obtained  at  each  stage  of  the  process  of  construction  of  the  J[)-integral  of 
f(1)  (.»)  -f/(2)(#),  and  therefore  for  the  final  integrals  taken  over  the  interval  (a,  b). 

473.    In  the  case  in  which  the  set  H,  of  points  of  non-summability  of 
the  function  f(x),  has  measure  zero,  the  function  is  necessarily  summable  over 

r 

the  nucleus  P,  of  H,  because  m  (P)  =  0,  and  thus  /     f(x}  dx  =  0.    In  this  case, 

J(P) 

the  D-integral  over  (a,  6),  assumed  to  exist,  is  the  sum  of  the  Z)-integrals 
over  the  intervals  contiguous  to  P.  In  order  that  the  D-integral  may  exist,  it 
is  sufficient  that  the  sum  of  the  fluctuations  of  the  integrals  over  the  intervals 
contiguous  to  P  should  be  convergent.  When  this  condition  is  satisfied,  so 
that  the  Z)-integral  of  f(x),  over  (a,  6),  exists,  it  is  not  necessarily  the  case 
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that/(#)  has  an  HL-integral  over  (a,  6),  because  the  condition  for  the  exist- 
ence of  the  latter  is,  in  accordance  with  E.  H.  Moore's  theorem,  given  in 
§  457,  that  the  sum  of  the  fluctuations  of  the  integrals  of  f(x)  taken  over  the 
intervals  contiguous  to  the  set  H  should  be  convergent;  and  this  condition 
is  more  stringent  than  the  condition  that  the  sum  of  the  fluctuations  over 
the  intervals  contiguous  to  the  nucleus  P,  of  H,  should  be  convergent.  This 
explains  why  the  sum  of  two  functions,  both  of  which  have  ./LL-integrals, 
does  not  necessarily  possess  an  .flX-integral  (see  §  460),  although,  as  has 
been  shewn  above,  the  sum  of  two  functions,  each  of  which  is  integrable  (D), 
is  also  integrable  (D). 

474.  The  following  property  of  a  function  that  is  integrable  (D)  will  now 
be  established: 

If  f(x)  be  integrable  (D),  in  the  interval  (a,  b),  and  <j>(x)  be  bounded  and 
monotone  in  the  interval,  then  the  product  f  (x)  <f>  (x)  is  integrable  (D),  in  (a,  b), 
and  satisfies  the  relation 

-  I  F(ss)d4>(oD), 


rx 

where  F  (x)  denotes  the  indefinite  integral  I  f(x)dx. 

J  a 

It  follows  from  this  theorem  that  the  product  of/(#)  into  any  function  of 
bounded  variation  in  (a,  b)  is  integrable  (D). 

To  prove  the  theorem  it  is  necessary  to  shew  that  the  function  f(x)  <£  (x) 
satisfies  the  conditions  given  in  §  464,  in  relation  to  the  function 

\F(x)<t>(x}\-  !XF(x)d<f>(x). 

L  J  a        .'  « 

If,  in  the  interval  (a,  /3),  the  function  f(x)  is  summable,  we  have  (§  447) 


/  (x)  (j>(x)dx=\    </>  (x)  dF  O), 

i  .'a 

rs  r  ~\&       t'& 

and  therefore        /  (x)  <j>  (x)  dx  =  \F  (x)  <j>  (x)  \    -      F  (x)  d<f>  (x) ; 

J  a  L  Ja         '/a 

thus  the  theorem  holds  in  the  interval  (a,  /3). 

Next,  if  the  theorem  holds  in  every  interval  (a',  /3'),  interior  to  the  interval 
(a,  ft),  it  will  be  shewn  to  hold  for  (a,  ft). 

We  have  to  shew  that,  as  «'  ~~  a,  ft'  ~  ft, 


r          i13'   p' 

\/*r\/l          f 

L  Ja         J  a 

r  n*3    f3 

converges  to  -^  («)£(#)      "      F(x)d$(x). 

Ja         •/  a 
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fa' 
Since  0  (x)  is  monotone,  it  is  clear  that  I    F  (x)  d<f>  (x)  is  equal  to 

J  a 

F(a")fa'd<f>(x), 

J  a. 

where  a"  is  some  number  in  the  interval  (a,  a'),  and  this  is  equal  to 


therefore  lim  f  JP  (a;)  (20  (a?)  =  F(a)  {0  (a  +  0)  -  0  (a)}. 

a'~a  J  a 

Similarly,  we  have 

lim  I  V<»  (20  (a?)  =  ^(/S)  {0  08)  -  0  (0  -  0)}  ; 
P~P  •>  P 

rft'  rp 

and  therefore  lim      I    F(x)d(j>(x)  =  \   F(x)d$(x) 

<x'~a,  |3'~j8  J  a'  J  a. 

-F(£)  (003)  -009-0)} 


Since 
lim 


the  result  follows. 

ps 
Hence,  as      f  (x)  0  (*•)  c?«  is  defined  to  be 

J  a. 


rff> 

lim     I   f(x)$(x)dx, 

o'~a,  /3'~|3  J  a' 


we  see  that  the  theorem  holds  for  the  interval  (a,  /3). 

A  point  in  the  neighbourhood  of  which  f(x)  is  summable  is  also  a  point 
in  the  neighbourhood  of  which  f(x)  <f>(x)  is  summable;  thus  the  set  H  is  the 
same  for  the  function/  (x)  0(#),  as  for  f(x). 

We  have  now  to  shew  that  the  condition  III,  of  |  464,  is  satisfied  for  the 
function  f(x)  0  (x). 

If  (an>  fin)  be  an  interval  contiguous  to  the  perfect  set  P,  in  which  the 
integral  off(x)  0  (x)  has  already  been  constructed,  we  have 

f  *  f(x)  0K*)  dx  =  \F  (x)  0  (x)  T  -  {X  F(x}  d<f>  (x). 

•   »-n  L  J  an         J  «n 

If  £,  t]  are  the  points  of  the  interval  at  which  F(x)  attains  its  upper  and 
lower  boundaries,  we  have  to  estimate  W  (an,  fin),  the  fluctuation  of 

f(x)  0  (x)  dx 

r  i* 

in  the  interval.    The  fluctuation  of    F  (x)  0  (x)       is  not  greater  than 

Jo-n 


-F(an)}  0  (fin)  -  {F(v)-F(an)}  0  («„); 
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it  being  assumed  that  (f>(x)  is  a  non-diminishing  function.    Since 

F(an)>0,    F(rj)-F(an)<0, 


the  fluctuation  is  not  greater  than  {F(^)-F(r})}  $(/3n),  or  than  <£  (b)  W(an,  £„), 
where  W(an,(3n)  is  the  fluctuation  of  F  (x). 


Since  I    F(x)  d<j>  (x)  lies  between  F(^)  {<£  (/3W)  —  </>  («„)}  and 


the  fluctuation  of  the  integral  does  not  exceed  {<£  (/3n)  —  <£  (aw)}  TF(an,/3M),  or 
is  less  than  {<£  (6)  —  (f>(a)}  W(an,  (3n)-    Thus  W'(an,0n)  is  less  than  a  fixed 

00 

multiple  of  TT(an,  @n).    If  then  2   W(an,  /8n)  is  convergent,  so  also  is 


It  now  follows  that  a  point  of  the  perfect  set  P,  at  which  S  W  («n,  /3W)  is 

»-i 

00 

convergent,  is  also  a  point  of  convergence  of  2  W  (an,  /9n).     For  the  result 

ra  =  l 

obtained  above  may  be  applied  to  the  part  of  P  contained  in  a  neighbour- 
hood of  the  point,  so  chosen  that,  in  it,  S  W(an,  ftn)  is  convergent. 

It  has  thus  been  shewn  that  the  set  of  points  of  P  which  are  not  points  of 
convergence  of  2  W  (an,  /3n)  is  non-dense  in  P,  since  the  corresponding  con- 
dition for  2  W  (an,  /3W)  is  satisfied.  Therefore  the  condition  III,  of  §  464, 
holds  for  the  function  f  (#)  0  (ai). 

For  the  complete  establishment  of  the  theorem,  there  only  remains  to  be 
proved  that: 

If  f(x)  be  summable  in  a  perfect  set  P,  contained  in  an  interval  (a,  /3),  then 


i  r 
F(x~) 

|L 


(P) 

the  summation  being  taken  for  all  the  intervals  (aw,/3n)  contiguous  toP,  it  being 
assumed  that  the  sum  is  absolutely  convergent. 

This  theorem  can  then  be  applied  to  the  perfect  set  contained  in  any  of 
the  intervals  contiguous  to  any  one  of  the  perfect  sets  Plt  P2,  ...,  Pp,  ..., 
employed  in  §  465,  the  conditions  of  the  theorem  being  in  each  case  satisfied. 

Let/  (a;)  be  expressed  by  the  sum  of  the  two  functions  /(#),/>(£•);  where 
/  (#)  =/(#),  in  P,  and/2  (#)  =/(#)  in  a^  interior  points  of  the  intervals  (an,  /3n). 

Let  F,  (a;)  =  I  Vi  (^)  d,v,  and  F2  (x)  =  [  */a  («)  ^»J  thus  F(x)  =  -^ 

J  a  ^  a 
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The  L-integral  I  fl(x)<l>(x)dx  exists,  being  equal  to        fl(x)^>(x)dx; 

J  a.  J   (P) 

also  I  /2  (x)  4>  (x)  dx  exists  as  a  Z)-integral,  being  equal  to  2  I    fz  (x)  <J>  (x)  dx. 

•'fit  J  "  ff 

In  each  interval  (an,  @n),  F\(x)  is  constant,  thus 

[I  /  « 

_J  Ct^j  *>    Q.ft  L. 

Now  [V(0j  </>  (0)T  -  PV(0)  d(j>  (x)  . 

Ja         J  a 


°          •0 


a  a 


+  (*)$(*)      -        ,  («)d^  («); 

\  L  _!«•'* 

the  expression  in  the  first  bracket  on  the  right-hand  side  is  equal  to 

r/s  r 

f\  (^')  <^>  (*)  ^>  °r  to  I     /j  (a?)  <^>  (x)  dx. 

J  a  J  (  P) 

In  order  to  prove  the  theorem,  it  is  then  sufficient  to  shew  that 

i  2 


a  a 


this  is  the  special  case  of  the  original  theorem  which  arises  when  /(#)  =  0, 
over  the  set  P. 

Consider  the  first  m  of  the  intervals  (an,  /37l);  and  let 

(7i>  ^i)»  (7a>  ^a)  •  •  •  (Ym+i>  &m+i),  where  71  =  a,  5W+1  =  /3, 
be  the  intervals  complementary  to  the  intervals  (a1?  ^j),  ...  («m,  /3OT). 

We  have 


H-l  «n 


r-m+1  |~  ~lSr        r=m+l   r& 

=     2       jP8(0)c6-(0)  j    —     2     I 

/•  =  !      L     '  Jrr  ''  =  !     •/> 

right-hand  side  may  be  written  as 

—  FZ  (jr)}  +        2        Fy  (7,.)  {</>  (Sr)  —  <^)  (7,.)} 


rr  '''-I     •'Y 

and  the  expression  on  the  right-hand  side  may  be  written  as 


F.  (x)  d(f>  (x), 

•»•=!      J  y,. 

which  is  equivalent  to 

r/s 

<r)}  +  /     .^m  (#) 


where  i/rrn  (^)  is  defined  to  be  Fz  (7,.)  —  F2  (x},  in  the  intervals  (7,.,  Sr),  and  to 
be  zero  in  the  intervals  (aly  fa),  (a2>  fa),...  (dm,  /3m). 
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_  _ 

The  first  term,  is  numerically  less  than  <f)    2     |  F2  (Sr)  —  F2  (yr)  |,  where  0 

r-l 
r  —  m-f-l 

is  the  upper  boundary  of  <£  (a?)  |  in  (a,  /3);  and    S      ^  (Sr)  —  -^2  (yr)  |  does  not 

r  =  l 

exceed  the  sum  of  the  fluctuations  of  F2  (x)  in  the  intervals 


contained  in  the  intervals  (yr,  &r).  It  follows,  from  the  convergence  of  the 
series  of  fluctuations  in  the  intervals  contiguous  to  P,  that  the  first  term  is 
arbitrarily  small,  if  m  be  sufficiently  great;  thus  the  first  term  is  <  em,  where 
lim  ew  =  0. 

>n~oo 

If  x  is  not  a  point  of  P,  we  have  tym  (x)  =  0,  for  all  sufficiently  large  values 
of  m',  and  if  a?  is  a  point  of  P,  it  is  in  one  of  the  intervals  (7,.,  8r),  where  yr 
converges  to  x,  as  m  is  indefinitely  increased,  and  thus  tym  (x}  converges 
to  zero.  Since  tym  (x)  is  bounded,  for  all  values  of  m  and  x,  we  have,  by  a 
theorem  established  in  §  445, 


r/3  r/3 

lim  I    TJrm  (x)  d(j>  (x}  =      [lira  tym  (#)]  d$  (x)  =  0. 

OT~  oo  J  a  J  a    m~oo 

It  has  now  been  shewn  that  the  limit,  as  ra~oo  ,  of 

[10      r0  «-m  r  r  "i/3n      r^w  ) 

JP.  (0)^(0)      -      F,(x)d$(x)-   2    ]  LP,(0)^(0)       -       F2(x)d^(x) 
Ja         J  a  71  =  1     I   L  J«n        ^ajl  J 


is  zero.    The  theorem  has  therefore  been  established;  and  it  has  been  proved 
that 


which  is  an  extension,  to  the  D-integral,  of  the  theorem  of  §  445,  for  the  L- 
integral. 

475.  As  in  §  445,  it  may  be  shewn  that  the  second  mean  value  theorem 
is  applicable  to  the  integral  of  f  (x)  <j>  (x),  where  f(x)  is  integrable  (D),  and 
<f>  (x}  is  monotone  and  bounded. 

rb 

For     F(x)  d<f>  (x}  =  F(fi)  {(f)  (b)  —  <f>  (a)},  where  ^  is  some  number  in  the 
J  a 

interval  (a,  b);  therefore 


if  we  assume  that  F(a)  =  0.    This  is  equivalent  to 

rb  />  rb 

f(x)  <£  (x)  dx  =  <j>(a)      f(x)  dx  +  (f>  (b)     f(x)  dx. 

J  a  J  a  J  M 
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The  more  general  form  of  the  theorem,  including  Bonnet's  form,  may  be 
deduced,  as  in  §  422.  For  the  application  to  the  representation  of  a  function 
that  is  integrable  (D)  by  trigonometrical  or  other  series,  the  result  here 
obtained  is  of  decided  importance. 


EXTENSIONS   OF   THE   DEFINITION   OF  THE   DENJOY  INTEGRAL. 

476.  A  generalization  of  the  definition,  given  in  §  464,  of  the  Denjoy 
integral  has  been  made  by  Denjoy*  himself,  by  W.  H.  Youngf,  and  by 

oo 

Khintchine^:,  independently  of  each  other.    Instead  of  the  series  2   W(an,  fin), 

«=i 

in  which  the  terms  are  the  fluctuations  of  V(an,  x)  in  the  intervals  (an>  fin), 

00 

the  series  2     V(an,fin)    is  employed.    Thus,  for  the  definition  (4),  in  §  464, 
»i=i 

the  following  is  substituted: 

(4)'  Let  P  be  a  perfect  set  of  points  in  an  interval  (a,  fi),  contained  in 
(a,  b),  and  let  it  be  assumed  that  f  (x)  is  summable  over  the  set  P,  and  suppose 
that  V  (a',  fi')  has  been  defined  for  every  interval  (a',  0'),  of  (a,  fi),  which  con- 
tains no  points  of  6(  as  interior  points.  Let  (an,  fin),  where  n  =  I,  2,  3,..., 
denote  the  intervals  of  (a,  fi)  that  are  contiguous  to  P,  and  let  it  be  assumed 

00 

that  the  series  2  j  V(an,  fin)   is  convergent.    Then  V(a,  fi)  is  defined  by 
n=l 

P(a,j3)=  2  V(an,fin)+!     f(x)dx. 
»=i  J  (py 

The  definitions  (1),  (2),  and  (3),  of  §  464,  are  unaltered. 

Instead  of  the  condition  III,  of  §  464,  the  following  is  substituted  : 

III'.  For  every  perfect  set  P,  if  F(on,  fin)  have  been  calculated  for  every 
interval  (an,  fin)  contiguous  td*P,  the  set  of  points  of  P  which  are  not  points  of 

oo 

convergence  of  2     V  (a.n ,  /3n)  |  is  non-dense  in  P. 


As  before,  a  point  of  convergence  of  2    V(an,  ftn)\  is  one  which  has  a 

n=l 

neighbourhood  such  that  the  series  of  those  terms  for  which  (an,  fin)  is 
interior  to  the  neighbourhood  is  convergent.  Thus  the  neighbourhood  con- 
tains a  part  of  P,  such  that  the  series  is  convergent,  when  taken  for  the 
intervals  contiguous  to  that  part  of  P.  The  conditions  I,  II,  of  §  464,  remain 
unaltered. 

*  Annales  de  Vecole  normale,  (3),  vol.  xxxm,  p.  127,  and  (3),  vol.  xxxiv,  p.  181.  These  memoirs 
are  a  continuation  of  researches  in  the  Journal  de  Math.  (7),  vol.  r,  p.  105,  and  Bull,  de  la  soc. 
Math,  de  France,  vol.  XLIII. 

t  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  175. 

J  Comptes  Rendus,  Paris,  vol.  CLXII,  p.  287. 
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00 

It  will  be  observed  that,  if  the  series  2  TF(an,$n)  is  convergent,  so  also 

n=l 

00 

is  the  series  2    V(an,  /3M)|;  but  that  the  converse  does  not  hold.    It  thus 

w  =  l 

appears  that  the  Z)-integral  is  a  special  class  of  the  functions  V(a,  x)  which 
satisfy  the  wider  set  of  conditions  obtained  by  substituting  (4)'  and  III'  for 
(4)  and  III  respectively. 

When  the  function  f(x)  is  such  that  all  the  definitions  (1),  (2),  (3),  (4)' 
and  I,  II,  III'  are  realized,  the  actual  construction  of  V(a,  b)  is  carried  out 
precisely  in  the  same  manner  as  in  the  case  of  the  D-integral,  no  essential 
change  being  necessary  in  the  proof  of  existence  of  the  integral. 

477.  As  has  already  been  pointed  out  in  §  466,  the  conditions  (3)  and  II 
contain  a  postulation  that  V  (a,  x)  shall  be  a  continuous  function  of  x,  and  it 
was  shewn  that  the  _D-integral  constructed  in  §  464  actually  satisfies  the 
requirement  of  this  postulation.  It  will  however  be  shewn  that  when  the 
new  conditions  (4)'  and  III'  are  substituted  for  (4)  and  III,  the  postulation 
of  continuity  of  V  (a,  x},  or  of  F  (x),  is  not  satisfied  unless  the  function  /(#) 
satisfies  a  certain  condition. 

The  set  H  denotes  the  aggregate  of  the  points  at  which  f(x)  is  not  sum- 
mable,  and  of  the  points  in  the  neighbourhood  of  which  £  j  V(an,  /?„)  is  not 
convergent. 

The  function  V  (a,  x}  is  clearly  continuous  at  -any  point  which  does  not 
belong  to  H,  since  V  (a,  ac  +  h)—  V  (a,  x)  is  an  Z-integral,  for  sufficiently 
small  values  of  h  \  .  Next,  let  G  be  a  perfect  set,  contained  in  an  interval 
(a,  /6),  and  assume  that  f(x}  is  summable  over  G,  and  that  every  point  of  G 
is  a  point  of  convergence  of  the  series  2  V  (an,  (3n)  ,  when  the  summation  is 
taken  for  those  intervals  contained  in  (a,  ft)  which  are  contiguous  to  G. 

We  have 


* 


F  (a,  x  +  h)  -  V  (a,  x)  =  [  f     f(x)  dx  +  2  f  "/(« 

L-MG)  -'«„ 

where  (an,  6}l)  is  contiguous  to  G.    As  A  ~  0,  we  see  that     I      f(x)  dx\      ~  0,  on 

L '  (G)  Ja 

account  of  the  continuity  of  an  //-integral.    In  case  x  and  x  +  h  are  both 

r      rin  ~\x+n 

points  of  G,  \  2       f(x)  dx  also  converges  to  zero,  as  h  ~  0,  in  such  a 

L      J  nn  J  a; 

manner  that  x  +  h  is  always  a  point  of  (r.    But  if  x  +  h  is  not  restricted  to 
be  a  point  of  G,  it  is  in  the  interior  of  some  interval  (am,  bm),  contiguous  to  G, 

[rbn  ~lx+h  rx+h 

2       f(®)  dx         contains  the  term  I       f(®)  dx,  if  h  be  positive, 

/»£> 

or  I       f(x)  dx,  in  case  h  be  negative.    It  may  happen  that,  as  h  converges 

J  x+h 
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rx+h  rbm 

in  any  manner  to  zero,  I        f(x)  dx,  or  I       f(x)  dx,  does  not  converge  to 

J  am  J  x+h 

zero,  and  in  that  case  V  (a,  x)  would  not  be  continuous  at  the  point  x,  of  G. 

rx+h  rv 

We  have    I       f(x)dx   £  Wm,  where  Wm  is  the  upper  boundary  of       f(x)dx 

J  am  J  a' 

for  all  intervals  (a,  V)  contained  in  (am,  bm).  Let  it  be  supposed  that,  for  an 
infinite  sequence  of  values  of  m,  Wm  is  greater  than  some  positive  number  k. 
However  small  A  may  be,  there  will  be  one  of  these  intervals  (ar,  br),  for 
which  Wr  >  k,  in  the  interval  (x,  x  +  h).  Hence,  in  (ar,  br),  there  will  be  two 
points  ar'  =  x  +  h^  and  brf  =  x  +  hz,  for  which  the  terms  considered  differ  by 
at  least  k.  If  we  take  a  sequence  of  values  of  h,  such  that  x  +  h  approaches  x 
along  the  sequence  \ar'},  we  obtain  a  limit  which  differs  by  at  least  k  from 
the  limit  obtained  when  x  4-  h  has  a  sequence  of  values  {&/}.  It  thus  appears 
that  V  (a,  x  +  h)  —  V  (a,  x}  could  not,  for  both  sequences  of  values  of  h,  have 
the  limit  zero.  Hence,  with  the  hypothesis  made,  V (a,  x)  would  be  dis- 
continuous at  the  point  x.  In  order  that  V  (a,  x)  may  be  continuous,  it  is 
therefore  necessary  and  sufficient  that  lim  Wm  =  0.  This  condition  must  be 

m~x 

satisfied  for  all  the  perfect  sets  contained  in  all  the  intervals  contiguous  to 
each  of  the  sets  P1;  P2,  ...  Py,  employed  in  the  construction  given  in  §  464. 

It  thus  appears  that  the  postulation  contained  in  (3)  and  II,  which  is 
equivalent  to  the  postulation  that  the  integral  V  (a,  x)  constructed,  as  in 
§  464,  but  with  (4)'  and  III'  substituted  for  (4)  and  III,  is  satisfied  only  if  the 
upper  boundary  of  V(an,  x},  in  the  interval  (an,  /3n),  converges  to  zero,  as 
n  ~  QO  ,  for  each  of  the  perfect  sets  contained  in  any  interval  contiguous  to 
Py,  where  Py  is  any  one  of  the  perfect  sets  employed  in  the  construction  of 
V(a,b). 

[x 
478.    An  integral  F(x)~  I    f(x)dx,  constructed  in  accordance  with  the 

J  a 

system  of  definitions  and  postulations  (1),  (2),  (3),  (4)',  and  I,  II,  III',  which 
include  the  condition  that  F  (x)  is  a  continuous  function,  will  be  termed  a 
D^fF-integral,  or  Denjoy-Khintchine-Toung  integral.  It  is  clear  that  a 
Z)-integral  is  also  a  DKY-integra\,  but  that  the  converse  is  not  true. 

It  is  shewn,  as  in  §  472,  that  the  sum  theorem  holds*  also  for  the 
ZX^F-mtegral : 

If  /(1)<X>,  /(2)O)  have  DKY -integrals  in  (a,  b),  then  f®  (x)  +/w  (x)  has 
also  a  DKY -integral  in  the  interval,  and 

I   {/  w  (as)  +  f  <2>  (x)}  dx=l   fM  (x)  dx  +  I'  /<2)  (x)  dx. 

J  a  J  a  J  a 

The  proof  given  in  §  466  is  applicable  to  shew  that  the  set  of  those 
points  of  any  given  perfect  set  P,  each  of  which  is  either  a  point  of  non- 

*  This  is  contrary  to  a  statement  of  W.  H.  Young,  loc.  cit.,  pp.  208  and  211. 
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summability  with  respect  to  P,  of  the  function  f(x),  that  is  integrable  (DKT), 
or  else  a  point  of  non-convergence  of  2  F  (/3n)  —  F  (an)  \  taken  for  the  intervals 
(«n>  $n)>  contiguous  to  P,  is  non-dense  in  P. 

The  theorem  that  the  D-integral  F(x)  of  a  function  f(x),  that  is  integrable 
(Z)),  has  almost  everywhere  a  differential  coefficient,  equal  to  f(x),  does  not 
hold  in  general  for  the  _DJT F-integral.  The  corresponding  theorem  for  the 
latter  integral  has  been  formulated  by  Denjoy*,  who  has  introduced  for  the 
purpose  the  conception  of  an  approximate  differential  coefficient ;  thus : 

A  continuous  function  F  (x)  possesses,  at  a  point  XQ,  an  approximate  dif- 
ferential coefficient,  finite,  and  equal  to  F'  (#0),  if  the  set  of  points  x  for  which 

F(x)-F(xQ) 


SC  WQ 

has  the  metric  density  unity  at  the  point  x0,  whatever  value  the  positive  number 
e  may  have. 

The   property  of  the   indefinite  D/fF-integral    is   then   formulated   as 
follows  f : 


If  F(x)  be  the  integral   I    f(x)dx,  of  a  function  f(x)  which  is  integrable 

J  a 

(DKY)  in  the  interval  (a,  b),  F  (x)  has  at  almost  every  point  of  (a,  b),  an 
approximate  differential  coefficient,  or  else  an  ordinary  differential  coefficient, 
equal  to  f(x). 

The  necessary  and  sufficient  conditions  have  been  formulated  by  Khintchine 
(loc.  cit.},  that  the  indefinite  DKY-integra\  F(x)  should  have  almost  every- 
where a  differential  coefficient,  equal  to  f(x), 


THE  YOUNG  INTEGRAL. 

479.  The  most  general  definition  of  a  non-absolutely  convergent  integral 
has  been  given  j  by  W.  H.  Young.  His  definition  is  obtained  by  relaxing  the 
conditions  from  which  it  is  inferred  that  the  DJTF-iiitegral  is  continuous. 
Accordingly  the  Young  integral,  or  F-integral,  is  not  necessarily  a  continuous 
function  of  its  upper  limit.  When  it  is  in  fact  continuous,  it  is  then  a 
DKY- integral.  When  a  is  a  limit  of  points  of  H  on  its  right,  and  ft  is  a 
limit  of  points  of  H  on  its  left,  the  definition  (3),  and  the  assumption  II,  of 
§  464,  are  no  longer  taken  to  hold  for  all  sets  of  intervals  («„,  fin)  such 
that  an  ~  a,  ftn  ~  /3,  but  only  when  an  and  /3n  are  restricted,  for  all  values  of  n, 
to  be  points  of  H,  the  set  of  points  of  non-summability  of  the  function  f(x). 

*  Annales  de  I'ecole  normale,  (3),  vol.  xxxiu,  p.  170. 

t  Ibid.  (3),  vol.  xxxiv,  p.  184. 

J  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  175. 
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Thus  the  postulation  II  is  replaced  by  the  condition 

f/s  /•£» 

f(x)dx  =     [lim]  f(x}dx, 

J  o  an~a.,Pn~fi  J  an 

where  the  square  bracket  is  taken  to  indicate  that  the  limit  does  not 
necessarily  exist  unless  all  the  points  orn,  ftn  belong  to  the  set  H.  When  a  is 
not  a  limiting  point  of  a  part  of  H  on  its  right,  the  points  an  will  be  un- 
restricted, as  in  the  definition  of  the  JMf  F-integral,  and  a  similar  remark 
holds  as  regards  ft.  When  neither  a  nor  ft  are  such  limiting  points,  the 
bracket  may  be  removed  from  the  limit,  as  it  is  then  unrestricted. 

An  effect  of  the  introduction  of  this  restriction,  as  regards  the  mode  of 
approach  of  the  end-points  of  intervals  to  their  limits,  is  that 

F(x)=\Xf(x}dx 

J  a 

is  not  necessarily  continuous  at  a  point  x  which  belongs  to  the  set  H. 
However,  at  such  a  point,  F(x)  always  belongs  to  the  aggregate  of  limits  of 
F  (x  +  h),  or  of  F  (x  -  h),  as  h  ~  0. 

In  order  that  F  (x}  may  be  continuous  in  the  whole  interval  (a,  b),  it  is 
clearly  necessary  that  the  restricted  limits  and  the  unrestricted  limits  should 
in  every  case  give  the  same  result.  Thus  the  F-integral,  when  it  is  con- 
tinuous, is  necessarily  a  DK  F-integral,  but  of  course  not  necessarily  a 
D-integral. 

480.  It  will  be  shewn  that  the  function  F  (x)  constructed,  as  in  the  case 
of  the  DK  F-integral,  or  the  Z)-integral,  with  the  continuity  condition 
modified  in  the  manner  explained  above,  has  the  following  property : 

In  any  closed  interval  (a,  ft),  contained  in  (a,  b),  F  (x)  assumes  all  values 
between  its  upper  and  lower  boundaries  in  the  interval  (a,  ft). 

Let  (7,  8)  be  an  interval  contained  in  (a,  {3),  arid  such  as  to  contain  a 
sequence  of  intervals  (<yn,  8n),  for  which  yn~y,  and  Sn~&,  and  such  that  in 
each  of  the  closed  intervals  (yn,  Bn)  the  condition  is  satisfied  that  F(x)  takes 
any  value  between  its  upper  and  lower  boundaries  in  that  interval. 

Also  let  it  be  assumed  that  ^(7),  F(8)  are  respectively  the  limits  of 
F(<yn),  F(&n).  Let  k  be  any  number  between  the  upper  and  the  lower  boundaries 
of  F(x)  in  the  closed  interval  (7,  8).  If  n  be  sufficiently  great,  k  is  between 
the  upper  and  lower  boundaries  of  F(x)  in  the  interval  (7,1,  Sn),  and  is  there- 
fore the  value  of  F(x)  at  some  point  in  that  interval,  and  therefore  in  (7,  8). 
Thus  the  theorem  holds  for  the  interval  (7,  8). 

Commencing  with  the  fact  that  the  theorem  holds  for  any  interval  (7,  8) 
in  which  f(x)  is  integrable  (L),  and  following  out  the  various  stages  of  con- 
struction of  F  (x)  for  the  interval  (a,  ft),  as  in  §  465,  we  see,  by  continually 
applying  the  method  of  passing  to  the  limit,  that  the  theorem  can  be  estab- 
lished for  the  interval  (a,  ft). 

H.  42 
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481.  The  following  extension  of  the  theorem  of  §  474  will  now  be 
established  : 

If  f  (x)  have  a  Y  -integral  F(x)rin  (a,  b),  which  is  bounded  in  the  interval, 
and  (j>  (x)  be  a  bounded  monotone  function  defined  in  the  same  interval,  then 
f(x)  <f>  (x)  is  also  integrable  (F)  in  (a,  b),  and 


r 
J 


b  r  ~\b        rb 

f(x)  <t>(x)dx=\F  (x)  (f)(x)\   -      F(x)  d(f>  (x}. 

a  L  Ja      J  a 


It  will,  in  the  first  place,  be  shewn  that,  if  F(x)  be  bounded  in  (a,  ft),  and 

r/3» 
such  that  F(x)  d<f>  (x)  exists,  in  accordance  with  the  definition  of  §  445,  for 

•  «* 
each  of  the  intervals  (an,  /Sw),  of  a  sequence  of  intervals  contained  in  (a,  ft), 

r? 
such  that  an  ~  a,  ftn  ~  ft,  then  I    F(x)  d<t>  (x)  also  exists. 

J   a 

Let  £  =  </>  (x),  and  suppose  ^  (£)  to  be  the  function  defined  as  in  §  445,  for 
the  interval  (g  (a),  g  (ft))  of  £.  By  hypothesis,  %  (|)  is  summable  in  each 
interval  (<f>  (aw),  </>(/3n)X  of  £.  Let  E(A,  n)  denote  the  set  of  points  of  this 
interval  for  which  %  (£)  >  A,  where  A  is  an  arbitrarily  chosen  number;  then 
the  set  E(A,  n)  is  measurable.  It  then  follows  that  the  set  E  (A),  of  all 
points  £,  of  the  interval  (0  (a  +  0),  </>  (/3  —  0)),  at  which  %  (£)  >  A,  is  measur- 
able, since  it  is  the  outer  limiting  set  of  the  sequence  of  sets  E  (A,  n),  as  n  is 
increased  indefinitely.  It  follows  that  %  (£)  is  measurable  in  the  interval 
(</>  (a  -f  0),  </>  (ft  —  0))  ;  and  since  %  (£)  is  bounded,  it  is  summable  in  the  inter- 
val. Moreover,  by  a  known  property  of  the  .^-integral,  we  have 


Since  ^  (£)  has  the  value  ^(a)  in  the  interval  ((/>  (a),  <£(a  +  0)),  and  the 
value  F({3)  in  the  interval  (</>(/3  -  0),  (f)  (/?)),  we  have 

-  0)} 


therefore       F(x)d$(x)  exists,  and  has  the  value 


Next,  it  will  be  shewn  that,  if  the  theorem 

[""  F(x)  $  (x)dx  =  \F(x)  cf)  (x)Yn  -  rnF(x)  d$  (x) 

J    an  \_  ja-n          J    an 

holds  for  each  of  the  intervals  (an,  ftn),  it  also  holds  for  the  interval  (a,  ft), 
provided  Ffa)  -  F(a),  F(ftn)  ~F(/3). 
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We  have 
lim  \F  (x)  <f>  (x)  \ftn  =  \F  (x)  <f>  (x)Y  -  F  (ft)  [<j>  (ft)  -</>(£-  0)} 

n~ao    [_  J  «j|          l_  _}  a 

it  follows  that 
|V(a)0(*)]    -[   F(x)d<j)(x)  =  \im\\F(x)(j)(x)\n  -  I  K  F(x)d(j>(x)l. 

J  a         J   a  W~co    (  [_  J  Ojj          J    an  J 

/•/3  f£n 

Since  I  f(x)  (f)  (x)  dx  is  defined  to  be  lim      f(x)(f>(x)dx,  it  being  assumed 

J   o  W~ao  J    an 

that  f(x)  <j>(x)  is  integrable  (F)  in  each  of  the  intervals  (an,  fin),  it  follows 

/•a 

that,  if  I     f(x)  <f>  (x)  dx  exists,  and  is  equal  to 


[1  0w        CPn 
F({K)<j>(as)\     -       F(x 
Jan       J  an 

rfi 
for  every  value  of  n,  then       f(®)  <j>  (*)  dx  exists,  and  is  equal  to 


It  must  now  be  shewn  that  the  condition  III'  is  satisfied  by  the  integrals 
off(x)  <£(#),  when  it  is  assumed  to  be  satisfied  for  the  integrals  off(x). 

If  (crn,  y3n)  be  an  interval,  contiguous  to  the  perfect  set  P,  in  which  the 
integral  f  (x)  $>(x)  has  already  been  constructed,  we  have 


/   i  nr1 1  /T\  ( nr\  /V-r*  —    I    //    /  T*  \  /T\  i  'T*  i  I  H   (  T>\  rt  /T\  i  o^  i 

/    ^^y  Y/  \     /  ^*'t*/  —    1  •*     \     /  V  \     /  I  \     /  ^T^  \     / 

^  0^1  L  Jan       ^  an 

=  ( JF (£B)  -  ^(«M)}  0  (/3B)  +  F  («B)  (0  (/8B)  -  </>  (OB))  - 

If  F  denote  the  upper  boundary  of  F(x)  in  all  the  intervals  (an,  ftn) 
contiguous  to  P,  and  ^  is  the  upper  boundary  of  (f)  (x)  in  (a,  b),  we  have 

{Pn 
•    an 

it  being  assumed  that  (/>  (x)  is  non-decreasing. 

00 

It  follows  that,  if  2  |  F  (ftn)  —  F(an)  ]  is  convergent,  so  also  is 


It  is  thence  seen  that,  if  the  set  of  points  of  a  perfect  set  which  are  points 
of  divergence  of  the  sum  of  the  absolute  values  of  the  integrals  off(x)  taken 
over  the  intervals  contiguous  to  the  perfect  set,  is  non-dense  in  the  perfect 
set,  the  same  condition  holds  as  regards  the  absolute  values  of  the  integrals 

42 ii 
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off(x)  <f>  (x).  For,  if  P  be  the  part  of  the  perfect  set  contained  in  any  inter- 
val (a,  ft),  such  that  the  sum  of  the  absolute  values  of  the  integrals  of  f(x) 
over  the  intervals  contained  in  (a,  ft),  contiguous  to  P,  is  convergent,  it  has 
been  shewn  that  the  sum  of  the  integrals  of  f(x)  <f>(x)  is  absolutely  conver- 
gent. Thus  the  condition  III'  is  satisfied  for  the  function  f(x)  <£  (x). 

Lastly,  it  must  be  shewn  that,  if  f(x)  be  summable  in  a  perfect  set  P 
contained  in  an  interval  (a,  ft),  then 

/3         p3  °°     (T  "T3"         f9"  ) 

-     F(x)d$(x)  =  2  \\F(x)<l>(x)\     -      F(x)d<t>(x)\ 

a         J  a  n  =  l    (\_  J^          Jo,,  J 


(P) 

the  summation  being  taken  for  all  the  intervals  (an,  ftn)  contiguous  to  P,  it 
being  assumed  that  the  sum  on  the  right-hand  side  is  absolutely  convergent. 
The  proof  of  the  corresponding  theorem  for  the  D-integral,  given  in  §  474,  is 
applicable  to  the  present  case. 

We  are  now  in  a  position  to  carry  out  all  the  stages  of  the  construction  of 
the  Y-  integral  of  f(x)  <j>(x),  corresponding  to  the  various  stages  in  the  con- 
struction of  the  F-integral  of  f  (ac).  It  has  been  shewn  that  the  construction 
in  the  former  case  can  always  be  carried  out  when  it  can  be  carried  out  in  the 
latter  case,  and  that  at  each  stage  the  relation 

I  V(»  </>  O)  dx  =  \F(x)  0  (aO?  -  jft  F(x)  d<f>  (x) 
holds  good.    The  theorem  is  therefore  completely  established. 

482.  It  has  been  shewn  that  F(x)  assumes,  in  any  interval,  all  the  values 
between  its  upper  and  lower  boundaries  in  that  interval.  Thus  if  U,  L  be  the 
upper  and  lower  boundaries  of  F  (x)  in  (a,  6),  we  have 


{</>  (6)  -  </>  (a)}  ^  I'  F(x)  d<f>  (x)  ^U{<t>(b)-<i>  (a)}  ; 

J  a 


L 

it  follows  that 


where  x  is  some  number  such  that  a  £  x  ^  b. 
It  now  follows  that 

f(x)  0  0)  dx  =  $  (a)  !  X  f(x)  dx+(f>  (b)  I   f(x)  dx, 

a  J  a  J  x 

which  is  the  second  mean  value  theorem  for  the  function  f(x),  which  is  inte- 
grable  ( Y).    The  more  general  form  of  the  theorem  may  be  deduced,  as  in  §  422. 
It  has  been  assumed  throughout  that  F(x)  is  bounded  in  the  interval 
(a,  b),  and  it  is  therefore  subject  to  this  condition  that  the  existence  of 


as  given  by  the  formula  for  integration  by  parts,  has  been  established. 
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It  may  however  happen  that,  in  the  final  stage  of  the  construction  of  the 
F-integral  of  a  function  f(x),  infinite  discontinuities  of  the  function  F(x} 
appear.  Thus  there  exist  F-integrals  which  possess  points  of  infinite  dis- 
continuity, and  are  thus  unbounded. 

It  can  be  seen  that  the  theorem 

f  "  {/i  (*)  +/.  («))  dx  =  f /i  <*)  **  +  f /•  (*)  dm 

J  a  J  a  J  a 

does  not  necessarily  hold  for  F-integrals,  unless  both  the  integrals  on  the 
right-hand  side  are  DK  F-integrals.  If,  for  example  the  set  H,  for  /i(#), 
consists  of  the  sequence  c1(  c2,  ...  cn,  ...  converging  to  zero,  and  the  corre- 
sponding set  for  fz  (x)  consists  of  a  different  sequence  c/,  C2',  . . .  cn',  . . .  also 

r1  f1  r1 

converging  to  zero,  I  yi  (x)  dx  is  defined  as  lira  /  yj  (x)  dx,  and  I  /2  (x}  dx  is 

J  o  «~oo  J  cn  Jo 

rl  ri 

defined  as  lim       fz(x)dx.   But  I     {/i(#)+/2(#)}  dx  would  be  defined  as  the 

w~oo  .'  Cn  J  0 

ri 
limit  of    I    {/!  (x)  +  /2  (#)}  dx,   where  c   has  values   belonging  to  both   the 

J  c 

ri 

sequences  |cn},  {c,/}.    Since  it  is  not  necessarily  the  case  that        f-^(x)dx  con- 

J  cn 
~\  rl  rl 

verges  to      /j  (x)  dx,  or  that  |   /2  (®)  dx  converges  to   I  fa  (x)  dx,  it  follows 

.0  J  cn  JO 

that  the  sum  theorem  does  not  necessarily  hold  for  F-integrals  which  do  not 
belong  to  the  class  of  DK  F-integrals. 


EXAMPLES. 

•  ^ 

OC          JC  X 


12         1 

1  *.    Let  /  (#)«=  2x  sin  —  •  —  cos  -^  ,  for  0  <  x  £  a,  and  /  (0)  =  0.    We  have 


f  (x^i  =  -j~  ( 
dx  \ 

(a 
for  0  ^.x  <«.          |  (f)  (x}  |  dx  increases  indefinitely  as  e~0,  and  therefore  /(.#)  is  not  sum- 

fa  \~  l~\a  fa 

mable  in  the  interval  (0,  a).    But    /    f(x)dx  =  \x2sin—2\   ;  and  thus  I    /  (x)  dx  exists  as 
a  ZMntegral,  which  is  also  an  ZTZ-integral. 

d 


2t.    Let  < 

in  the  interval  (a,  /3),  where  m^2.    The  function  $ap(x)  is  not  summable  in  the  neigh- 

//3 
^ofl  (*')  ^  exists  as  a  /^integral,  which  is  also  an 

//Z-integral,  and  it  has  the  value  zero.    Let  H.  be  any  non-dense  closed  set  of  points  in  the 
interval  (a,  6)  ;  and  let/(.r)  have  the  value  zero  at  each  point  of  H,  and  in  each  interval 

*  See  Denjoy,  Annales  de  Vecole  normale,  (3),  vol.  xxxiv,  p.  181. 
t  Ibid.,  p.  208. 
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i"fl 

(a,  0)  contiguous  to  ff,  let  it.  have  the  value  <£a/3  (x).    The  integral  I    <£  (x)dx  exists,  and 

J  "• 

/X 
f(x)dxin 

a 
(a,  /3)  is  <i(/3-a)2.    It  follows  that  the  sum  of  the  fluctuations  of  J    /(*)<&  in  all 

the  intervals  is  absolutely  convergent.    It  is  now  easily  seen  that,  if  G  be  the  nucleus  of  H, 

/a 
f(x]  dx  exists,  and  has  the  value  zero,  in  every  interval  (an,  /3W)  contiguous 
<hi 

to  G  ;  and  moreover  the  sum  of  the  fluctuations  of  the  integrals  f    f(x}  dx,  in  the  inter- 

J  *n 

vals  (aw,  £„),  is  absolutely  convergent.  Therefore  f(x)  has  a  .D-integral,  which  has  the 
value  zero  in  any  interval  of  which  the  end-points  belong  to  H. 

3*.  Let  G  be  a  perfect  set,  of  measure  zero,  in  the  interval  (a,  b).  In  any  interval 
(an,  0n)i  contiguous  to  G,  let/(#)  =  l/(/3B-  a.J,  for  are<  ^  <  ita+flj,  and/»  =  -  l/(/3n-  an), 
for  %(an  +  $n)^x  </3n  ;  and  let/(#)  have  any  set  of  values  over  G. 

We  have  /    nf(x}dx—Q,  for  all  values  of  n;  also  I  /(a?)cte«=|.    It  thus  appears 

•/  an  J  "n 

that  /(a;)  is  not  integrable  (D\  because  every  point  of  G  is  a  point  of  non-convergence  of 
the  sum  of  the  fluctuations  of  the  integral  in  the  contiguous  intervals.  At  the  middle 
point  of  any  interval  (an,  /3re),  we  have  F(x)  =  %;  hence,  at  any  point  of  G  which  is  a' 
limiting  point  on  both  sides,  we  have 


At  a  point  of  G  which  is  an  end-point  of  a  contiguous  interval,  F(x]  is  discontinuous  on 
the  side  away  from  the  contiguous  interval.  The  integral  F(x)  is  accordingly  a  F-integral, 
which  is  not  a  ^)A'Ir-integral. 

4+.    Let  C  denote  Cantor's  perfect  set,  of  measure  zero,  defined  in  the  interval  (0,  1) 

(see  §  83,  Ex.    1).      In  an  interval  (an,   /3H),  contiguous  to  C,  let  /(#)  =  —  73-   —  •,,  for 

m  (PH  ~  an) 

an<x<^(an  +  l3n),  and/(^}=  --  —  --     for  ^  (an+(3n)  <^</3n;  where  m  is  such  that 

m  \Pn      an) 

(i)m=/3ji-an-    Over  the  set  C,f(x]  may  have  arbitrarily  assigned  values. 

f  fin 

Since   I      f(x)  dx  =  0,  we  have  F(x)  =  0,  at  all  points  of  C;  and  in  (an,  /3n),  the  maxi- 
/  «» 

C  x  I 

mum  value  of  I     /  (x)  dx  is  —  . 

J  »n  2m 

The  values  of  F(x  +  Q),  F(x  —  Q\  at  a  point  x,  of  C,  are  the  limits  of  the  numbers  —  , 


for  the  contiguous  intervals  in  the  neighbourhoods  on  the  right  and  left  of  the  point  ;  and 
these  limits  are  both  zero,  since  m  increases  indefinitely  with  n  ;  therefore  F(x)  is  a  con- 
tinuous function.  The  integral  is  not  a  .ZWntegral  ;  for,  if  x  be  any  point  of  (7,  there  are 
contained  in  any  neighbourhood  of  x,  contiguous  intervals  for  which  m  has  all  values  which 
exceed  some  fixed  value  ;  and  thus  2  Wn  diverges  at  every  such  point  x.  To  see  this,  we 
observe  that  an  integer  r  can  be  so  determined  that  the  distance  of  x  from  the  nearer  end- 
point  of  a  given  neighbourhood  d,  of  x,  is  >  3~r  ;  and  thus  d  will  contain  two  consecutive 
divisions,  when  the  whole  interval  (0,  1)  is  divided  into  3''  +  1  equal  parts.  We  may  suppose 
the  intervals  (an,  /3n)  to  be  arranged  in  descending  order  of  their  lengths,  and  such  that 

*  W.  H.  Young,  Proc.  Lond.  Math.  Soc.  (2),  vol.  xvi,  p.  209. 
t  See  W.  H.  Young,  loc.  cit.,  p.  199. 
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those  which  are  of  equal  length  are  arranged  in  their  order  from  left  to  right.  The  sub-set 
of  C  in  any  of  the  3r  +  1  equal  parts  is  similar  to  C,  so  that  the  values  of  m,  for  those 
contiguous  intervals  that  are  interior  to  d,  will  contain  all  integers  from  and  after  some 
fixed  one. 

Every  point  of  O  is  a  point  of  convergence  of  the  sum  of  the  absolute  values  of  the 
integrals  of  f(x)  taken  over  all  the  contiguous  intervals  (an,  /3n),  since  all  these  integrals 
have  the  value  zero.  It  follows  that  F(x}  is  a  Z)/f  F-integral. 

5*.  Let/(#)  be  the  function  denned  in  Ex.  4,  and,  denoting  |  (an4-/3m)  by  yn,  let  </>  (x) 
be  defined  in  each  interval  (yn,  /3n)  by  the  same  rule  as  that  by  which  f(x)  was  defined  in 
the  interval  (0,  1)  ;  and  let  (f>  (#)  =  0,  at  all  points  not  in  an  interval  (yn,  /3n).  The  points 
of  non-summability  of  $  (x)  form  the  set  C,  which  consists  of  (7,  together  with  a  set 
similar  to  C  in  each  of  the  intervals  (yn,  /3n).  It  is  clear  that  the  set  C  is  non-dense  in  the 
set  C.  The  points  of  non-summability  of  the  function  /(#)  +  $  (x)  are  the  same  as  those 
of  <f)  (x)  ;  that  is  they  form  the  set  C. 

We  have  I      {/(••£)  +  $  (#)}  dx=  I   "  '/(#)  dx=*  —  ;  and  if  (a^,,  fim}  denote  those  inter- 

J  an  J  «n  zm 

vals  contiguous  to  0  (x)  that  are  in  (an,  j9n),  we  have 


where  r  depends  on  s,  as  m  does  on  n. 

Those  points  of  C  that  belong  to  C  are  points  of  divergence  of  the  sum  of  the  absolute 
values  of  the  integrals  of  /(#)  +  </>  (#),  taken  over  the  contiguous  intervals  (aw,  yn),  (a^,  /3ng), 
because,  in  any  interval  8  which  encloses  such  a  point,  there  is  an  indefinitely  great 
number  of  the  intervals  (aM,  y,4),  and  the  sum  of  the  series  of  terms  I  /2m,  taken  for  these 
intervals,  diverges.  This  is  however  not  the  case  for  those  points  of  C  which  do  not  belong 
to  C,  because  such  a  point  is  interior  to  an  interval  (yn,  /3re),  or  is  at  the  end  yn  of  such  an 


interval,  and  the  series  2 


I    "*  {/(#)  +  <£(#)}  dx   is  equal  to  —  —  - 
J  «?w  in  (pn  -  a 


on»),  which 


does  not  exceed  -  —  .    The  set  C  being  non-dense  in  C,  it  now  follows  in  accordance  with 
2m 

ri 
the  theorem  of  §  478,  that  the  integral  I    {/(#)  +  $  (x)}  dx  exists  as  a  Z)JOr-integral,  and 

Jo 

ri  ri 

is  equal  to  |    f(x}dx+\    tf>  (x}  dx,  which  has  the  value  zero. 
Jo  Jo 

6  1.    Let  G  be  a  perfect  set  of  content  zero,  in  the  interval  (a,  b),  and  let 

f(x)  =  nl(ln-an\  for  an^x  <%(an  +  bn), 

f(x}=-nj(bn-an\  for  \(an  +  bn}^x^bn, 

where  the  intervals  («„,  bn)  are  all  contiguous  to  G.  The  function  f(x)  may  be  defined  arbi- 
trarily in  those  points  of  G  which  are  limiting  points  on  both  sides.  The  function  F(x)  is 
zero  at  all  the  points  of  (7,  and  it  is  continuous  in  each  of  the  closed  intervals  (an,  &„).  The 
value  of  F(x}  at  the  point  £  (an  +  bn)  is  \n  (an  +  bn}l(bn  -  an).  Thus  at  each  point  of  G,  F  (x) 
has  an  infinite  discontinuity  ;  at  a  point  x  which  is  a  limiting  point  of  G  on  both  sides,  we 
have  ^O  +  0)=  +co,  F(x  +  0')  =  Q,  F(x-Q)  =  0,  F  (x-Q)=  -  QO  .  Thus  F(x)  is  a  F-integral, 
with  a  set  of  points  of  infinite  discontinuity. 

*  See  W.  H.  Young,  loc.  cit.,  pp.  37,  38.  The  result  there  given  is  not  in  accordance  with  that 
in  the  text  ;  it  is  there  stated  that  the  series  of  integrals  diverges  in  the  neighbourhood  of  every 
point  of  C,  and  thus  that  f(x)  +  <p(x)  has  no  DAT-integral. 

t  See  W.  H.  Young,  loc.  cit.,  p.  205. 
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Maxima  and  minima  of,  of  two  variables,  416 ;  linear  in  each  interval  of  a  set,  461 

Generation,  Cantor's  principle  of,  90 
Graphs,  Functions  defined  by,  367 

Heine-Borel  theorem,  102 

Incrementary  ratios,  354 ;  Boundaries  of,  359 

Indeterminate  forms  ;  Evaluation  of,  337 

Infinitesimals ;  Non-existence  of,  41 ;  Veronese's  theory  of,  56 

Integers ;  Sets  of  sequences  of,  152 

Integrable  functions;  Particular  cases  of,  439;  null-functions,  451 

Integral  Calculus,  Fundamental  theorem  of,  431,  452,  548,  636 

Integral  numbers,  Operations  on,  1 1 

Integral,  Riemann,  432  ;  Upper  and  lower  Riemann,  433 ;  Conditions  of  existence  of,  436 ; 
Properties  of  definite  Riemann,  442;  Riemann,  of  functions  of  two  or  more  variables,  446 ; 
Cauchy's  definition  of  improper,  464 ;  Riemann,  over  unbounded  interval,  468 ;  Change 
of  variable  in  a  single  Riemann,  476 ;  Repeated  Riemann,  479 ;  Improper  double,  488  ; 
Double,  over  an  infinite  domain,  495 ;  Transformation  of  double,  491) ;  Riemann- 
Stieltjes,  506 ;  Upper  and  lower  Riemann-Stieltjes,  509 ;  Lebesgue,  of  a  measurable 
function,  519;  Other  definitions  of  an,  526;  Lebesgue,  as  measure  of  a  set  of  points, 
528 ;  the  Riemann,  as  a  Lebesgue  integral,  529 ;  Lebesgue,  as  function  of  a  set  of 
points,  530;  Equivalent  Lebesgue,  532  ;  Properties  of  Lebesgue,  533;  Indefinite,  540; 
Upper  and  lower  semi-,  547  ;  Total  variation  of  an  indefinite,  557  ;  Generalized  inde- 
finite, 416;  Indefinite,  of  a  function  of  two  variables,  560;  Repeated  Lebesgue,  573; 
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Approximate  representation  of,  as  a  Riemann  sum,  585 ;  Lebesgue,  over  an  unbounded 
field,  589 ;  Change  of  independent  variable  in  a  Lebesgue,  592 ;  Harnack's  definition 
of  an,  599;  Lebesgue-Stieltjes,  605;  Bellinger's,  609;  Non-absolutely  convergent, 
616;  Harnack-Lebesgue,  616;Denjoy,  629;  Properties  of  the  Denjoy,  647;  Denjoy- 
Khintchine- Young,  655 ;  Young,  656 

Integration ;  Geometrical  interpretation  of,  440 ;  Double,  by  parts,  461,  563 
Intervals,  Open  and  closed,  51 ;  Sets  of,  96  ;   Lebesgue  chain  of,  109 
Irrational  numbers,  19 ;  Dedekind's  theory  of,  23 ;  Cantor's  theory  of,  27 

Limits,  Arithmetical  theory  of,  36 ;  Method  of,  37 
Lines  of  invariability,  326 

Magnitude,  Theory  of,  20,  53 

Maxima  and  Minima,  283,  326  ;  of  a  function  of  two  variables,  416 
Mean  value  theorem,  of  Differential  Calculus,  335  ;  of  Integral  Calculus,  564,  652,  660 
Measure,  Problem  of,  158;  of  open  and  closed*  sets,  159;  Exterior  and  Interior,  165;  of 
limits  of  sequences  of  closed  sets,  Jordan's,  183 ;  of  sets  related  to  a  system  of  sets,  174 
Measurement,  52 
Metric  density,  178 

Negative  numbers,  17 

Nets,  Systems  of,  65 

Null-function,  Point-wise  discontinuous,  306 

Order,  Notion  of,  2 

Order-functions,  227 

Order-types,  203  ;  Addition  and  Multiplication  of,  204  ;  of  continuum,  209  ;  77,  6,  IT,  208          « 

Ordinal  numbers,  Definition  of,  4,  216 ;  Transfinite,  88 ;  of  second  class,  91,  218 ;  Limiting, 

218;  Arithmetic  of,  225  ;  Sum  and  product  of,  217 
Oscillation,  of  function  at  a  point,  284  ;  of  function  in  an  interval,  328 

Rational  numbers,  Aggregate  of,  19 ;  Section  of  aggregate  of,  23 

Real  numbers,   Dedekind's  definition,  24 ;    Cantor's  definition,  29 ;    Operations  on,  32 ; 

Convergent  sequences  of,  34;  Equivalence  of  two  definitions  of,  39  ;  Representation  of, 

45 ;  everywhere  dense,  50 
Rectifiable  curves,  318 
Repeated,  limits  of  functions,  381 ;  Existence  and  equality  of,  limits  of  functions,  384 ; 

integrals,  479,  573 

Saltus,  283  ;  -function,  291 

Schwarz's  theorem,  347 ;  on  partial  differential  coefficients,  401 

Semi-continuous  functions,  290 

Sequences,  Simple,  6;   Convergent,  27;   Ascending  and  descending,  205;   Double,  526; 

Monotone  double,  536 
Sets  of  intervals,  or  cells ;  Limiting  point  of,  63 ;  Properties  of,  96  ;  External,  internal,  and 

semi-external  points  of,  101 
Sets  of  points,  58;  Analysis  of,  123;  Closed,  73,  94,  112,  120,  142;  Counex,  142;  Degrees 

of  points  in,  123;  dense  in  themselves,  73;  Derivatives  of,  71,  78;  Detached,  142  ; 

Everywhere   dense,    74;    Frontier  of,    75,    141;    Improper   limiting  point    of,    70; 

Inner  limiting,  128;  Interior  points  of,  74;  Irreducible,  120;  Isolated,  73;  Limiting 
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points  of,  67;  Measurable,  166;  Non-dense,  74;  Non-dense  cloud,  112;  Non-dense 
perfect,  112;  of  first  and  second  species,  71;  of  first  and  second  categories,  129; 
Open,  75,  143;  Perfect,  73;  Reducible,  120;  Residual,  131,  132;  Sequences  of  closed, 
94, 175  ;  Transfinite  derivatives  of,  95  ;  Unbounded,  60 ;  Upper  and  lower  boundaries 
of,  59 ;  of  cardinal  number  of  continuum,  232 

Square,  Representation  of,  in  linear  interval,  424 

Systems  of  Nets,  65 

Total  fluctuation,  311 

Unenumerable ;  Proof  that  continuum  is,  81 
Unity,  Notion  of,  1 

Variable,  Real,  256 
Variation,  307,  321 
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